home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Low_Saxon-LSDC: POS Tags: DET

There are 25 DET lemmas (1%), 115 DET types (2%) and 2221 DET tokens (10%). Out of 16 observed tags, the rank of DET is: 12 in number of lemmas, 8 in number of types and 5 in number of tokens.

The 10 most frequent DET lemmas: de, en, syn, myn, et, al, disse, ear, keyn, uns

The 10 most frequent DET types: de, en, den, dat, et, myn, dee, dem, syn, syne

The 10 most frequent ambiguous lemmas: de (DET 1303, X 1), en (DET 368, PART 1), myn (DET 80, PRON 4), et (PRON 205, DET 61), al (ADV 45, DET 36, PRON 26), disse (DET 36, PRON 4), keyn (DET 27, PRON 5), uns (DET 27, PRON 1), geyn (DET 21, PRON 3), dyn (DET 20, PRON 1)

The 10 most frequent ambiguous types: en (DET 273, CCONJ 170, PRON 11, PART 1), den (DET 208, PRON 20), dat (SCONJ 175, PRON 148, DET 116), et (PRON 189, DET 78), myn (DET 55, PRON 9), dee (PRON 139, DET 46), dem (DET 54, PRON 3), syn (DET 51, AUX 29, VERB 1), der (ADV 50, DET 39, PRON 23, X 2), ne (DET 37, PRON 6, PART 2)

Morphology

The form / lemma ratio of DET is 4.600000 (the average of all parts of speech is 1.500000).

The 1st highest number of forms (24) was observed with the lemma “de”: ‘m, ‘n, ‘r, ‘t, dat, de, deam, dean, dear, deas, dee, dem, den, der, des, det, en, er, et, m, me, myn, n, t.

The 2nd highest number of forms (11) was observed with the lemma “disse”: dease, disse, dissen, dit, düs, düsse, düssem, düssen, düäse, düäsen, düäser.

The 3rd highest number of forms (11) was observed with the lemma “en”: ‘ne, den, e, en, eyn, eyne, eynem, eynen, eyner, ne, nen.

DET occurs with 9 features: Case (2205; 99% instances), Number (2201; 99% instances), PronType (2195; 99% instances), Gender (2152; 97% instances), Definite (1712; 77% instances), Poss (309; 14% instances), Person[psor] (35; 2% instances), Number[psor] (34; 2% instances), Gender[psor] (17; 1% instances)

DET occurs with 31 feature-value pairs: Case=Acc, Case=Acc,Dat, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Masc,Neut, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Ind,Neg,Tot, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 234 feature combinations. The most frequent feature combination is Case=Nom|Definite=Def|Gender=Masc|Number=Sing|PronType=Art (183 tokens). Examples: de, dee, der, Gyn, den, ne

Relations

DET nodes are attached to their parents using 7 different relations: det (2189; 99% instances), det:poss (24; 1% instances), nsubj (3; 0% instances), amod (2; 0% instances), advmod (1; 0% instances), fixed (1; 0% instances), nmod (1; 0% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (2080; 94% instances), ADJ (67; 3% instances), PROPN (37; 2% instances), PRON (26; 1% instances), VERB (6; 0% instances), NUM (4; 0% instances), X (1; 0% instances)

2170 (98%) DET nodes are leaves.

51 (2%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 5 different relations: advmod (24; 47% instances), nmod:poss (13; 25% instances), punct (12; 24% instances), case (1; 2% instances), nmod (1; 2% instances)

Children of DET nodes belong to 6 different parts of speech: ADV (24; 47% instances), PUNCT (12; 24% instances), NOUN (9; 18% instances), PROPN (4; 8% instances), ADP (1; 2% instances), PRON (1; 2% instances)