home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latin-UDante: POS Tags: DET

There are 50 DET lemmas (1%), 331 DET types (3%) and 3647 DET tokens (7%). Out of 16 observed tags, the rank of DET is: 7 in number of lemmas, 6 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: hic, ille, omnis, alius, ipse, suus, unus, quidam, noster, totus

The 10 most frequent DET types: hoc, illa, illud, unum, hec, ipsum, omnes, aliud, sua, quedam

The 10 most frequent ambiguous lemmas: hic (DET 579, ADV 16), quicumque (DET 22, PRON 3), qui (PRON 1484, DET 7), quantum (SCONJ 31, ADV 4, DET 4)

The 10 most frequent ambiguous types: quantum (DET 40, SCONJ 25, ADV 2), hic (DET 23, ADV 14), una (DET 27, ADV 1), sola (DET 19, NOUN 1), sui (DET 20, PRON 16, X 1), aliqua (DET 16, PRON 5), solo (DET 14, NOUN 1), mea (DET 10, X 1), toto (DET 11, NOUN 1), aliquo (DET 10, PRON 7)

Morphology

The form / lemma ratio of DET is 6.620000 (the average of all parts of speech is 2.129200).

The 1st highest number of forms (17) was observed with the lemma “hic”: Harum, hac, hanc, has, hec, hee, hi, hic, hii, hiis, his, hoc, horum, hos, huic, huius, hunc.

The 2nd highest number of forms (16) was observed with the lemma “multus”: multa, multarum, multas, multe, multi, multis, multorum, multos, permultis, perplures, plura, plures, pluribus, plurima, plurium, plus.

The 3rd highest number of forms (13) was observed with the lemma “alius”: alia, aliam, aliarum, alias, alie, alii, aliis, alio, aliorum, alios, aliud, alium, alius.

DET occurs with 16 features: PronType (3647; 100% instances), InflClass (3606; 99% instances), Case (3597; 99% instances), Gender (3597; 99% instances), Number (3597; 99% instances), Person[psor] (496; 14% instances), Poss (496; 14% instances), Form (398; 11% instances), NumType (385; 11% instances), Reflex (255; 7% instances), Number[psor] (241; 7% instances), Compound (181; 5% instances), NumValue (175; 5% instances), Polarity (84; 2% instances), Degree (21; 1% instances), Proper (9; 0% instances)

DET occurs with 38 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Compound=Yes, Degree=Abs, Degree=Cmp, Form=Emp, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=IndEurA, InflClass=IndEurI, InflClass=IndEurO, InflClass=IndEurX, InflClass=LatPron, NumType=Card, NumValue=1, NumValue=2, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Poss=Yes, PronType=Con, PronType=Dem, PronType=Ind, PronType=Prs, PronType=Rel, PronType=Tot, Proper=Yes, Reflex=Yes

DET occurs with 469 feature combinations. The most frequent feature combination is Case=Acc|Gender=Neut|InflClass=LatPron|Number=Sing|PronType=Dem (161 tokens). Examples: hoc, illud, istud

Relations

DET nodes are attached to their parents using 37 different relations: det (1902; 52% instances), nsubj (366; 10% instances), obl (360; 10% instances), obj (206; 6% instances), conj (184; 5% instances), nsubj:pass (108; 3% instances), obl:arg (107; 3% instances), nmod (106; 3% instances), root (46; 1% instances), advcl:cmp (30; 1% instances), advcl:pred (30; 1% instances), advcl (25; 1% instances), orphan (20; 1% instances), acl:relcl (19; 1% instances), obl:agent (16; 0% instances), advcl:relcl (15; 0% instances), cc (15; 0% instances), csubj (13; 0% instances), xcomp (13; 0% instances), conj:expl (12; 0% instances), ccomp (10; 0% instances), csubj:pass (6; 0% instances), amod (5; 0% instances), advmod (4; 0% instances), nsubj:outer (4; 0% instances), obl:lmod (4; 0% instances), ccomp:relcl (3; 0% instances), ccomp:reported (3; 0% instances), csubj:relcl (3; 0% instances), obl:cmp (3; 0% instances), flat:redup (2; 0% instances), obl:tmod (2; 0% instances), acl (1; 0% instances), advmod:lmod (1; 0% instances), appos (1; 0% instances), det:numgov (1; 0% instances), nsubj:cleft (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (1835; 50% instances), VERB (1187; 33% instances), ADJ (221; 6% instances), DET (180; 5% instances), PRON (76; 2% instances), PROPN (55; 2% instances), (46; 1% instances), ADV (20; 1% instances), NUM (17; 0% instances), AUX (7; 0% instances), PART (2; 0% instances), X (1; 0% instances)

2531 (69%) DET nodes are leaves.

607 (17%) DET nodes have one child.

224 (6%) DET nodes have two children.

285 (8%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 45 different relations: case (473; 21% instances), punct (349; 16% instances), acl:relcl (232; 10% instances), conj (161; 7% instances), cop (148; 7% instances), cc (146; 7% instances), nsubj (90; 4% instances), orphan (80; 4% instances), det (73; 3% instances), mark (69; 3% instances), nmod (47; 2% instances), obl (43; 2% instances), discourse (41; 2% instances), advcl (32; 1% instances), conj:expl (32; 1% instances), advmod:emph (31; 1% instances), acl (30; 1% instances), advcl:cmp (28; 1% instances), advmod:neg (26; 1% instances), advmod (18; 1% instances), fixed (16; 1% instances), csubj (12; 1% instances), amod (11; 0% instances), flat (7; 0% instances), csubj:cleft (6; 0% instances), xcomp (5; 0% instances), advcl:pred (4; 0% instances), advmod:lmod (4; 0% instances), advcl:relcl (3; 0% instances), advmod:tmod (3; 0% instances), obl:arg (3; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), ccomp:reported (2; 0% instances), cop:outer (2; 0% instances), flat:redup (2; 0% instances), nummod (2; 0% instances), vocative (2; 0% instances), csubj:relcl (1; 0% instances), dislocated:obj (1; 0% instances), nsubj:outer (1; 0% instances), obj (1; 0% instances), obl:agent (1; 0% instances), obl:cmp (1; 0% instances), obl:tmod (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: ADP (472; 21% instances), PUNCT (349; 16% instances), VERB (313; 14% instances), NOUN (225; 10% instances), DET (180; 8% instances), AUX (170; 8% instances), CCONJ (160; 7% instances), ADV (78; 3% instances), PRON (77; 3% instances), SCONJ (71; 3% instances), ADJ (64; 3% instances), PART (54; 2% instances), PROPN (16; 1% instances), X (8; 0% instances), NUM (5; 0% instances), INTJ (2; 0% instances)