home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: DET

There are 69 DET lemmas (1%), 923 DET types (3%) and 8182 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: тотъ, весь, свой, твой, мой, сей, нашъ, который, иной, всякий

The 10 most frequent DET types: того, всеа, своего, которые, мои, все, те, сей, тех, тѣхъ

The 10 most frequent ambiguous lemmas: весь (DET 1100, NOUN 5), сей (DET 492, PRON 1), другой (DET 134, ADJ 3), самъ (DET 121, NOUN 1), кой (DET 36, PRON 2), единъ (NUM 45, DET 7), одинъ (NUM 149, DET 7, ADJ 3), премногий (ADJ 3, DET 2), злодѣй (NOUN 3, DET 1), смѣкальный (ADJ 8, DET 1)

The 10 most frequent ambiguous types: того (DET 219, PRON 125), все (DET 116, ADV 3), то (PRON 196, DET 91, SCONJ 33, PART 8), та (DET 80, PART 2), тое (DET 77, PRON 1), вся (DET 62, PRON 3), т. (DET 62, PRON 3), тем (DET 62, PRON 13), тому (DET 58, PRON 54), тѣмъ (DET 57, PRON 12)

Morphology

The form / lemma ratio of DET is 13.376812 (the average of all parts of speech is 2.481645).

The 1st highest number of forms (61) was observed with the lemma “весь”: ве(с), вес, вес[ь], весъ, весь, вс[ѣмъ], все, все(м), все[мъ], все[х]ъ, всеа, всево, всег[о], всего, всее, всеи, всей, всем, всеми, всему, всемъ, всемꙋ, всех, всехъ, всею, всея, всеѣ, всеꙗ, вси, всимъ, всихъ, всомъ, всь, всья, всю, вся, всяго, всєму, всіи, всѣ, всѣи, всѣм, всѣми, всѣмъ, всѣмь, всѣх, всѣхъ, всѣхь, всꙗ, въвесь, въсе, въсем, въсеми, въсемъ, въсехъ, въсю, въся, въсѣ, вьсе, вьси, вєсь.

The 2nd highest number of forms (61) was observed with the lemma “свой”: [своя], с., св[о]ему, св[о]є, сваей, сво[е]ю, сво[его, сво[еи], свово, свое, свое(м), свое[го], свое[ю], своев[о], своево, своег[о], своего, своегѡ, своее, своеи, своеим, своей, своем, своему, своемъ, своемь, своемꙋ, своею, своея, своеі, своеѣ, своеꙗ, свои, свои(м), своим, своими, своимъ, своимь, своимі, своих, своихъ, свой, свою, своя, своє, своєво, своєг[о, своєг[о], своєго, своєи, своєм, своєму, своєю, своі, своіми, своімъ, своіх, своіхъ, своꙗ, свъею, свѡегѡ.

The 3rd highest number of forms (60) was observed with the lemma “иной”: Инъ, и(н)ныхъ, инаа, иная, инаꙗ, инеми, ини, инии, иния, ино, инова, иново, иного, иное, инои, иной, ином, иному, иномъ, иномꙋ, иноє, иную, ины, ины(х), иныа, иные, иныи, иный, иным, иными, инымъ, иных, иныхъ, иныя, иныє, ині, иніи, инѡе, инѣи, инѣмь, инѣх, н(н)наꙗ, ыного, ыной, ыной], ыном, ыному, ыномъ, ыные, ыным, ыными, ынымо, ыных, ыныхъ, іное, інои, іному, інымъ, іныя, ініи.

DET occurs with 10 features: PronType (8141; 99% instances), Case (8109; 99% instances), Number (8107; 99% instances), Gender (8106; 99% instances), Poss (2853; 35% instances), Reflex (1088; 13% instances), Animacy (337; 4% instances), Variant (118; 1% instances), Abbr (76; 1% instances), Typo (3; 0% instances)

DET occurs with 27 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 345 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|PronType=Tot (305 tokens). Examples: всеа, всея, всеи, всей, всеꙗ, иные, всее, всякие, всякия, другіе

Relations

DET nodes are attached to their parents using 28 different relations: det (7193; 88% instances), nsubj (280; 3% instances), obl (187; 2% instances), obj (142; 2% instances), obl:float (92; 1% instances), conj (72; 1% instances), iobj (62; 1% instances), nmod (50; 1% instances), nsubj:pass (30; 0% instances), acl (13; 0% instances), fixed (11; 0% instances), orphan (11; 0% instances), parataxis (8; 0% instances), root (6; 0% instances), ccomp (4; 0% instances), obl:tmod (3; 0% instances), advcl (2; 0% instances), appos (2; 0% instances), dep (2; 0% instances), obl:agent (2; 0% instances), obl:depict (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), advmod (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), expl:pv (1; 0% instances), reparandum (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (6754; 83% instances), VERB (744; 9% instances), PROPN (362; 4% instances), ADJ (149; 2% instances), PRON (98; 1% instances), DET (42; 1% instances), ADP (10; 0% instances), ADV (10; 0% instances), NUM (6; 0% instances), (6; 0% instances), PART (1; 0% instances)

7302 (89%) DET nodes are leaves.

727 (9%) DET nodes have one child.

88 (1%) DET nodes have two children.

65 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 15.

Children of DET nodes are attached using 28 different relations: advmod (320; 27% instances), case (230; 20% instances), punct (91; 8% instances), appos (77; 7% instances), cc (74; 6% instances), acl:relcl (65; 6% instances), conj (54; 5% instances), discourse (31; 3% instances), acl (30; 3% instances), det (27; 2% instances), nsubj (26; 2% instances), orphan (26; 2% instances), parataxis (25; 2% instances), vocative (22; 2% instances), nmod (20; 2% instances), cop (11; 1% instances), obl (10; 1% instances), dislocated (9; 1% instances), mark (7; 1% instances), advcl (5; 0% instances), dep (4; 0% instances), obl:pronmod (3; 0% instances), amod (2; 0% instances), expl (2; 0% instances), fixed (2; 0% instances), aux (1; 0% instances), obl:tmod (1; 0% instances), reparandum (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: PART (350; 30% instances), ADP (223; 19% instances), NOUN (194; 16% instances), VERB (98; 8% instances), PUNCT (91; 8% instances), CCONJ (78; 7% instances), DET (42; 4% instances), ADJ (29; 2% instances), PRON (22; 2% instances), AUX (14; 1% instances), SCONJ (11; 1% instances), PROPN (10; 1% instances), ADV (6; 1% instances), NUM (4; 0% instances), X (4; 0% instances)