Treebank Statistics: UD_Russian-SynTagRus: POS Tags: DET
There are 40 DET
lemmas (0%), 343 DET
types (0%) and 41139 DET
tokens (3%).
Out of 17 observed tags, the rank of DET
is: 12 in number of lemmas, 8 in number of types and 11 in number of tokens.
The 10 most frequent DET
lemmas: этот, свой, весь, тот, такой, его, наш, их, мой, какой
The 10 most frequent DET
types: его, все, их, этот, эти, ее, этой, такой, всех, своей
The 10 most frequent ambiguous lemmas: такой (DET 3535, ADJ 1), какой (DET 1038, ADJ 1), один (NUM 2706, DET 984, NOUN 3), другой (ADJ 2063, DET 658), самый (ADJ 1600, DET 528), сам (ADJ 1186, DET 514), никакой (DET 508, ADJ 1), иной (ADJ 385, DET 90), прочий (ADJ 103, DET 58), кой (DET 34, ADJ 9)
The 10 most frequent ambiguous types: его (DET 2183, PRON 2063), все (DET 1667, PRON 1487, PART 677), их (PRON 1530, DET 1191), ее (PRON 1109, DET 897), этой (DET 923, PRON 1), всех (DET 715, PRON 231), это (PRON 4381, DET 568, PART 317), этом (PRON 1062, DET 656), этого (PRON 737, DET 644), том (PRON 1206, DET 546, NOUN 13)
- его
- все
- их
- ее
- этой
- всех
- это
- этом
- этого
- том
Morphology
The form / lemma ratio of DET
is 8.575000 (the average of all parts of speech is 2.654430).
The 1st highest number of forms (15) was observed with the lemma “свой”: свое, своего, своей, своем, своему, своею, свои, своим, своими, своих, свой, свою, своя, своё, своём.
The 2nd highest number of forms (14) was observed with the lemma “весь”: весь, все, всего, всей, всем, всеми, всему, всех, всею, всея, всю, вся, всё, всём.
The 3rd highest number of forms (14) was observed with the lemma “мой”: мое, моего, моей, моем, моему, моею, мои, моим, моими, моих, мой, мою, моя, моём.
DET
occurs with 10 features: Number (36546; 89% instances), Case (36545; 89% instances), PronType (27807; 68% instances), Gender (24512; 60% instances), Poss (8758; 21% instances), Reflex (3223; 8% instances), Animacy (1767; 4% instances), Abbr (17; 0% instances), Typo (2; 0% instances), Variant (1; 0% instances)
DET
occurs with 24 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Poss=Yes
, PronType=Dem
, PronType=Ind
, PronType=Int,Rel
, PronType=Neg
, PronType=Prs
, PronType=Tot
, Reflex=Yes
, Typo=Yes
, Variant=Short
DET
occurs with 203 feature combinations.
The most frequent feature combination is Poss=Yes|PronType=Prs
(3073 tokens).
Examples: его, их, ее, её
Relations
DET
nodes are attached to their parents using 24 different relations: det (37906; 92% instances), nsubj (936; 2% instances), obl (474; 1% instances), amod (448; 1% instances), fixed (430; 1% instances), root (266; 1% instances), conj (232; 1% instances), obj (104; 0% instances), parataxis (50; 0% instances), nmod (48; 0% instances), xcomp (45; 0% instances), iobj (41; 0% instances), nsubj:pass (39; 0% instances), orphan (29; 0% instances), ccomp (28; 0% instances), advmod (19; 0% instances), acl (15; 0% instances), appos (14; 0% instances), advcl (5; 0% instances), csubj (5; 0% instances), acl:relcl (2; 0% instances), dislocated (1; 0% instances), obl:agent (1; 0% instances), obl:tmod (1; 0% instances)
Parents of DET
nodes belong to 14 different parts of speech: NOUN (35147; 85% instances), VERB (2084; 5% instances), ADJ (1088; 3% instances), PRON (1047; 3% instances), PROPN (476; 1% instances), ADP (384; 1% instances), DET (325; 1% instances), (266; 1% instances), NUM (170; 0% instances), ADV (87; 0% instances), PART (45; 0% instances), X (11; 0% instances), SYM (8; 0% instances), CCONJ (1; 0% instances)
35762 (87%) DET
nodes are leaves.
3546 (9%) DET
nodes have one child.
1094 (3%) DET
nodes have two children.
737 (2%) DET
nodes have three or more children.
The highest child degree of a DET
node is 20.
Children of DET
nodes are attached using 31 different relations: advmod (2422; 28% instances), punct (1370; 16% instances), acl:relcl (794; 9% instances), obl (651; 8% instances), case (564; 7% instances), conj (498; 6% instances), cc (438; 5% instances), nmod (328; 4% instances), nsubj (318; 4% instances), det (196; 2% instances), parataxis (177; 2% instances), fixed (146; 2% instances), amod (142; 2% instances), cop (130; 2% instances), ccomp (109; 1% instances), mark (88; 1% instances), advcl (65; 1% instances), acl (50; 1% instances), orphan (34; 0% instances), flat:foreign (20; 0% instances), appos (19; 0% instances), expl (13; 0% instances), discourse (11; 0% instances), nummod:gov (7; 0% instances), aux (6; 0% instances), csubj (6; 0% instances), iobj (3; 0% instances), nummod (3; 0% instances), dislocated (1; 0% instances), nsubj:outer (1; 0% instances), obl:tmod (1; 0% instances)
Children of DET
nodes belong to 15 different parts of speech: PART (2278; 26% instances), PUNCT (1370; 16% instances), VERB (983; 11% instances), NOUN (795; 9% instances), ADJ (585; 7% instances), ADP (568; 7% instances), ADV (544; 6% instances), CCONJ (414; 5% instances), PRON (370; 4% instances), DET (325; 4% instances), AUX (138; 2% instances), PROPN (113; 1% instances), SCONJ (105; 1% instances), NUM (15; 0% instances), X (8; 0% instances)