Treebank Statistics: UD_Russian-SynTagRus: POS Tags: PRON
There are 35 PRON
lemmas (0%), 144 PRON
types (0%) and 72945 PRON
tokens (5%).
Out of 17 observed tags, the rank of PRON
is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.
The 10 most frequent PRON
lemmas: он, это, я, который, они, то, мы, она, что, все
The 10 most frequent PRON
types: это, он, я, мы, что, они, его, она, все, то
The 10 most frequent ambiguous lemmas: это (PRON 8070, PART 364), то (PRON 5654, SCONJ 1547, PART 319), что (SCONJ 10960, PRON 4061, ADV 15, NOUN 1, PART 1), все (PRON 3056, PART 746, ADV 1, DET 1), вы (PRON 1632, X 1), что-то (PRON 548, ADV 12), многие (PRON 75, ADJ 5), прочее (PRON 34, NOUN 1), нечего (PRON 23, NOUN 12, ADV 8, VERB 5), немногие (ADJ 31, PRON 7)
The 10 most frequent ambiguous types: это (PRON 4381, DET 568, PART 317), что (SCONJ 10907, PRON 2419, ADV 11, NOUN 1), его (DET 2183, PRON 2063), все (DET 1667, PRON 1487, PART 677), то (SCONJ 1519, PRON 1398, DET 381, PART 319), их (PRON 1530, DET 1191), того (PRON 1312, DET 245, PART 1), том (PRON 1206, DET 546, NOUN 13), ее (PRON 1109, DET 897), этом (PRON 1062, DET 656)
- это
- что
- его
- все
- то
- их
- того
- том
- ее
- этом
Morphology
The form / lemma ratio of PRON
is 4.114286 (the average of all parts of speech is 2.654430).
The 1st highest number of forms (12) was observed with the lemma “который”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.
The 2nd highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.
The 3rd highest number of forms (9) was observed with the lemma “она”: ее, ей, ею, её, нее, ней, нею, неё, она.
PRON
occurs with 9 features: Case (72259; 99% instances), Number (59240; 81% instances), PronType (49059; 67% instances), Person (38523; 53% instances), Gender (35076; 48% instances), Animacy (19066; 26% instances), Reflex (1587; 2% instances), Abbr (22; 0% instances), Typo (1; 0% instances)
PRON
occurs with 25 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, PronType=Dem
, PronType=Ind
, PronType=Int,Rel
, PronType=Neg
, PronType=Prs
, PronType=Tot
, Reflex=Yes
, Typo=Yes
PRON
occurs with 202 feature combinations.
The most frequent feature combination is Case=Nom|PronType=Int,Rel
(4476 tokens).
Examples: что, которые, кто, который, которая, которое, че
Relations
PRON
nodes are attached to their parents using 32 different relations: nsubj (32282; 44% instances), obl (14292; 20% instances), obj (9226; 13% instances), iobj (5412; 7% instances), nmod (4260; 6% instances), nsubj:pass (1494; 2% instances), root (1285; 2% instances), conj (843; 1% instances), fixed (789; 1% instances), expl (689; 1% instances), parataxis (647; 1% instances), cc (512; 1% instances), advmod (206; 0% instances), mark (154; 0% instances), discourse (141; 0% instances), ccomp (120; 0% instances), appos (113; 0% instances), advcl (95; 0% instances), obl:agent (89; 0% instances), orphan (87; 0% instances), acl:relcl (74; 0% instances), amod (32; 0% instances), acl (29; 0% instances), xcomp (21; 0% instances), det (17; 0% instances), csubj (15; 0% instances), flat:name (12; 0% instances), dislocated (3; 0% instances), flat (2; 0% instances), nsubj:outer (2; 0% instances), case (1; 0% instances), obl:tmod (1; 0% instances)
Parents of PRON
nodes belong to 17 different parts of speech: VERB (53669; 74% instances), NOUN (8607; 12% instances), ADJ (5198; 7% instances), ADV (1882; 3% instances), (1285; 2% instances), PRON (826; 1% instances), DET (370; 1% instances), ADP (317; 0% instances), NUM (310; 0% instances), PROPN (237; 0% instances), PART (150; 0% instances), CCONJ (54; 0% instances), SCONJ (13; 0% instances), SYM (13; 0% instances), X (8; 0% instances), INTJ (4; 0% instances), AUX (2; 0% instances)
48805 (67%) PRON
nodes are leaves.
16602 (23%) PRON
nodes have one child.
4476 (6%) PRON
nodes have two children.
3062 (4%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 13.
Children of PRON
nodes are attached using 35 different relations: case (15457; 41% instances), punct (5584; 15% instances), acl (3021; 8% instances), advmod (2855; 8% instances), fixed (1472; 4% instances), nsubj (1456; 4% instances), det (996; 3% instances), cc (951; 3% instances), conj (900; 2% instances), amod (881; 2% instances), parataxis (762; 2% instances), nmod (657; 2% instances), obl (475; 1% instances), acl:relcl (428; 1% instances), advcl (392; 1% instances), cop (328; 1% instances), mark (249; 1% instances), orphan (231; 1% instances), appos (210; 1% instances), discourse (98; 0% instances), expl (64; 0% instances), nummod:gov (61; 0% instances), csubj (52; 0% instances), iobj (43; 0% instances), vocative (37; 0% instances), aux (13; 0% instances), nummod (11; 0% instances), ccomp (8; 0% instances), dislocated (7; 0% instances), flat:name (6; 0% instances), obj (5; 0% instances), flat (4; 0% instances), obl:tmod (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances)
Children of PRON
nodes belong to 17 different parts of speech: ADP (15378; 41% instances), PUNCT (5584; 15% instances), VERB (4354; 12% instances), PART (2872; 8% instances), NOUN (2569; 7% instances), ADJ (1626; 4% instances), ADV (1480; 4% instances), DET (1047; 3% instances), CCONJ (934; 2% instances), PRON (826; 2% instances), AUX (351; 1% instances), SCONJ (333; 1% instances), PROPN (253; 1% instances), NUM (97; 0% instances), INTJ (7; 0% instances), SYM (4; 0% instances), X (3; 0% instances)