home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-PUD: POS Tags: ADJ

There are 1017 ADJ lemmas (20%), 1559 ADJ types (20%) and 2089 ADJ tokens (11%). Out of 17 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: новый, другой, самый, первый, большой, последний, северный, высокий, британский, многий

The 10 most frequent ADJ types: другие, первой, других, многие, новые, самым, новых, последние, Северной, большой

The 10 most frequent ambiguous lemmas: военный (ADJ 8, NOUN 1), рабочий (ADJ 7, NOUN 3), возможный (ADJ 5, ADV 1), 1 (ADJ 4, NUM 4), данный (ADJ 4, NOUN 1), 10 (NUM 3, ADJ 2), 31 (ADJ 2, NUM 1), 5 (ADJ 2, NUM 1), 9 (ADJ 2, NUM 1), неизвестный (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: многие (ADJ 6, NOUN 3), 1 (ADJ 4, NUM 4), возможно (ADV 5, ADJ 4), главным (ADJ 4, NOUN 1), лучше (ADJ 3, ADV 3), 10 (NUM 3, ADJ 2), 31 (ADJ 2, NUM 1), 5 (ADJ 2, NUM 2), 9 (ADJ 2, NUM 1), похоже (ADJ 2, ADV 2)

Morphology

The form / lemma ratio of ADJ is 1.532940 (the average of all parts of speech is 1.496727).

The 1st highest number of forms (10) was observed with the lemma “первый”: первая, первого, первое, первой, первом, первую, первые, первый, первым, первых.

The 2nd highest number of forms (10) was observed with the lemma “самый”: самая, самого, самое, самой, самом, самые, самый, самым, самыми, самых.

The 3rd highest number of forms (9) was observed with the lemma “большой”: большая, большие, большим, большими, больших, большое, большой, большую, наибольшее.

ADJ occurs with 6 features: Degree (1878; 90% instances), Number (1872; 90% instances), Case (1770; 85% instances), Gender (1274; 61% instances), Animacy (144; 7% instances), Variant (101; 5% instances)

ADJ occurs with 17 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Variant=Short

ADJ occurs with 40 feature combinations. The most frequent feature combination is _ (210 tokens). Examples: 2014, 2015, III, 1, 1492, 2010, 2012, 2013, 2017, 1992

Relations

ADJ nodes are attached to their parents using 17 different relations: amod (1778; 85% instances), conj (74; 4% instances), root (62; 3% instances), xcomp (45; 2% instances), flat:name (18; 1% instances), obl (18; 1% instances), ccomp (16; 1% instances), nmod (16; 1% instances), fixed (13; 1% instances), acl (11; 1% instances), nsubj (11; 1% instances), parataxis (9; 0% instances), acl:relcl (7; 0% instances), advcl (6; 0% instances), appos (2; 0% instances), obj (2; 0% instances), iobj (1; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (1712; 82% instances), VERB (111; 5% instances), ADJ (88; 4% instances), PROPN (86; 4% instances), (62; 3% instances), PRON (14; 1% instances), ADP (11; 1% instances), NUM (3; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)

1743 (83%) ADJ nodes are leaves.

167 (8%) ADJ nodes have one child.

49 (2%) ADJ nodes have two children.

130 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 7.

Children of ADJ nodes are attached using 24 different relations: punct (178; 22% instances), advmod (109; 14% instances), nsubj (84; 10% instances), conj (77; 10% instances), cc (67; 8% instances), obl (63; 8% instances), cop (32; 4% instances), amod (30; 4% instances), case (25; 3% instances), mark (24; 3% instances), csubj (18; 2% instances), iobj (18; 2% instances), nmod (15; 2% instances), xcomp (15; 2% instances), advcl (12; 1% instances), parataxis (12; 1% instances), ccomp (10; 1% instances), aux (5; 1% instances), acl:relcl (4; 0% instances), nsubj:pass (4; 0% instances), nummod:gov (2; 0% instances), acl (1; 0% instances), aux:pass (1; 0% instances), expl (1; 0% instances)

Children of ADJ nodes belong to 14 different parts of speech: PUNCT (178; 22% instances), NOUN (126; 16% instances), ADV (97; 12% instances), ADJ (88; 11% instances), VERB (78; 10% instances), CCONJ (70; 9% instances), PRON (49; 6% instances), AUX (39; 5% instances), SCONJ (25; 3% instances), ADP (23; 3% instances), PART (17; 2% instances), PROPN (14; 2% instances), NUM (2; 0% instances), DET (1; 0% instances)