
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: ADJ

There are 487 ADJ lemmas (10%), 873 ADJ types (7%) and 940 ADJ tokens (3%). Out of 17 observed tags, the rank of ADJ is: 5 in number of lemmas, 5 in number of types and 10 in number of tokens.

The 10 most frequent ADJ lemmas: свѧтыи, третии, добрыи, другыи, борзыи, пѧтыи, великыи, лоньскыи, четвертыи, сторовыи

The 10 most frequent ADJ types: добро, проста, велика, ст҃ѣ, гнѣд, добра, здорово, пѧта, ст҃го, трьтиѧ

The 10 most frequent ambiguous lemmas: другыи (ADJ 20, DET 10), ·ѕ҃· (NUM 48, ADJ 3), ·в҃· (NUM 157, ADJ 2), ·д҃· (NUM 56, ADJ 2), двои (NUM 8, ADJ 2), ·г҃· (NUM 109, ADV 3, ADJ 1), ·е҃· (NUM 70, ADJ 1), ·з҃· (NUM 30, ADJ 1), ·и҃· (NUM 16, ADJ 1), трои (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: добро (ADJ 8, NOUN 3, SCONJ 1), добре (ADJ 2, ADV 1), добръ (ADJ 2, NOUN 1), другую (ADJ 2, DET 1), пѧте (NUM 3, ADJ 2), пѧть (NUM 13, ADJ 2), сменова (ADJ 2, PROPN 1), :ѕ҃: (NUM 4, ADJ 1), боле (ADV 2, ADJ 1, NUM 1), ви (ADJ 1, X 1)
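As a rough illustration of how such ambiguity lists could be derived, the sketch below groups items (lemmas or word forms) by the tags they occur with and keeps those that carry ADJ plus at least one other tag. The function name `ambiguous_items` and the sample data are hypothetical, and sorting by the ADJ count is an assumption suggested by the order of the lists above, not a documented rule.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, upos):
    """Return items seen with `upos` and at least one other tag.

    `pairs` is an iterable of (item, tag) tuples, where item is a lemma
    or a word form. The result is sorted by the count under `upos`
    (descending), then by the item itself (assumed tie-breaker).
    """
    tags_by_item = defaultdict(Counter)
    for item, tag in pairs:
        tags_by_item[item][tag] += 1
    ambiguous = {
        item: tags
        for item, tags in tags_by_item.items()
        if len(tags) > 1 and upos in tags
    }
    return sorted(ambiguous.items(), key=lambda kv: (-kv[1][upos], kv[0]))


# Hypothetical toy input, not taken from the treebank files.
sample_pairs = (
    [("другыи", "ADJ")] * 2
    + [("другыи", "DET")]
    + [("двои", "NUM"), ("двои", "ADJ")]
    + [("добрыи", "ADJ")]
)
sample_result = ambiguous_items(sample_pairs, "ADJ")
```

In this toy input, "добрыи" is dropped because it only ever occurs as ADJ.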

Morphology

The form / lemma ratio of ADJ is 1.792608 (the average of all parts of speech is 2.410435).
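The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given above (873 / 487 ≈ 1.792608). A minimal sketch of that calculation follows; the function name `form_lemma_ratio` and the sample triples are hypothetical, and counting forms as case-sensitive distinct strings is an assumption.

```python
def form_lemma_ratio(tokens, upos):
    """Distinct forms divided by distinct lemmas for one part of speech.

    `tokens` is an iterable of (form, lemma, tag) triples.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Hypothetical toy input: three distinct ADJ forms over two ADJ lemmas.
sample_tokens = [
    ("добро", "добрыи", "ADJ"),
    ("добра", "добрыи", "ADJ"),
    ("велика", "великыи", "ADJ"),
    ("добро", "добро", "NOUN"),
]
sample_ratio = form_lemma_ratio(sample_tokens, "ADJ")

# Cross-check against the counts reported on this page.
page_ratio = round(873 / 487, 6)
```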

The 1st highest number of forms (25) was observed with the lemma “свѧтыи”: (свѧ)тꙑе, (св҃)[т]аго, ст҃[у], свгто, свѧ[т]-[го, свѧтее, свѧтое, свѧтꙑ, свѧтꙑмъ, свѧтꙑѧ, св҃ѧ[т]о(го, сгто, ст҃ѣ, ст҃го, ст҃ее, ст҃ого, ст҃ому, ст҃хъ, ст҃ье, ст҃ѣ, ст҃ꙑ, ст҃ꙑ[и, ст҃ꙑи, ст҃ꙑх, ст[о]го.

The 2nd highest number of forms (23) was observed with the lemma “третии”: [тре]теее, теретеѧ, тр[ьтиѧ], третиꙗ, треть, тре[тии, трет)[и]ѥ, третеѧ, третиеи, третии, третиѥго, третиѧ, третиӏ, третье, третьемъ, третьи, третьюю, третьѣ, третьѣѣ, трьтие, трьтие——–, трьтиѧ, трьтиѧѧ.

The 3rd highest number of forms (19) was observed with the lemma “другыи”: [д]рѹ[г]…, другии, другого, другои, другому, другоѥ, другую, другꙑ, друогꙑ, дрѹгемо, дрѹги:и, дрѹгии, дрѹгоѥ, дрѹгъхо, дрѹгѹю, дрꙋ(г)ꙋю, дрꙋгаѧ, дрꙋгее, дрꙋгои.

ADJ occurs with 9 features: Case (913; 97% instances), Number (882; 94% instances), Gender (844; 90% instances), Poss (297; 32% instances), Variant (269; 29% instances), Degree (13; 1% instances), NumForm (10; 1% instances), Typo (3; 0% instances), Animacy (1; 0% instances)
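The per-feature counts can be reproduced in spirit by splitting the FEATS column of each ADJ token on "|" and counting feature names. The sketch below assumes the standard CoNLL-U conventions ("_" for an empty FEATS value, "Name=Value" pairs); the function name `feature_counts` and the sample strings are hypothetical.

```python
from collections import Counter


def feature_counts(feats_column):
    """Count how many tokens carry each feature name.

    `feats_column` is an iterable of CoNLL-U FEATS strings, one per token.
    A multi-value pair such as Case=Acc,Gen counts once for Case.
    """
    counts = Counter()
    for feats in feats_column:
        if feats == "_":  # token has no features
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            counts[name] += 1
    return counts


# Hypothetical toy input.
sample_feats = [
    "Case=Gen|Gender=Masc|Number=Sing|Poss=Yes",
    "Case=Acc,Gen|Gender=Fem|Number=Sing|Variant=Short",
    "_",
]
sample_counts = feature_counts(sample_feats)

# Percentage of ADJ tokens with Case, from the counts on this page.
case_percent = round(100 * 913 / 940)
```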

ADJ occurs with 22 feature-value pairs: Animacy=Anim, Case=Acc, Case=Acc,Gen, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Digit, Number=Count, Number=Dual, Number=Plur, Number=Sing, Poss=Yes, Typo=Yes, Variant=Short

ADJ occurs with 145 feature combinations. The most frequent feature combination is Case=Gen|Gender=Masc|Number=Sing|Poss=Yes (66 tokens). Examples: сменова, ѥванова, (ѡ)фоносова, [фили]пова, [хр҃т҃о]в, би҃ѧ, богусл]алѧ, бѣшкова, бѹѧкъва, вармина

Relations

ADJ nodes are attached to their parents using 24 different relations: amod (467; 50% instances), flat:name (92; 10% instances), root (84; 9% instances), conj (77; 8% instances), nmod (73; 8% instances), obl (42; 4% instances), nsubj (20; 2% instances), obj (15; 2% instances), dep (13; 1% instances), advcl (12; 1% instances), parataxis (8; 1% instances), acl (7; 1% instances), orphan (6; 1% instances), ccomp (4; 0% instances), iobj (4; 0% instances), xcomp (3; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), flat (2; 0% instances), list (2; 0% instances), vocative (2; 0% instances), compound (1; 0% instances), dislocated (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (487; 52% instances), PROPN (131; 14% instances), VERB (107; 11% instances), no part of speech, i.e. the artificial root (84; 9% instances), NUM (67; 7% instances), ADJ (50; 5% instances), PRON (7; 1% instances), X (5; 1% instances), ADP (1; 0% instances), PART (1; 0% instances)

570 (61%) ADJ nodes are leaves.

200 (21%) ADJ nodes have one child.

78 (8%) ADJ nodes have two children.

92 (10%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 8.
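The four child-count groups above add up to all 940 ADJ tokens (570 + 200 + 78 + 92). A minimal sketch of how such a histogram could be computed from dependency heads follows; the function name `child_degree_histogram` and the sample sentence are hypothetical, and each sentence is assumed to be a list of (id, head, tag) triples with head 0 for the root.

```python
from collections import Counter


def child_degree_histogram(sentences, upos):
    """Map number of children -> number of `upos` nodes with that many."""
    histogram = Counter()
    for sentence in sentences:
        # How many nodes point at each head id within this sentence.
        children = Counter(head for _node_id, head, _tag in sentence)
        for node_id, _head, tag in sentence:
            if tag == upos:
                histogram[children[node_id]] += 1
    return histogram


# Hypothetical toy sentence: node 1 is a leaf, node 4 has two children.
sample_sentence = [
    (1, 2, "ADJ"),
    (2, 0, "NOUN"),
    (3, 4, "ADP"),
    (4, 2, "ADJ"),
    (5, 4, "PUNCT"),
]
sample_histogram = child_degree_histogram([sample_sentence], "ADJ")

# Consistency check on the figures reported on this page.
page_total = 570 + 200 + 78 + 92
```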

Children of ADJ nodes are attached using 30 different relations: punct (176; 25% instances), case (118; 16% instances), cc (77; 11% instances), conj (67; 9% instances), nsubj (63; 9% instances), advmod (49; 7% instances), cop (32; 4% instances), iobj (25; 3% instances), nmod (16; 2% instances), obl (13; 2% instances), dep (12; 2% instances), mark (12; 2% instances), nummod:gov (7; 1% instances), det (6; 1% instances), advcl (5; 1% instances), appos (5; 1% instances), flat (5; 1% instances), parataxis (5; 1% instances), orphan (4; 1% instances), amod (3; 0% instances), vocative (3; 0% instances), aux (2; 0% instances), flat:name (2; 0% instances), list (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), obj (1; 0% instances), reparandum (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (176; 25% instances), ADP (118; 16% instances), CCONJ (73; 10% instances), NOUN (73; 10% instances), ADJ (50; 7% instances), PART (43; 6% instances), AUX (34; 5% instances), PRON (28; 4% instances), DET (22; 3% instances), PROPN (21; 3% instances), VERB (20; 3% instances), NUM (19; 3% instances), SCONJ (14; 2% instances), X (14; 2% instances), ADV (11; 2% instances)