home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: VERB

There are 1402 VERB lemmas (18%), 3837 VERB types (19%) and 9304 VERB tokens (8%). Out of 17 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: мети, дати, писати, держати, быти, хотети, чинити, давати, учинити, бити

The 10 most frequent VERB types: дали, псан, мають, мают(ь), дал, маеть, казали, держати, послали, далъ

The 10 most frequent ambiguous lemmas: быти (AUX 1523, VERB 218, SCONJ 1), потреба (NOUN 28, VERB 3), встыдъ (NOUN 1, VERB 1), жаль (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: было (AUX 87, VERB 33), были (AUX 115, VERB 32), брати (VERB 25, NOUN 2), был (AUX 64, VERB 17), быти (AUX 59, VERB 19), бꙋдеть (AUX 42, VERB 12), была (AUX 21, VERB 11), былъ (AUX 36, VERB 9), хотѧ (VERB 9, SCONJ 6, PART 1), будучи (VERB 8, AUX 2)

Morphology

The form / lemma ratio of VERB is 2.736805 (the average of all parts of speech is 2.589846).

The 1st highest number of forms (51) was observed with the lemma “писати”: П(и)сал, Писа(н), Пишет(ь), Пишу, пис(ал), писа(л), писа(н̑), писал, писали, писалъ, писан, писан(а), писана, писаннꙋю, писано, писаного, писаное, писаному, писаномъ, писаную, писанъ, писаны, писаные, писаныи, писаный, писаным, писанымъ, писаных, писаныхъ, писати, пишем, пишемъ, пишемы, пишемь, пишет(е), пишете, пишетъ, пишеть, пишетѣ, пишом, пишю, пишѣм, пишѣт(е), пишѣте, псал, псан, псан(а), псано, псанъ, пішете, пішетѣ.

The 2nd highest number of forms (44) was observed with the lemma “учинити”: Учинили, вчинена, вчиненыи, вчинил, вчинили, вчинилъ, вчинимъ, вчинит(ь), вчините, вчинити, вчинитѣ, вчинте, вчиньтѣ, вчынили, вчынити, въчинили, оуч(и)нила, оучилилъ, оучиненыи, оучини, оучинил, оучинил(о), оучинила, оучинили, оучинилъ, оучиним, оучинимъ, оучинит(и), оучинити, оучинить, оучинѣмо, учинено, учинилъ, учинити, ѹчинили, ꙋчинено, ꙋчинили, ꙋчинит(е), ꙋчинѣна, ꙋчинѣны, ꙋчынили, ꙋчынилъ, ꙋчынит(ь), ꙋчынят(ь).

The 3rd highest number of forms (41) was observed with the lemma “держати”: д(е)ржали, де(р)жѣмо, дер[ж]али, держал, держала, держали, держалъ, держано, держаны, держаные, держат, держат(и), держат(ь), держати, держать, держим, держимъ, держит(ь), держите, держить, держитѣ, держу, держыть, держышъ, держѧт(и), держꙋ, деръжали, деръжалъ, деръжати, деръжать, деръжить, деръжыть, дрѣжите, дѣржали, дѣржат(и), дѣржати, дѣржим, дѣржу, дѣржѧт(и), дѣржѧтъ, дѣрзѧти.

VERB occurs with 16 features: VerbForm (9223; 99% instances), Voice (9223; 99% instances), Tense (7072; 76% instances), Aspect (6703; 72% instances), Number (6661; 72% instances), Gender (2628; 28% instances), Person (2298; 25% instances), Mood (2277; 24% instances), Analyt (1068; 11% instances), Case (774; 8% instances), Variant (543; 6% instances), Polarity (20; 0% instances), Animacy (9; 0% instances), Reflex (3; 0% instances), Abbr (2; 0% instances), Typo (2; 0% instances)

VERB occurs with 37 feature-value pairs: Abbr=Yes, Analyt=Yes, Animacy=Anim, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Reflex=Yes, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=PartRes, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 276 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Act (728 tokens). Examples: дал, далъ, поведил, держалъ, поведилъ, положил, взял, продал, записал, поведал

Relations

VERB nodes are attached to their parents using 25 different relations: root (2946; 32% instances), conj (2408; 26% instances), advcl (1339; 14% instances), xcomp (954; 10% instances), acl:relcl (478; 5% instances), acl (393; 4% instances), ccomp (317; 3% instances), parataxis (158; 2% instances), csubj (106; 1% instances), dislocated (72; 1% instances), amod (34; 0% instances), iobj (22; 0% instances), parataxis:discourse (20; 0% instances), obj (9; 0% instances), obl (9; 0% instances), nmod (7; 0% instances), orphan (7; 0% instances), reparandum (7; 0% instances), obl:depict (5; 0% instances), appos (3; 0% instances), dep (3; 0% instances), fixed (3; 0% instances), nsubj (2; 0% instances), csubj:pass (1; 0% instances), obl:tmod (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: VERB (4940; 53% instances), (2946; 32% instances), NOUN (877; 9% instances), ADJ (192; 2% instances), PRON (152; 2% instances), DET (68; 1% instances), PROPN (59; 1% instances), ADV (51; 1% instances), NUM (9; 0% instances), AUX (4; 0% instances), CCONJ (4; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

227 (2%) VERB nodes are leaves.

685 (7%) VERB nodes have one child.

948 (10%) VERB nodes have two children.

7444 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 17.

Children of VERB nodes are attached using 42 different relations: punct (8008; 20% instances), obl (6475; 16% instances), obj (4128; 10% instances), cc (3966; 10% instances), nsubj (3365; 8% instances), advmod (3272; 8% instances), iobj (2631; 6% instances), conj (2538; 6% instances), advcl (1331; 3% instances), mark (1231; 3% instances), aux (1107; 3% instances), xcomp (985; 2% instances), ccomp (375; 1% instances), parataxis (207; 1% instances), nsubj:pass (185; 0% instances), expl (180; 0% instances), obl:tmod (128; 0% instances), aux:pass (125; 0% instances), discourse (88; 0% instances), dislocated (76; 0% instances), vocative (54; 0% instances), csubj (41; 0% instances), acl (38; 0% instances), expl:pv (31; 0% instances), dep (30; 0% instances), obl:agent (16; 0% instances), case (12; 0% instances), obl:float (12; 0% instances), parataxis:discourse (12; 0% instances), det (11; 0% instances), obl:depict (11; 0% instances), cop (8; 0% instances), reparandum (8; 0% instances), nummod:gov (6; 0% instances), nmod (5; 0% instances), orphan (5; 0% instances), acl:relcl (3; 0% instances), csubj:pass (1; 0% instances), flat:name (1; 0% instances), goeswith (1; 0% instances), list (1; 0% instances), nummod (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (9914; 24% instances), PUNCT (8008; 20% instances), PRON (5385; 13% instances), VERB (4940; 12% instances), CCONJ (3971; 10% instances), ADV (2100; 5% instances), PART (1321; 3% instances), AUX (1255; 3% instances), PROPN (1240; 3% instances), SCONJ (1237; 3% instances), DET (757; 2% instances), ADJ (436; 1% instances), NUM (114; 0% instances), X (15; 0% instances), ADP (11; 0% instances), SYM (4; 0% instances), INTJ (1; 0% instances)