Treebank Statistics: UD_Erzya-JR: POS Tags: VERB
There are 926 VERB lemmas (28%), 2417 VERB types (35%) and 3729 VERB tokens (18%).
Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
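The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page. It assumes that a "type" is a distinct, case-sensitive word form, and it runs on a tiny hand-made sample; the function name `pos_counts` and the NOUN row are hypothetical.

```python
# Tiny hand-made CoNLL-U-style sample (illustrative only, not real treebank data).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = (
    "# sent_id = example\n"
    "1\tмерсь\tмеремс\tVERB\t_\t_\t0\troot\t_\t_\n"
    "2\tсась\tсамс\tVERB\t_\t_\t1\tconj\t_\t_\n"
    "3\tсы\tсамс\tVERB\t_\t_\t1\tconj\t_\t_\n"
    "4\tкудо\tкудо\tNOUN\t_\t_\t3\tobl\t_\t_\n"
)


def pos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # distinct word forms, case-sensitive
            lemmas.add(cols[2])  # distinct lemmas
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "VERB"))
```

On the sample, the three VERB tokens share two lemmas, which is the same kind of relationship as the 926 lemmas, 2417 types and 3729 tokens reported for the full treebank.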
The 10 most frequent VERB lemmas: молемс, меремс, теемс, лисемс, самс, ваномс, ютамс, саемс, аштемс, туемс
The 10 most frequent VERB types: мерсь, лиссь, ютась, мольсь, ашти, неяви, совась, маряви, саизе, сась
The 10 most frequent ambiguous lemmas: пелемс (VERB 21, NOUN 1), чавомс (VERB 13, ADJ 1), стямс (VERB 12, ADV 1), улемс (AUX 50, VERB 6), эрявомс (AUX 21, VERB 8), карамс (VERB 7, ADV 1), азё (VERB 5, PART 1), менемс (VERB 4, NOUN 1), ульнемс (AUX 45, VERB 3), молема (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: эрямо (VERB 5, NOUN 2), Улить (VERB 4, AUX 2), азё (PART 1, VERB 1), вечкевикс (VERB 3, ADJ 2), кенере (VERB 3, NOUN 1), эряви (AUX 13, VERB 3), Ярсамодо (VERB 2, NOUN 1), вечкема (VERB 2, NOUN 1), кадык (AUX 3, VERB 2), лисема (NOUN 1, VERB 1)
- эрямо
- Улить
- азё
  - PART 1: Редясызь мейле , ды азё содык : панжадо кадовозь кенкшесь тесэ чумо — туевсь ве ёнов , эли верьгиз каподизе , эли кодамояк чопача салызе , паряк , сонсь Вирьаваськак кедензэ венстнизе .
  - VERB 1: Федоров мик кортамояк эзь карма , поп - аванзо кучизе : азё , келя , мерть : попонтень тынк марто кортамс а мезть .
- вечкевикс
- кенере
- эряви
- Ярсамодо
- вечкема
- кадык
- лисема
Morphology
The form / lemma ratio of VERB is 2.610151 (the average of all parts of speech is 2.080194).
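The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of this page. A short check, assuming that "form" here means a distinct word type:

```python
# Counts taken from the summary at the top of this page.
verb_types = 2417
verb_lemmas = 926

# Assumption: the form / lemma ratio is distinct types divided by distinct lemmas.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")
```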
The 1st highest number of forms (29) was observed with the lemma “самс”: Сыде, Сынек, са, садо, сазо, сазь, сак, сакшность, сакшнось, самаль, само, самодо, самодон, самодост, самосонзо, самось, самоськак, самс, састь, састькак, сась, сат, сы, сыль, сынь, сыть, сыця, сыцятненень, сыцятнень.
The 2nd highest number of forms (22) was observed with the lemma “марямс”: Марить, Марясть, Марясы, Марясынек, Марясынк, мари, мариде, маризе, маризь, маринек, маринь, мария, маря, марязь, марямга, марямо, маряса, марясак, маряст, марясынзе, марясь, маряяк.
The 3rd highest number of forms (21) was observed with the lemma “неемс”: Неезденть, Нейсы, нее, неезь, неемеде, неемс, нееяк, неи, неизе, неиль, неинек, неинзе, нейсамизь, нейсызь, нейсь, несамам, несы, несызь, несынзе, несь, неят.
VERB occurs with 20 features: Person[subj] (2612; 70% instances), Number[subj] (2611; 70% instances), Mood (2598; 70% instances), Tense (2492; 67% instances), VerbForm (933; 25% instances), Case (515; 14% instances), Person[obj] (509; 14% instances), Number[obj] (505; 14% instances), Derivation (269; 7% instances), Number (226; 6% instances), Definite (212; 6% instances), Connegative (183; 5% instances), Nomzr (79; 2% instances), Clitic (49; 1% instances), Aspect (48; 1% instances), Number[psor] (38; 1% instances), Person[psor] (38; 1% instances), PartForm (15; 0% instances), Style (14; 0% instances), Typo (2; 0% instances)
VERB occurs with 64 feature-value pairs: Aspect=Hab, Aspect=Inch, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Loc, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Connegative=Yes, Definite=Def, Definite=Ind, Derivation=OkshnOms, Derivation=Omka, Derivation=OvOms, Derivation=Ovt, Derivation=Ozj, Derivation=VGen, Derivation=VSj, Mood=Cnd, Mood=CndSub, Mood=Des, Mood=Imp, Mood=Ind, Mood=Nec, Mood=Opt, Mood=Prec, Mood=Sub, Nomzr=Ag, Number=Plur, Number=Plur,Sing, Number=Sing, Number[obj]=Plur, Number[obj]=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Plur, Number[subj]=Plur,Sing, Number[subj]=Sing, PartForm=PastDyn, PartForm=PrsDet, PartForm=PrsTra, Person[obj]=1, Person[obj]=2, Person[obj]=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Style=Arch, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Conv,Part, VerbForm=Inf, VerbForm=Part, VerbForm=Vnoun
VERB occurs with 259 feature combinations.
The most frequent feature combination is Mood=Ind|Number[subj]=Sing|Person[subj]=3|Tense=Past (731 tokens).
Examples: мерсь, лиссь, мольсь, ютась, совась, сась, пшкадсь, тейсь, чийсь, кадовсь
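A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, where feature names may carry a layer in square brackets (for example `Person[subj]`). The sketch below shows one way to split such a string into a dictionary; the helper name `parse_feats` is hypothetical, and the code is an illustration rather than the tool used to produce these statistics.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dictionary."""
    # An underscore marks an empty feature set in CoNLL-U.
    if feats == "_":
        return {}
    # Split only on the first "=" so that values are kept intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Mood=Ind|Number[subj]=Sing|Person[subj]=3|Tense=Past"
parsed = parse_feats(combo)
for name, value in parsed.items():
    print(name, value)
```

Multi-valued entries such as `Number[subj]=Plur,Sing` stay as a single string value in this sketch; splitting them on the comma would be a further step if individual values are needed.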
Relations
VERB nodes are attached to their parents using 31 different relations: root (1680; 45% instances), conj (886; 24% instances), advcl (309; 8% instances), parataxis (188; 5% instances), xcomp (151; 4% instances), acl (128; 3% instances), ccomp (90; 2% instances), acl:relcl (59; 2% instances), appos (31; 1% instances), obl (27; 1% instances), nsubj (25; 1% instances), csubj (22; 1% instances), amod (19; 1% instances), xcomp:ds (18; 0% instances), compound (17; 0% instances), advcl:tcl (15; 0% instances), nmod (15; 0% instances), obl:cmp (11; 0% instances), obj (10; 0% instances), fixed (5; 0% instances), advcl:eval (4; 0% instances), discourse (4; 0% instances), nsubj:cop (3; 0% instances), advcl:cmp (2; 0% instances), dislocated (2; 0% instances), obl:tmod (2; 0% instances), vocative (2; 0% instances), csubj:cop (1; 0% instances), obl:inst (1; 0% instances), obl:lmod (1; 0% instances), orphan (1; 0% instances)
Parents of VERB nodes belong to 12 different parts of speech: [no part of speech; the parent is the artificial root] (1680; 45% instances), VERB (1549; 42% instances), NOUN (319; 9% instances), ADJ (72; 2% instances), ADV (48; 1% instances), PRON (30; 1% instances), INTJ (12; 0% instances), PROPN (9; 0% instances), ADP (3; 0% instances), AUX (3; 0% instances), DET (3; 0% instances), NUM (1; 0% instances)
269 (7%) VERB nodes are leaves.
436 (12%) VERB nodes have one child.
628 (17%) VERB nodes have two children.
2396 (64%) VERB nodes have three or more children.
The highest child degree of a VERB node is 11.
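The four child-degree classes above partition all VERB tokens, so their counts should add up to the 3729 tokens reported at the top of the page, and the percentages follow by division. A quick consistency check (the dictionary keys are illustrative labels, not treebank terminology):

```python
# Counts of VERB nodes by number of children, as listed above.
degree_counts = {
    "leaf": 269,
    "one child": 436,
    "two children": 628,
    "three or more children": 2396,
}

total = sum(degree_counts.values())

# Share of each class, rounded to whole percent as on this page.
shares = {name: round(100 * count / total) for name, count in degree_counts.items()}

print(total)
for name, share in shares.items():
    print(f"{name}: {share}%")
```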
Children of VERB nodes are attached using 62 different relations: punct (3261; 28% instances), nsubj (1571; 14% instances), obl (1347; 12% instances), obj (954; 8% instances), conj (885; 8% instances), advmod (461; 4% instances), advmod:tmod (414; 4% instances), obl:lmod (375; 3% instances), aux (354; 3% instances), advcl (294; 3% instances), cc (260; 2% instances), xcomp (193; 2% instances), parataxis (147; 1% instances), ccomp (108; 1% instances), mark (106; 1% instances), aux:aspect (104; 1% instances), obl:inst (80; 1% instances), discourse (79; 1% instances), advmod:lmod (76; 1% instances), obl:tmod (73; 1% instances), vocative (69; 1% instances), advmod:eval (34; 0% instances), obl:agent (33; 0% instances), appos (25; 0% instances), advmod:foc (24; 0% instances), nmod:gobj (23; 0% instances), nmod (20; 0% instances), advcl:tcl (18; 0% instances), aux:nec (18; 0% instances), xcomp:ds (18; 0% instances), aux:neg (16; 0% instances), advmod:mmod (15; 0% instances), aux:cnd (12; 0% instances), csubj (12; 0% instances), nmod:gsubj (12; 0% instances), advmod:deg (10; 0% instances), compound (10; 0% instances), det (9; 0% instances), nsubj:cop (9; 0% instances), acl (8; 0% instances), aux:opt (8; 0% instances), case (8; 0% instances), aux:q (7; 0% instances), cop (7; 0% instances), dislocated (6; 0% instances), amod (5; 0% instances), aux:imp (5; 0% instances), expl (5; 0% instances), acl:relcl (4; 0% instances), cc:preconj (3; 0% instances), compound:prt (3; 0% instances), dep (3; 0% instances), obl:cmp (3; 0% instances), nmod:poss (2; 0% instances), nummod (2; 0% instances), obl:own (2; 0% instances), orphan (2; 0% instances), advcl:eval (1; 0% instances), advmod:cau (1; 0% instances), advmod:cmp (1; 0% instances), fixed (1; 0% instances), obl:cau (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: PUNCT (3261; 28% instances), NOUN (3225; 28% instances), VERB (1549; 13% instances), ADV (1196; 10% instances), PRON (725; 6% instances), AUX (540; 5% instances), PROPN (430; 4% instances), CCONJ (265; 2% instances), ADP (111; 1% instances), ADJ (106; 1% instances), SCONJ (50; 0% instances), INTJ (46; 0% instances), PART (46; 0% instances), DET (40; 0% instances), NUM (26; 0% instances), X (1; 0% instances)