Treebank Statistics: UD_Russian-Taiga: POS Tags: VERB
There are 4372 VERB lemmas (21%), 9935 VERB types (26%) and 24748 VERB tokens (13%).
Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
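Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that a "type" is a distinct, case-sensitive word form and that multiword-token ranges and empty nodes are skipped; the function name `pos_counts` and the sample sentences are hypothetical and not taken from the treebank.

```python
# Hand-made CoNLL-U fragment (hypothetical data): 10 tab-separated columns per token.
SAMPLE = (
    "# text = Он идёт и идёт.\n"
    "1\tОн\tон\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tидёт\tидти\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tи\tи\tCCONJ\t_\t_\t4\tcc\t_\t_\n"
    "4\tидёт\tидти\tVERB\t_\t_\t2\tconj\t_\t_\n"
    "5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
    "# text = Она шла.\n"
    "1\tОна\tона\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tшла\tидти\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def pos_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive word forms
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "VERB"))  # (1, 2, 3)
```

In the sample, the lemma "идти" appears three times in two distinct forms, which gives one lemma, two types and three tokens.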
The 10 most frequent VERB lemmas: быть, мочь, можно, сказать, нет, хотеть, знать, делать, работать, говорить
The 10 most frequent VERB types: есть, можно, нет, может, надо, могу, делать, хочу, здравствуйте, нравится
The 10 most frequent ambiguous lemmas: быть (AUX 1315, VERB 790), мочь (VERB 590, NOUN 2), нет (VERB 297, PART 69), знать (VERB 248, NOUN 1), стать (VERB 199, NOUN 1), надо (VERB 174, ADP 2), пропасть (VERB 11, NOUN 2), некогда (ADV 4, VERB 4), пора (NOUN 67, VERB 4), пасть (VERB 3, NOUN 2)
The 10 most frequent ambiguous types: есть (VERB 403, AUX 144), нет (VERB 271, PART 46), надо (VERB 156, ADP 1), было (AUX 230, VERB 74, PART 2), быть (AUX 112, VERB 57), был (AUX 181, VERB 33), была (AUX 117, VERB 27), начала (VERB 18, NOUN 15), были (AUX 119, VERB 15), дали (VERB 18, NOUN 3)
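The ambiguity lists above pair each lemma (or type) with the tags it was observed under. A minimal sketch of that grouping follows, assuming the input is one `(lemma, upos)` pair per token; the function name `ambiguous_lemmas` and the sample rows are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(rows):
    """Map each lemma seen with more than one UPOS tag to its (tag, count) list."""
    by_lemma = defaultdict(Counter)
    for lemma, upos in rows:
        by_lemma[lemma][upos] += 1
    # Keep only lemmas observed under at least two different tags,
    # with tags ordered from most to least frequent.
    return {
        lemma: tags.most_common()
        for lemma, tags in by_lemma.items()
        if len(tags) > 1
    }


# Hypothetical token stream, not taken from the treebank.
rows = [("быть", "AUX"), ("быть", "AUX"), ("быть", "VERB"), ("идти", "VERB")]
print(ambiguous_lemmas(rows))  # {'быть': [('AUX', 2), ('VERB', 1)]}
```

The same function works for ambiguous types if word forms are passed in place of lemmas.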
Morphology
The form / lemma ratio of VERB is 2.272415 (the average of all parts of speech is 1.879397).
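The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of the page. This is an observation from the numbers, not a documented definition; the variable names are illustrative.

```python
verb_types = 9935   # distinct VERB word forms reported on this page
verb_lemmas = 4372  # distinct VERB lemmas reported on this page

# Average number of distinct forms per lemma.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 2.272415
```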
The 1st highest number of forms (21) was observed with the lemma “идти”: идем, идет, иди, идите, идти, иду, идут, идущая, идущей, идущие, идущий, идя, идём, идёт, идёшь, шедшая, шел, шла, шли, шло, шёл.
The 2nd highest number of forms (20) was observed with the lemma “иметь”: Имей, имеем, имеет, имееть, имеешь, имееют, имейте, имел, имела, имели, имело, иметь, имею, имеют, имеющее, имеющий, имеющим, имеющих, имея, имут.
The 3rd highest number of forms (20) was observed with the lemma “работать”: работавший, работаем, работает, работаешь, работал, работала, работали, работало, работать, работаю, работают, работающей, работающие, работающии, работающий, работающим, работающих, работая, работют, рботаю.
VERB occurs with 15 features: VerbForm (23646; 96% instances), Voice (23646; 96% instances), Aspect (23625; 95% instances), Number (18408; 74% instances), Tense (17935; 72% instances), Mood (16770; 68% instances), Person (10203; 41% instances), Gender (5885; 24% instances), Case (1011; 4% instances), Variant (641; 3% instances), Polarity (356; 1% instances), Typo (304; 1% instances), Animacy (116; 0% instances), Abbr (24; 0% instances), Foreign (1; 0% instances)
VERB occurs with 35 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass
VERB occurs with 291 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (2928 tokens).
Examples: есть, может, стоит, работает, говорит, отвечает, знает, хочет, хватает, бывает
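Feature and feature-combination counts like those above can be derived from the FEATS column of CoNLL-U, where pairs are written `Name=Value` and joined with `|`, and `_` marks a token without features. A minimal sketch, assuming the input is the list of FEATS strings of the VERB tokens; the function name `feature_stats` and the sample strings are hypothetical.

```python
from collections import Counter


def feature_stats(feats_strings):
    """Return (combination counts, per-feature counts) for a list of FEATS strings."""
    combos = Counter(feats_strings)  # each full FEATS string is one combination
    features = Counter()
    for feats in feats_strings:
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            name = pair.split("=", 1)[0]
            features[name] += 1
    return combos, features


# Hypothetical FEATS values, not taken from the treebank.
sample = [
    "Aspect=Imp|Mood=Ind|VerbForm=Fin",
    "Aspect=Imp|Mood=Ind|VerbForm=Fin",
    "Aspect=Perf|VerbForm=Inf",
    "_",
]
combos, features = feature_stats(sample)
print(combos.most_common(1))  # [('Aspect=Imp|Mood=Ind|VerbForm=Fin', 2)]
print(features["Aspect"], features["Mood"])  # 3 2
```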
Relations
VERB nodes are attached to their parents using 28 different relations: root (9573; 39% instances), conj (5289; 21% instances), parataxis (2171; 9% instances), xcomp (2020; 8% instances), advcl (1504; 6% instances), csubj (1273; 5% instances), acl (963; 4% instances), ccomp (820; 3% instances), acl:relcl (610; 2% instances), amod (318; 1% instances), fixed (59; 0% instances), nmod (29; 0% instances), appos (27; 0% instances), obl (17; 0% instances), obj (14; 0% instances), nsubj (11; 0% instances), flat (9; 0% instances), case (7; 0% instances), vocative (7; 0% instances), list (6; 0% instances), iobj (5; 0% instances), orphan (5; 0% instances), dep (4; 0% instances), csubj:outer (2; 0% instances), reparandum (2; 0% instances), csubj:pass (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances)
Parents of VERB nodes belong to 17 different parts of speech: VERB (10161; 41% instances), [root] (9573; 39% instances), NOUN (2603; 11% instances), ADJ (1314; 5% instances), PRON (465; 2% instances), ADV (183; 1% instances), DET (116; 0% instances), PROPN (105; 0% instances), NUM (71; 0% instances), AUX (51; 0% instances), X (46; 0% instances), PART (41; 0% instances), INTJ (10; 0% instances), CCONJ (4; 0% instances), SCONJ (2; 0% instances), SYM (2; 0% instances), ADP (1; 0% instances)
1143 (5%) VERB nodes are leaves.
3061 (12%) VERB nodes have one child.
5197 (21%) VERB nodes have two children.
15347 (62%) VERB nodes have three or more children.
The highest child degree of a VERB node is 15.
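The four child-degree counts above sum to the 24748 VERB tokens, and the percentages follow from rounding each share. The sketch below checks that arithmetic and shows one way to bucket nodes by number of children within a single sentence. It assumes tokens are given as `(id, upos, head)` triples; the function name `child_degree_buckets` and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_buckets(tokens, upos="VERB"):
    """Bucket nodes of one sentence by child count: 0, 1, 2, or 3 (meaning 3 or more)."""
    n_children = Counter(head for _tok_id, _tok_upos, head in tokens)
    buckets = Counter()
    for tok_id, tok_upos, _head in tokens:
        if tok_upos == upos:
            buckets[min(n_children[tok_id], 3)] += 1
    return buckets


# Hypothetical sentence: token 2 has three children, token 4 has one.
sentence = [(1, "PRON", 2), (2, "VERB", 0), (3, "CCONJ", 4), (4, "VERB", 2), (5, "PUNCT", 2)]
buckets = child_degree_buckets(sentence)

# Consistency check on the figures reported on this page.
reported = {"leaves": 1143, "one child": 3061, "two children": 5197, "three or more": 15347}
total = sum(reported.values())
percentages = {label: round(100 * count / total) for label, count in reported.items()}
print(total)        # 24748
print(percentages)  # {'leaves': 5, 'one child': 12, 'two children': 21, 'three or more': 62}
```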
Children of VERB nodes are attached using 44 different relations: punct (16918; 22% instances), obl (10282; 14% instances), nsubj (9303; 12% instances), advmod (9271; 12% instances), obj (7029; 9% instances), conj (4957; 7% instances), cc (3676; 5% instances), parataxis (2469; 3% instances), xcomp (2322; 3% instances), mark (2232; 3% instances), iobj (1997; 3% instances), advcl (1452; 2% instances), ccomp (1101; 1% instances), csubj (813; 1% instances), nsubj:pass (582; 1% instances), discourse (526; 1% instances), aux (367; 0% instances), vocative (211; 0% instances), aux:pass (152; 0% instances), obl:agent (80; 0% instances), acl (66; 0% instances), cop (55; 0% instances), expl (37; 0% instances), det (36; 0% instances), case (34; 0% instances), orphan (17; 0% instances), dep (11; 0% instances), fixed (11; 0% instances), flat (10; 0% instances), appos (8; 0% instances), nummod (8; 0% instances), dislocated (7; 0% instances), nummod:gov (6; 0% instances), flat:foreign (5; 0% instances), goeswith (5; 0% instances), list (4; 0% instances), nmod (4; 0% instances), reparandum (4; 0% instances), acl:relcl (3; 0% instances), amod (3; 0% instances), nsubj:outer (2; 0% instances), csubj:outer (1; 0% instances), csubj:pass (1; 0% instances), obl:tmod (1; 0% instances)
Children of VERB nodes belong to 17 different parts of speech: NOUN (20391; 27% instances), PUNCT (16918; 22% instances), VERB (10161; 13% instances), PRON (8128; 11% instances), ADV (6434; 8% instances), CCONJ (3673; 5% instances), PART (3538; 5% instances), SCONJ (2111; 3% instances), PROPN (1397; 2% instances), ADJ (1318; 2% instances), AUX (615; 1% instances), DET (436; 1% instances), SYM (347; 0% instances), NUM (275; 0% instances), X (139; 0% instances), ADP (109; 0% instances), INTJ (89; 0% instances)