home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bengali-BRU: POS Tags: VERB

There are 27 VERB lemmas (23%), 48 VERB types (32%) and 65 VERB tokens (20%). Out of 14 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: খাওয়া, করা, দেখা, ধোয়া, পড়া, যাওয়া, হওয়া, আঁকা, আসা, গাওয়া

The 10 most frequent VERB types: আঁকতে, আসে, খেতে, খেয়েছ, গাইতে, ঘুরতে, চল, জানো, ধুয়ে, ধোবো

The 10 most frequent ambiguous lemmas: পড়া (VERB 4, NOUN 1), হওয়া (VERB 3, AUX 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of VERB is 1.777778 (the average of all parts of speech is 1.290598).

The 1st highest number of forms (6) was observed with the lemma “করা”: কর, করবে, করি, করে, করেছ, করেছি.

The 2nd highest number of forms (6) was observed with the lemma “খাওয়া”: খাও, খাব, খেতে, খেয়ে, খেয়েছ, খেলে.

The 3rd highest number of forms (4) was observed with the lemma “দেখা”: দেখ, দেখব, দেখি, দেখেছি.

VERB occurs with 6 features: VerbForm (65; 100% instances), Mood (45; 69% instances), Tense (44; 68% instances), Person (42; 65% instances), Aspect (27; 42% instances), Voice (1; 2% instances)

VERB occurs with 15 feature-value pairs: Aspect=Imp, Aspect=Perf, Mood=Cnd, Mood=Imp, Mood=Ind, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 18 feature combinations. The most frequent feature combination is Aspect=Imp|VerbForm=Part (11 tokens). Examples: আঁকতে, খেতে, গাইতে, ঘুরতে, পড়তে, পরতে

Relations

VERB nodes are attached to their parents using 8 different relations: root (41; 63% instances), compound (12; 18% instances), advcl (4; 6% instances), xcomp (3; 5% instances), conj (2; 3% instances), acl (1; 2% instances), acl:relcl (1; 2% instances), parataxis (1; 2% instances)

Parents of VERB nodes belong to 4 different parts of speech: (41; 63% instances), VERB (22; 34% instances), NOUN (1; 2% instances), PRON (1; 2% instances)

12 (18%) VERB nodes are leaves.

4 (6%) VERB nodes have one child.

12 (18%) VERB nodes have two children.

37 (57%) VERB nodes have three or more children.

The highest child degree of a VERB node is 7.

Children of VERB nodes are attached using 21 different relations: punct (50; 28% instances), obj (29; 16% instances), nsubj (27; 15% instances), advmod (18; 10% instances), compound (12; 7% instances), obl (7; 4% instances), aux (6; 3% instances), advcl (4; 2% instances), discourse (4; 2% instances), compound:lvc (3; 2% instances), iobj (3; 2% instances), mark (3; 2% instances), xcomp (3; 2% instances), conj (2; 1% instances), det (2; 1% instances), vocative (2; 1% instances), acl (1; 1% instances), ccomp (1; 1% instances), nmod:poss (1; 1% instances), nsubj:pass (1; 1% instances), parataxis (1; 1% instances)

Children of VERB nodes belong to 12 different parts of speech: PUNCT (50; 28% instances), NOUN (42; 23% instances), PRON (31; 17% instances), VERB (22; 12% instances), ADV (10; 6% instances), PART (9; 5% instances), AUX (6; 3% instances), INTJ (4; 2% instances), DET (2; 1% instances), SCONJ (2; 1% instances), ADJ (1; 1% instances), NUM (1; 1% instances)