Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: VERB
There are 2293 VERB
lemmas (26%), 5496 VERB
types (30%) and 10180 VERB
tokens (15%).
Out of 17 observed tags, the rank of VERB
is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
The 10 most frequent VERB
lemmas: galėti, turėti, būti, nustatyti, nebūti, pateikti, skirti, siekti, teikti, reikėti
The 10 most frequent VERB
types: gali, turi, yra, galima, nėra, reikia, nustatyta, pateikti, siekiant, nebuvo
The 10 most frequent ambiguous lemmas: būti (AUX 684, VERB 225), prašyti (VERB 10, ADV 1), imtis (NOUN 7, VERB 7), skirtis (VERB 6, NOUN 1), susirgti (VERB 3, NOUN 1), mėgstamas (ADJ 1, VERB 1), neatskiriamas (ADJ 1, VERB 1), rungtis (NOUN 1, VERB 1), viltis (NOUN 6, VERB 1)
The 10 most frequent ambiguous types: yra (AUX 269, VERB 78, X 1), buvo (AUX 149, VERB 28), būti (AUX 72, VERB 23), būtų (AUX 69, VERB 10), esi (VERB 7, AUX 5), mano (PRON 52, VERB 9), neįmanoma (VERB 8, ADJ 4), bus (AUX 47, VERB 6), būna (AUX 6, VERB 5), kuria (DET 10, VERB 6)
- yra
- buvo
- būti
- būtų
- esi
- mano
- neįmanoma
- bus
- būna
- kuria
Morphology
The form / lemma ratio of VERB
is 2.396860 (the average of all parts of speech is 2.065341).
The 1st highest number of forms (42) was observed with the lemma “būti”: bus, buvo, buvusieji, buvusio, buvusiuoju, buvusius, buvusių, buvę, buvęs, būdamas, būdami, būdavo, būkite, būna, būnu, būsimam, būsimą, būsite, būti, būtina, būtų, esama, esame, esamos, esamą, esamų, esant, esanti, esantis, esantys, esančiais, esančias, esančio, esančioje, esančiomis, esančios, esančių, esate, esi, esu, esąs, yra.
The 2nd highest number of forms (37) was observed with the lemma “turėti”: TURĖJUSIŲ, Turėjom, Turėsite, turi, turimas, turime, turimomis, turimos, turimus, turimų, turint, turintis, turintys, turintį, turinčiam, turinčioms, turinčios, turinčius, turinčią, turinčių, turinčiųjų, turite, turiu, turėdami, turėdavai, turėdavo, turėjau, turėjo, turėjęs, turėkite, turės, turėsiu, turėti, turėtum, turėtume, turėtumėte, turėtų.
The 3rd highest number of forms (26) was observed with the lemma “atlikti”: atlieka, atliekama, atliekami, atliekamos, atliekamų, atliekant, atliekančiu, atliekančių, atliekate, atlieki, atlikdami, atlikite, atliko, atlikote, atliks, atlikta, atliktais, atliktas, atlikti, atliktos, atliktą, atlikus, atlikusi, atlikusios, atlikę, atlikęs.
VERB
occurs with 14 features: Polarity (10179; 100% instances), VerbForm (10177; 100% instances), Tense (7482; 73% instances), Number (6831; 67% instances), Mood (4256; 42% instances), Person (4231; 42% instances), Gender (3471; 34% instances), Definite (3266; 32% instances), Voice (3241; 32% instances), Case (2856; 28% instances), Aspect (1761; 17% instances), Reflex (1248; 12% instances), Hyph (4; 0% instances), Degree (1; 0% instances)
VERB
occurs with 38 feature-value pairs: Aspect=Hab
, Aspect=Perf
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Definite=Def
, Definite=Ind
, Degree=Sup
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Hyph=Yes
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Mood=Nec
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Reflex=Yes
, Tense=Fut
, Tense=Past
, Tense=Pres
, VerbForm=Conv
, VerbForm=Fin
, VerbForm=Ger
, VerbForm=Inf
, VerbForm=Part
, Voice=Act
, Voice=Pass
VERB
occurs with 430 feature combinations.
The most frequent feature combination is Polarity=Pos|VerbForm=Inf
(1651 tokens).
Examples: teikti, pateikti, užtikrinti, būti, sukurti, dirbti, kurti, skatinti, įgyvendinti, padaryti
Relations
VERB
nodes are attached to their parents using 15 different relations: root (2643; 26% instances), acl (1893; 19% instances), conj (1732; 17% instances), advcl (1279; 13% instances), xcomp (1138; 11% instances), csubj (450; 4% instances), ccomp (412; 4% instances), acl:relcl (332; 3% instances), parataxis (247; 2% instances), appos (28; 0% instances), dep (13; 0% instances), csubj:pass (5; 0% instances), flat (3; 0% instances), obj (3; 0% instances), nsubj (2; 0% instances)
Parents of VERB
nodes belong to 13 different parts of speech: VERB (4294; 42% instances), (2643; 26% instances), NOUN (2405; 24% instances), ADJ (481; 5% instances), ADV (153; 2% instances), DET (105; 1% instances), X (30; 0% instances), PRON (29; 0% instances), PROPN (22; 0% instances), NUM (13; 0% instances), PART (3; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
974 (10%) VERB
nodes are leaves.
1919 (19%) VERB
nodes have one child.
1894 (19%) VERB
nodes have two children.
5393 (53%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 12.
Children of VERB
nodes are attached using 29 different relations: punct (6782; 24% instances), obl:arg (3074; 11% instances), nsubj (2773; 10% instances), obj (2512; 9% instances), obl (2030; 7% instances), advmod (1956; 7% instances), conj (1681; 6% instances), cc (1089; 4% instances), xcomp (1079; 4% instances), advcl (1007; 4% instances), mark (995; 4% instances), nsubj:pass (470; 2% instances), parataxis (426; 2% instances), advmod:emph (419; 2% instances), ccomp (415; 1% instances), cop (342; 1% instances), nummod (241; 1% instances), csubj (208; 1% instances), dep (62; 0% instances), nmod (52; 0% instances), acl (20; 0% instances), amod (18; 0% instances), det (12; 0% instances), appos (6; 0% instances), csubj:pass (6; 0% instances), acl:relcl (4; 0% instances), iobj (4; 0% instances), flat (3; 0% instances), discourse (2; 0% instances)
Children of VERB
nodes belong to 17 different parts of speech: NOUN (8840; 32% instances), PUNCT (6782; 24% instances), VERB (4294; 16% instances), ADV (2016; 7% instances), PRON (1151; 4% instances), CCONJ (1108; 4% instances), SCONJ (951; 3% instances), DET (590; 2% instances), PART (486; 2% instances), NUM (402; 1% instances), AUX (342; 1% instances), PROPN (264; 1% instances), ADJ (238; 1% instances), X (182; 1% instances), ADP (25; 0% instances), INTJ (12; 0% instances), SYM (5; 0% instances)