Treebank Statistics: UD_Lithuanian-HSE: POS Tags: VERB
There are 360 VERB
lemmas (22%), 584 VERB
types (24%) and 708 VERB
tokens (13%).
Out of 16 observed tags, the rank of VERB
is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
The 10 most frequent VERB
lemmas: galėti, būti, žinoti, turėti, sakyti, laikyti, vadinti, kalbėti, bandyti, gaminti
The 10 most frequent VERB
types: gali, turi, negali, vadinami, žinoma, būti, galima, nėra, sako, žino
The 10 most frequent ambiguous lemmas: būti (AUX 112, VERB 20), įprastas (ADJ 1, VERB 1)
The 10 most frequent ambiguous types: būti (AUX 4, VERB 4), nėra (AUX 11, VERB 3), kalba (VERB 3, NOUN 2), būna (AUX 3, VERB 2), būtų (AUX 9, VERB 2), yra (AUX 30, VERB 2), nebuvo (AUX 3, VERB 1), nebūtų (AUX 4, VERB 1)
- būti
- nėra
- kalba
- VERB 3: Filosofas Vytautas Radžvilas kalba apie globalistinę indoktrinaciją , smegenų plovimą , apie eurokolaboravimą .
- NOUN 2: Tuo tarpu tautai kaip tokiai , bent po Stalino epochos , didelio pavojaus nebuvo – tą neginčijamai įrodo faktas , kad tauta ir kalba išliko neišnykusios , nes nesumažėjusios per penkiasdešimt su viršum metų .
- būna
- būtų
- yra
- nebuvo
- nebūtų
Morphology
The form / lemma ratio of VERB
is 1.622222 (the average of all parts of speech is 1.442977).
The 1st highest number of forms (12) was observed with the lemma “būti”: Būtina, Nesame, buvęs, būna, būti, būtų, esama, esantį, nebuvo, nebūtų, nėra, yra.
The 2nd highest number of forms (11) was observed with the lemma “galėti”: gali, galima, galimas, galime, galiu, galėjo, galėsi, galėtų, negali, negalima, negalėjo.
The 3rd highest number of forms (10) was observed with the lemma “sakyti”: Sakiau, nesakoma, sakant, sako, sakydavome, sakysi, sakysiu, sakyti, sakyčiau, sakė.
VERB
occurs with 13 features: Polarity (707; 100% instances), VerbForm (707; 100% instances), Voice (698; 99% instances), Number (538; 76% instances), Tense (523; 74% instances), Mood (393; 56% instances), Person (384; 54% instances), Gender (178; 25% instances), Definite (168; 24% instances), Case (146; 21% instances), Reflex (53; 7% instances), Aspect (38; 5% instances), Variant (12; 2% instances)
VERB
occurs with 36 feature-value pairs: Aspect=Hab
, Aspect=Perf
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Mood=Nec
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Reflex=Yes
, Tense=Fut
, Tense=Past
, Tense=Pres
, Variant=Short
, VerbForm=Conv
, VerbForm=Fin
, VerbForm=Ger
, VerbForm=Inf
, VerbForm=Part
, Voice=Act
, Voice=Pass
VERB
occurs with 148 feature combinations.
The most frequent feature combination is Polarity=Pos|VerbForm=Inf|Voice=Act
(116 tokens).
Examples: būti, mylėti, turėti, vadinti, atsirasti, diskriminuoti, gaminti, padaryti, pagaminti, remti
Relations
VERB
nodes are attached to their parents using 16 different relations: root (187; 26% instances), conj (120; 17% instances), xcomp (109; 15% instances), parataxis (66; 9% instances), advcl (61; 9% instances), acl:relcl (42; 6% instances), acl (39; 6% instances), ccomp (38; 5% instances), amod (29; 4% instances), csubj (7; 1% instances), fixed (3; 0% instances), appos (2; 0% instances), nmod (2; 0% instances), iobj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances)
Parents of VERB
nodes belong to 11 different parts of speech: VERB (308; 44% instances), (187; 26% instances), NOUN (146; 21% instances), ADJ (39; 6% instances), PROPN (8; 1% instances), ADV (7; 1% instances), PRON (5; 1% instances), DET (3; 0% instances), PART (3; 0% instances), AUX (1; 0% instances), X (1; 0% instances)
43 (6%) VERB
nodes are leaves.
109 (15%) VERB
nodes have one child.
127 (18%) VERB
nodes have two children.
429 (61%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 9.
Children of VERB
nodes are attached using 29 different relations: punct (513; 24% instances), nsubj (275; 13% instances), obj (244; 11% instances), obl (215; 10% instances), advmod (165; 8% instances), conj (118; 5% instances), cc (103; 5% instances), xcomp (93; 4% instances), mark (92; 4% instances), parataxis (89; 4% instances), iobj (62; 3% instances), advcl (42; 2% instances), ccomp (42; 2% instances), aux (26; 1% instances), advmod:emph (25; 1% instances), nmod (10; 0% instances), obl:agent (7; 0% instances), cop (5; 0% instances), amod (4; 0% instances), acl:relcl (3; 0% instances), acl (2; 0% instances), aux:pass (2; 0% instances), case (2; 0% instances), csubj (2; 0% instances), discourse (2; 0% instances), vocative (2; 0% instances), appos (1; 0% instances), det (1; 0% instances), nsubj:outer (1; 0% instances)
Children of VERB
nodes belong to 15 different parts of speech: NOUN (527; 25% instances), PUNCT (513; 24% instances), VERB (308; 14% instances), ADV (186; 9% instances), PRON (177; 8% instances), CCONJ (101; 5% instances), PROPN (93; 4% instances), SCONJ (88; 4% instances), PART (51; 2% instances), ADJ (44; 2% instances), AUX (33; 2% instances), ADP (9; 0% instances), DET (9; 0% instances), INTJ (6; 0% instances), X (3; 0% instances)