Treebank Statistics: UD_Croatian-SET: POS Tags: VERB
There are 2177 VERB
lemmas (11%), 5442 VERB
types (15%) and 17387 VERB
tokens (9%).
Out of 17 observed tags, the rank of VERB
is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent VERB
lemmas: moći, imati, kazati, trebati, reći, morati, izjaviti, raditi, postojati, željeti
The 10 most frequent VERB
types: može, izjavio, ima, rekao, kaže, kazao, treba, mogu, nema, mora
The 10 most frequent ambiguous lemmas: moći (VERB 703, NOUN 1), imati (VERB 473, ADV 6), kazati (VERB 408, ADV 4, ADJ 1), reći (VERB 258, ADV 6, ADJ 4), izjaviti (VERB 242, ADV 1), raditi (VERB 196, ADJ 1, ADV 1), željeti (VERB 154, ADV 1), dobiti (VERB 153, ADJ 1), očekivati (VERB 123, ADJ 5, ADV 1), smatrati (VERB 115, ADJ 1, ADV 1)
The 10 most frequent ambiguous types: mora (VERB 94, NOUN 20), radi (VERB 65, ADP 15, ADV 1), nalazi (VERB 42, NOUN 1), pomoći (VERB 38, NOUN 26), tvrdi (VERB 37, ADJ 3), nalaze (VERB 36, NOUN 1), iznosi (VERB 34, NOUN 1), dobiti (VERB 31, NOUN 10), koristi (VERB 29, NOUN 10), postaje (VERB 25, NOUN 8)
- mora
- radi
- VERB 65: Za razliku od konkurenata , iPad radi na operativnom sustavu iOS 3.2.2 .
- ADP 15: Glasači će se 12. prosinca uputiti na birališta radi prijevremenih izbora .
- ADV 1: Za sljedeću godinu planira se koncentracija na uvođenje novih funkcionalnosti u okviru postojećih usluga i općenito poboljšanje postojećih usluga radi njihova daljnjeg prilagođavanja zahtjevima korisnika .
- nalazi
- pomoći
- tvrdi
- nalaze
- VERB 36: Pederi se brinu o nabavi drva za zimu , nalaze nekoga tko će ta drva iscijepati i posložiti .
- NOUN 1: S obzirom na nova saznanja , tehnološki razvoj i zakonske zahtjeve , kontinuirano se provode dodatne kontrole sustava ( i potrebne aktivnosti s obzirom na nalaze ) i modernizacija sustava u cilju povećanja sigurnosti rada , smanjenja rizika i zaštite okoliša .
- iznosi
- dobiti
- koristi
- postaje
Morphology
The form / lemma ratio of VERB
is 2.499770 (the average of all parts of speech is 1.848309).
The 1st highest number of forms (13) was observed with the lemma “imati”: ima, imaj, imaju, imala, imale, imali, imalo, imam, imamo, imao, imat, imate, imati.
The 2nd highest number of forms (13) was observed with the lemma “moći”: Nemojmo, mogao, mogla, mogle, mogli, moglo, mogu, moći, može, možemo, možete, možeš, nemojte.
The 3rd highest number of forms (13) was observed with the lemma “raditi”: rade, radi, radila, radile, radili, radilo, radim, radimo, radio, radit, radite, raditi, radiš.
VERB
occurs with 7 features: VerbForm (17387; 100% instances), Number (14004; 81% instances), Tense (13785; 79% instances), Mood (7914; 46% instances), Person (7914; 46% instances), Gender (6090; 35% instances), Voice (6090; 35% instances)
VERB
occurs with 16 feature-value pairs: Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Imp
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, Voice=Act
VERB
occurs with 17 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin
(4487 tokens).
Examples: može, ima, kaže, treba, nema, mora, postoji, radi, očekuje, navodi
Relations
VERB
nodes are attached to their parents using 23 different relations: root (6300; 36% instances), acl (2647; 15% instances), conj (2394; 14% instances), xcomp (1788; 10% instances), advcl (1362; 8% instances), ccomp (1244; 7% instances), parataxis (1220; 7% instances), csubj (279; 2% instances), nsubj (78; 0% instances), appos (16; 0% instances), obj (12; 0% instances), amod (10; 0% instances), nmod (9; 0% instances), flat (7; 0% instances), obl (5; 0% instances), list (4; 0% instances), case (2; 0% instances), dep (2; 0% instances), discourse (2; 0% instances), iobj (2; 0% instances), orphan (2; 0% instances), flat:foreign (1; 0% instances), vocative (1; 0% instances)
Parents of VERB
nodes belong to 16 different parts of speech: VERB (6506; 37% instances), (6300; 36% instances), NOUN (2764; 16% instances), ADJ (936; 5% instances), ADV (284; 2% instances), DET (220; 1% instances), PROPN (147; 1% instances), AUX (144; 1% instances), PRON (55; 0% instances), NUM (15; 0% instances), PART (5; 0% instances), X (5; 0% instances), ADP (2; 0% instances), SCONJ (2; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances)
237 (1%) VERB
nodes are leaves.
1247 (7%) VERB
nodes have one child.
2210 (13%) VERB
nodes have two children.
13693 (79%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 11.
Children of VERB
nodes are attached using 34 different relations: punct (11296; 17% instances), nsubj (10048; 15% instances), obl (9346; 14% instances), obj (8095; 12% instances), aux (6470; 10% instances), advmod (3965; 6% instances), mark (2996; 4% instances), xcomp (2538; 4% instances), conj (2482; 4% instances), expl (2327; 3% instances), cc (2278; 3% instances), ccomp (1551; 2% instances), advcl (1279; 2% instances), parataxis (1138; 2% instances), iobj (596; 1% instances), discourse (530; 1% instances), csubj (131; 0% instances), amod (88; 0% instances), case (39; 0% instances), nmod (25; 0% instances), vocative (17; 0% instances), acl (15; 0% instances), cop (14; 0% instances), compound (10; 0% instances), appos (5; 0% instances), flat (5; 0% instances), list (5; 0% instances), dep (4; 0% instances), det:numgov (4; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), fixed (3; 0% instances), nummod (3; 0% instances), nummod:gov (2; 0% instances)
Children of VERB
nodes belong to 17 different parts of speech: NOUN (20446; 30% instances), PUNCT (11296; 17% instances), AUX (6800; 10% instances), VERB (6506; 10% instances), PRON (4561; 7% instances), ADV (4132; 6% instances), PROPN (3161; 5% instances), SCONJ (2922; 4% instances), DET (2689; 4% instances), CCONJ (2235; 3% instances), ADJ (1282; 2% instances), PART (945; 1% instances), NUM (139; 0% instances), ADP (107; 0% instances), X (57; 0% instances), SYM (27; 0% instances), INTJ (7; 0% instances)