home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTwiki: POS Tags: VERB

There are 1446 VERB lemmas (13%), 3680 VERB types (23%) and 10645 VERB tokens (8%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: יש, כלל, ניתן, החל, הגיע, היה, כתב, יצא, קיבל, נמצא

The 10 most frequent VERB types: יש, ניתן, כתב, אין, הלחין, יצא, זכה, החל, היו, החלו

The 10 most frequent ambiguous lemmas: כלל (VERB 144, NOUN 54, DET 11, ADV 8), ניתן (VERB 138, ADJ 2), החל (VERB 128, ADP 22), הגיע (VERB 118, NOUN 1), היה (AUX 580, VERB 117), כתב (VERB 117, NOUN 40), יצא (VERB 103, ADJ 1), נמצא (VERB 90, ADJ 2), גרם (VERB 87, NOUN 1), זכה (VERB 84, NOUN 4)

The 10 most frequent ambiguous types: ניתן (VERB 111, ADJ 2), כתב (VERB 79, NOUN 30), אין (VERB 65, ADV 11, ADJ 1, PROPN 1), החל (VERB 51, ADP 22), היו (AUX 136, VERB 41), כולל (VERB 40, ADJ 6), כלל (NOUN 42, VERB 37, DET 11, ADV 8), נמצא (VERB 37, ADJ 1), היה (AUX 231, VERB 31), הגיע (VERB 27, NOUN 1)

Morphology

The form / lemma ratio of VERB is 2.544952 (the average of all parts of speech is 1.479807).

The 1st highest number of forms (15) was observed with the lemma “בא”: בָּאוּ, בא, באה, באו, באת, בואו, בואי, יָבוֹא, יבוא, יבואו, לבוא, נבוא, תָבוֹא, תבוא, תבואי.

The 2nd highest number of forms (15) was observed with the lemma “נתן”: ייתן, ייתנו, ליתן, לתת, נותן, נותנים, נותנת, ניתן, נתון, נתונה, נתונות, נתן, נתנה, נתנו, נתתם.

The 3rd highest number of forms (14) was observed with the lemma “אמר”: ַיֹּאמֶר, אומר, אומרים, אומרת, אמור, אמורה, אמורות, אמר, אמרה, אמרו, אמרתי, לֵאמֹר, לומר, תאמר.

VERB occurs with 12 features: HebBinyan (10273; 97% instances), Voice (10129; 95% instances), Number (8870; 83% instances), Person (8832; 83% instances), Gender (8830; 83% instances), Tense (8745; 82% instances), VerbForm (4133; 39% instances), Polarity (312; 3% instances), VerbType (65; 1% instances), Aspect (47; 0% instances), Mood (34; 0% instances), Typo (27; 0% instances)

VERB occurs with 30 feature-value pairs: Aspect=Prog, Gender=Fem, Gender=Fem,Masc, Gender=Masc, HebBinyan=HIFIL, HebBinyan=HITPAEL, HebBinyan=HUFAL, HebBinyan=NIFAL, HebBinyan=NITPAEL, HebBinyan=PAAL, HebBinyan=PIEL, HebBinyan=PUAL, Mood=Imp, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Inf, VerbForm=Part, VerbType=Mod, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 424 feature combinations. The most frequent feature combination is Gender=Masc|HebBinyan=PAAL|Number=Sing|Person=3|Tense=Past|Voice=Act (784 tokens). Examples: כתב, יצא, זכה, כלל, אמר, טען, שר, עלה, קבע, עמד

Relations

VERB nodes are attached to their parents using 25 different relations: root (3860; 36% instances), acl:relcl (2524; 24% instances), conj (1770; 17% instances), xcomp (587; 6% instances), advcl (564; 5% instances), ccomp (291; 3% instances), acl (279; 3% instances), parataxis (241; 2% instances), csubj (190; 2% instances), appos (71; 1% instances), csubj:pass (71; 1% instances), obl (48; 0% instances), amod (34; 0% instances), case (32; 0% instances), nsubj (16; 0% instances), nmod (15; 0% instances), compound (13; 0% instances), fixed (13; 0% instances), dep (10; 0% instances), obj (9; 0% instances), nmod:poss (2; 0% instances), orphan (2; 0% instances), discourse (1; 0% instances), mark (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: (3860; 36% instances), VERB (3295; 31% instances), NOUN (2864; 27% instances), PROPN (229; 2% instances), ADJ (196; 2% instances), PRON (127; 1% instances), ADV (28; 0% instances), NUM (16; 0% instances), ADP (10; 0% instances), X (6; 0% instances), AUX (4; 0% instances), SCONJ (4; 0% instances), DET (2; 0% instances), SYM (2; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)

101 (1%) VERB nodes are leaves.

738 (7%) VERB nodes have one child.

2080 (20%) VERB nodes have two children.

7726 (73%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 37 different relations: obl (9941; 27% instances), punct (6620; 18% instances), nsubj (4675; 13% instances), mark (3449; 9% instances), obj (3018; 8% instances), conj (1774; 5% instances), cc (1668; 5% instances), advmod (1368; 4% instances), nsubj:pass (857; 2% instances), xcomp (713; 2% instances), advcl (544; 1% instances), ccomp (365; 1% instances), aux (356; 1% instances), parataxis (261; 1% instances), csubj (135; 0% instances), case (126; 0% instances), obl:unmarked (123; 0% instances), csubj:pass (82; 0% instances), nsubj:outer (78; 0% instances), cop (77; 0% instances), nmod (37; 0% instances), dep (36; 0% instances), det (22; 0% instances), acl:relcl (21; 0% instances), dislocated (13; 0% instances), compound (12; 0% instances), fixed (6; 0% instances), reparandum (6; 0% instances), appos (4; 0% instances), nummod (4; 0% instances), nmod:poss (3; 0% instances), nmod:unmarked (3; 0% instances), discourse (2; 0% instances), orphan (2; 0% instances), compound:affix (1; 0% instances), csubj:outer (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (14726; 40% instances), PUNCT (6620; 18% instances), SCONJ (3323; 9% instances), VERB (3295; 9% instances), PROPN (2227; 6% instances), CCONJ (1681; 5% instances), PRON (1603; 4% instances), ADV (1382; 4% instances), NUM (448; 1% instances), AUX (408; 1% instances), ADJ (334; 1% instances), ADP (281; 1% instances), DET (32; 0% instances), SYM (27; 0% instances), X (16; 0% instances), INTJ (1; 0% instances)