Treebank Statistics: UD_Hebrew-IAHLTknesset: POS Tags: ADV
There are 273 ADV
lemmas (5%), 278 ADV
types (3%) and 4292 ADV
tokens (6%).
Out of 16 observed tags, the rank of ADV
is: 5 in number of lemmas, 5 in number of types and 7 in number of tokens.
The 10 most frequent ADV
lemmas: לא, גם, מאוד, פה, אז, כאן, היום, רק, יותר, עכשיו
The 10 most frequent ADV
types: לא, גם, מאוד, פה, אז, כאן, היום, רק, יותר, עכשיו
The 10 most frequent ambiguous lemmas: לא (ADV 978, DET 1, INTJ 1), פה (ADV 137, NOUN 25), היום (ADV 114, NOUN 1), כך (ADV 71, PRON 70), שם (ADV 68, NOUN 24, VERB 20, PROPN 5, ADP 1), כן (ADV 47, PRON 11), עוד (DET 48, ADV 47, ADP 1), איך (ADV 39, PRON 1), האם (ADV 29, SCONJ 10), יחד (ADV 27, ADP 2)
The 10 most frequent ambiguous types: לא (ADV 978, DET 1, INTJ 1), פה (ADV 137, NOUN 7), היום (ADV 114, NOUN 1), כך (ADV 71, PRON 70), שם (ADV 68, NOUN 17, PROPN 5, VERB 2), כן (ADV 47, PRON 11), עוד (DET 48, ADV 47, ADP 1), איך (ADV 39, PRON 1), האם (ADV 29, SCONJ 10), יחד (ADV 27, ADP 2)
- לא
- פה
- היום
- כך
- שם
- כן
- עוד
- DET 48: אני אשמח פה לקבל עוד פרטים ו באמת לסייע איפה ש צריך .
- ADV 47: עוד דקה , אדונ י .
- ADP 1: דיברתי עם נשיא ה אוניברסיטה , ו הוא אומר ל י : אני מציע ש הם יישארו כאן , כי אם הם ילכו ל - 14 ימי סגר ו בידוד ב ישראל , כש הם יחזרו ל כאן יצטרכו עוד 14 ימי בידוד , ו זה יכול לפגוע ב לימודים , מה עוד ש מערכת ה בריאות ב ירדן היא מערכת טובה .
- איך
- האם
- יחד
Morphology
The form / lemma ratio of ADV
is 1.018315 (the average of all parts of speech is 1.545540).
The 1st highest number of forms (2) was observed with the lemma “די”: די, דיי.
The 2nd highest number of forms (2) was observed with the lemma “הנה”: הינה, הנה.
The 3rd highest number of forms (2) was observed with the lemma “כול”: כולי, כל.
ADV
occurs with 4 features: Polarity (994; 23% instances), PronType (159; 4% instances), Prefix (62; 1% instances), Typo (2; 0% instances)
ADV
occurs with 5 feature-value pairs: Polarity=Neg
, Prefix=Yes
, PronType=Int
, PronType=Rel
, Typo=Yes
ADV
occurs with 7 feature combinations.
The most frequent feature combination is _
(3079 tokens).
Examples: גם, מאוד, פה, אז, כאן, היום, רק, יותר, עכשיו, כך
Relations
ADV
nodes are attached to their parents using 26 different relations: advmod (3797; 88% instances), conj (72; 2% instances), compound:affix (66; 2% instances), fixed (62; 1% instances), obl (56; 1% instances), root (56; 1% instances), mark (43; 1% instances), case (37; 1% instances), parataxis (23; 1% instances), obj (19; 0% instances), ccomp (9; 0% instances), acl:relcl (8; 0% instances), advcl (8; 0% instances), nmod (7; 0% instances), appos (5; 0% instances), compound (5; 0% instances), nsubj (4; 0% instances), discourse (3; 0% instances), obl:unmarked (3; 0% instances), csubj (2; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), cc (1; 0% instances), dep (1; 0% instances), nmod:poss (1; 0% instances), xcomp (1; 0% instances)
Parents of ADV
nodes belong to 15 different parts of speech: VERB (2382; 55% instances), NOUN (784; 18% instances), ADJ (569; 13% instances), ADV (234; 5% instances), PRON (106; 2% instances), PROPN (60; 1% instances), (56; 1% instances), DET (51; 1% instances), NUM (35; 1% instances), AUX (7; 0% instances), ADP (3; 0% instances), SCONJ (2; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)
3566 (83%) ADV
nodes are leaves.
510 (12%) ADV
nodes have one child.
119 (3%) ADV
nodes have two children.
97 (2%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 6.
Children of ADV
nodes are attached using 28 different relations: punct (387; 36% instances), advmod (170; 16% instances), fixed (103; 9% instances), case (88; 8% instances), cc (73; 7% instances), conj (43; 4% instances), nsubj (41; 4% instances), acl:relcl (29; 3% instances), advcl (23; 2% instances), parataxis (22; 2% instances), mark (20; 2% instances), obl (19; 2% instances), csubj (12; 1% instances), nmod (12; 1% instances), cop (8; 1% instances), acl (7; 1% instances), appos (6; 1% instances), vocative (6; 1% instances), det (4; 0% instances), obl:unmarked (3; 0% instances), orphan (3; 0% instances), compound (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), compound:affix (1; 0% instances), dep (1; 0% instances), nmod:unmarked (1; 0% instances), obj (1; 0% instances)
Children of ADV
nodes belong to 12 different parts of speech: PUNCT (387; 36% instances), ADV (234; 22% instances), ADP (135; 12% instances), CCONJ (75; 7% instances), VERB (73; 7% instances), NOUN (71; 7% instances), PRON (48; 4% instances), SCONJ (28; 3% instances), DET (16; 1% instances), ADJ (10; 1% instances), PROPN (6; 1% instances), AUX (5; 0% instances)