Treebank Statistics: UD_Hebrew-HTB: POS Tags: ADV
There are 332 ADV
lemmas (3%), 400 ADV
types (2%) and 6551 ADV
tokens (4%).
Out of 15 observed tags, the rank of ADV
is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
The 10 most frequent ADV
lemmas: לא, גם, רק, יותר, _, מה, עוד, כך, ביותר, אתמול
The 10 most frequent ADV
types: לא, גם, רק, יותר, מה, עוד, כך, ביותר, אתמול, כבר
The 10 most frequent ambiguous lemmas: _ (NOUN 365, VERB 326, ADJ 230, ADV 192, AUX 169, CCONJ 109, X 76, PRON 57, SCONJ 46, DET 33), מה (ADV 166, PRON 5), עוד (ADV 144, ADP 4, NOUN 1, PROPN 1), כך (PRON 204, ADV 117, PROPN 35), אפשר (ADV 98, VERB 34), מי (ADV 96, PRON 3), שם (ADV 91, NOUN 87, VERB 18), כן (ADV 71, PRON 3, ADJ 1, NOUN 1), אי (ADV 62, NOUN 21), אף (ADV 58, CCONJ 38, DET 20, NOUN 13)
The 10 most frequent ambiguous types: יותר (ADV 230, VERB 1), עוד (ADV 141, ADP 4, NOUN 1), כך (PRON 204, ADV 117, PROPN 35), מי (ADV 96, NOUN 3), אין (VERB 152, ADV 92, NOUN 2), שם (ADV 91, NOUN 39, VERB 6), אף (ADV 72, CCONJ 38, DET 20, NOUN 12), כן (ADV 72, PRON 51), אי (ADV 62, NOUN 20), יש (VERB 210, ADV 49)
- יותר
- עוד
- כך
- מי
- אין
- שם
- אף
- כן
- אי
- יש
Morphology
The form / lemma ratio of ADV
is 1.204819 (the average of all parts of speech is 1.702584).
The 1st highest number of forms (61) was observed with the lemma “_”: אחר, אין, אל, אם, אסורים, אף, אפשר, ביה, בינוני, בסיבוב, בקלות, בתמים, דו, דומה, הותר, הרי, ו, חזרה, חלילה, טובה, יתרה, כ, כאמור, כדומה, כו, כמו, כן, לאו, להתראות, למחר, לעילא, לראשונה, מיניה, מעוניינות, מעוניינים, מעין, מערבית, מצויין, מתמיד, נא, נאלצים, נהדר, נוסף, נוספים, נכונים, סבור, סבורה, סבורים, סוף, סמוך, עומדת, עין, עצם, עתידה, פלוס, פתע, קל, רגלית, שלא, שלישית, תו.
The 2nd highest number of forms (4) was observed with the lemma “לבד”: לבד, לבדה, לבדו, לבדם.
The 3rd highest number of forms (4) was observed with the lemma “עוד”: עוד, עודו, עודן, עודנה.
ADV
occurs with 4 features: Polarity (1034; 16% instances), PronType (386; 6% instances), Prefix (235; 4% instances), Abbr (1; 0% instances)
ADV
occurs with 4 feature-value pairs: Abbr=Yes
, Polarity=Neg
, Prefix=Yes
, PronType=Int
ADV
occurs with 5 feature combinations.
The most frequent feature combination is _
(4895 tokens).
Examples: גם, רק, יותר, עוד, כך, ביותר, אתמול, כבר, אפשר, שם
Relations
ADV
nodes are attached to their parents using 24 different relations: advmod (5322; 81% instances), compound:affix (235; 4% instances), fixed (230; 4% instances), root (176; 3% instances), conj (106; 2% instances), obl (91; 1% instances), nsubj (79; 1% instances), ccomp (42; 1% instances), advcl (37; 1% instances), obj (37; 1% instances), nmod (36; 1% instances), mark:q (34; 1% instances), acl:relcl (31; 0% instances), case (18; 0% instances), nmod:poss (14; 0% instances), nsubj:cop (14; 0% instances), dep (10; 0% instances), appos (9; 0% instances), acl (8; 0% instances), nsubj:outer (7; 0% instances), amod (6; 0% instances), compound:smixut (5; 0% instances), dislocated (2; 0% instances), parataxis (2; 0% instances)
Parents of ADV
nodes belong to 14 different parts of speech: VERB (3061; 47% instances), NOUN (1171; 18% instances), ADJ (852; 13% instances), ADP (489; 7% instances), ADV (415; 6% instances), (176; 3% instances), PRON (144; 2% instances), PROPN (73; 1% instances), DET (51; 1% instances), AUX (41; 1% instances), NUM (40; 1% instances), CCONJ (25; 0% instances), SCONJ (12; 0% instances), X (1; 0% instances)
5409 (83%) ADV
nodes are leaves.
534 (8%) ADV
nodes have one child.
312 (5%) ADV
nodes have two children.
296 (5%) ADV
nodes have three or more children.
The highest child degree of a ADV
node is 14.
Children of ADV
nodes are attached using 30 different relations: punct (422; 18% instances), fixed (279; 12% instances), advmod (254; 11% instances), dep (175; 7% instances), xcomp (172; 7% instances), case (167; 7% instances), acl:relcl (153; 6% instances), cc (148; 6% instances), obl (125; 5% instances), mark (98; 4% instances), conj (71; 3% instances), advcl (65; 3% instances), nsubj (48; 2% instances), ccomp (30; 1% instances), det (29; 1% instances), cop (28; 1% instances), compound:affix (25; 1% instances), csubj (23; 1% instances), case:gen (20; 1% instances), parataxis (12; 1% instances), case:acc (11; 0% instances), acl (7; 0% instances), obj (7; 0% instances), dislocated (5; 0% instances), nsubj:cop (5; 0% instances), mark:q (3; 0% instances), appos (2; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), nsubj:outer (1; 0% instances)
Children of ADV
nodes belong to 15 different parts of speech: VERB (484; 20% instances), PUNCT (422; 18% instances), ADV (415; 17% instances), ADP (291; 12% instances), NOUN (204; 9% instances), CCONJ (188; 8% instances), SCONJ (118; 5% instances), PRON (74; 3% instances), ADJ (73; 3% instances), DET (59; 2% instances), AUX (30; 1% instances), PROPN (16; 1% instances), NUM (9; 0% instances), X (3; 0% instances), INTJ (1; 0% instances)