Statistics of NUM in UD_Hebrew-IAHLTwiki

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Hebrew-IAHLTwiki: POS Tags: `NUM`

There are 557 NUM lemmas (5%), 576 NUM types (4%) and 3127 NUM tokens (2%). Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: אחת, שתיים, שלוש, 2, 1, ארבע, מיליון, מאה, עשר, אלף

The 10 most frequent NUM types: אחד, שני, שתי, 2, אחת, 1, מיליון, שלושה, שלוש, 2017

The 10 most frequent ambiguous lemmas: אחת (NUM 193, NOUN 6, ADJ 1), שתיים (NUM 187, NOUN 2, ADJ 1), שלוש (NUM 98, ADJ 1), 1 (NUM 53, ADJ 2), ארבע (NUM 52, PROPN 2, ADJ 1), מאה (NOUN 60, NUM 47), עשר (NUM 37, ADJ 1), חמש (NUM 30, ADJ 1), 15 (NUM 29, ADJ 3), 20 (NUM 29, ADJ 17)

The 10 most frequent ambiguous types: אחד (NUM 124, NOUN 5, ADJ 2), שני (NUM 120, ADJ 58, PROPN 2), אחת (NUM 60, ADJ 1, NOUN 1), 1 (NUM 53, ADJ 2), שלוש (NUM 43, ADJ 1), 15 (NUM 29, ADJ 2), 20 (NUM 29, ADJ 17), 3 (NUM 29, ADJ 1), 5 (NUM 29, ADJ 2), מאה (NOUN 59, NUM 27)

אחד
- NUM 124: מדליון זה הוא ה יחיד ה מציג זוג ציפורים יחד ב מדליון אחד .
- NOUN 5: אחד מ מערכוני ו ב תוכנית ה תיאטרון ש ביים גדעון שמר היה פרודיה על להקת ה נח”ל .
- ADJ 2: ב כל סוגי ה דיאליזה דם נמצא מ ציד ה ה אחד של ה ממברנה ו נוזל דיאליזה מ ציד ה ה שני .
שני
- NUM 120: נקודת ה אמצע ש בין שני ה שערים הללו , היא ריבית בנק ישראל .
- ADJ 58: פרק שני
- PROPN 2: ה בניין ש מעל מערת ה מכפלה הוא אחד ה מונומנטים ה נאים ביותר ש השתמרו מ תקופת בית שני .
אחת
- NUM 60: אף אחת מ הצעות אלה לא הגיעה ל קריאה ראשונה .
- ADJ 1: ב עשור ה שני של ה מאה ה - 21 , לא ניתן להיכנס ל קפלה ללא אישור , ו ה מעבר ממנ ה ל תחנה ה אחת - עשרה על ה גולגולתא סגור .
- NOUN 1: כך למשל אדם ש מכר שתי מניות , כש על ה אחת הרוויח ו על ה שנייה הפסיד , רשאי לקזז את ה הפסדים מ ה רווחים קודם ל חישוב ה מס .
1
- NUM 53: אונר”א החלה לפעול ב שטח ב ־ 1 ב מאי 1950 , כ סוכנות סעד זמנית .
- ADJ 2: השתתפו ב ה עשרות טנקים סורים של ה דיוויזיה ה - 1 ש הסתייעו גם ב ירי של טילי נ”ט .
שלוש
- NUM 43: “ אזור יהודה ו שומרון “ חולק ל שלוש קטגוריות : .
- ADJ 1: ב ממשלת ישראל ה שלושים ו שלוש
15
- NUM 29: ה טיפול ב מטופלת ה שלישית הצליח ו היא המשיכה לחיות כ - 15 שנה לאחרי ו .
- ADJ 2: ראשית ה של אומנות ה קרמיקה ה ארמנית היא ב מאה ה - 15 ב ערים כגון איזניק או קוטחיה ש ב טורקיה , אולם ה מפגש עם ה אמנות ה ארץ - ישראלית ה עתיקה ו עם ה מוטיבים ה נוצריים יצר סינתזה אמנותית ייחודית .
20
- NUM 29: ב חברון הוא מצא 20 משפחות יהודיות .
- ADJ 17: ה שימוש ב מק”ם החל ב שנות ה - 60 של ה מאה ה - 20 .
3
- NUM 29: סכום ה תוצאות ב מדד זה נע בין 3 ל - 15 ( כמו ב מבוגרים ) .
- ADJ 1: ב לילה תקפה מ מזרח חטיבה ממוכנת סורית מ דיוויזיית ה שריון ה - 3 את ה אגף ה ימני של אוגדה 146 .
5
- NUM 29: פלזמה קפואה היא ה טיפול ה רגיל עבור מחסור ב פקטור 5 ( V ) .
- ADJ 2: כיס זה הפך למעשה ל מגנן נ”ט גדול ש אויש על ידי דיוויזיית ה רגלים ה - 5 ו ה דיוויזיה ה משוריינת ה - 1 .
מאה
- NOUN 59: גרסה זו לגבי מקום ה גולגולתא חדשה יחסית ו התקבעה ב שלהי ה מאה ה - 19 .
- NUM 27: ה נוסע רבי בנימין מ טודלה ביקר ב מערה ב מאה ה - 12 , ו כתב :

Morphology

The form / lemma ratio of NUM is 1.034111 (the average of all parts of speech is 1.479807).

The 1st highest number of forms (5) was observed with the lemma “אחת”: אַחַד, אחד, אחדות, אחדים, אחת.

The 2nd highest number of forms (5) was observed with the lemma “שתיים”: שני, שניים, שנים, שתי, שתיים.

The 3rd highest number of forms (4) was observed with the lemma “עשר”: עשר, עשרה, עשרות, עשרת.

NUM occurs with 6 features: Gender (858; 27% instances), NumType (756; 24% instances), Number (451; 14% instances), Definite (246; 8% instances), Prefix (3; 0% instances), Typo (1; 0% instances)

NUM occurs with 10 feature-value pairs: Definite=Cons, Gender=Fem, Gender=Fem,Masc, Gender=Masc, NumType=Card, Number=Dual, Number=Plur, Number=Sing, Prefix=Yes, Typo=Yes

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (2241 tokens). Examples: 2, 1, 2017, 15, 20, 3, 7, 5, 1948, 4

Relations

NUM nodes are attached to their parents using 24 different relations: nummod (1145; 37% instances), compound (552; 18% instances), obl (382; 12% instances), nmod:unmarked (362; 12% instances), dep (127; 4% instances), nmod (114; 4% instances), conj (80; 3% instances), appos (79; 3% instances), nsubj (58; 2% instances), parataxis (58; 2% instances), flat (53; 2% instances), nmod:poss (30; 1% instances), xcomp (15; 0% instances), obj (13; 0% instances), root (12; 0% instances), list (11; 0% instances), nsubj:pass (11; 0% instances), acl:relcl (7; 0% instances), obl:unmarked (6; 0% instances), compound:affix (4; 0% instances), orphan (3; 0% instances), fixed (2; 0% instances), nsubj:outer (2; 0% instances), amod (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (1853; 59% instances), VERB (448; 14% instances), NUM (352; 11% instances), PROPN (340; 11% instances), SYM (67; 2% instances), ADJ (33; 1% instances), PRON (15; 0% instances), (12; 0% instances), ADV (4; 0% instances), X (3; 0% instances)

1902 (61%) NUM nodes are leaves.

745 (24%) NUM nodes have one child.

265 (8%) NUM nodes have two children.

215 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 25 different relations: case (582; 29% instances), nmod (312; 15% instances), punct (300; 15% instances), det (170; 8% instances), advmod (132; 7% instances), nmod:unmarked (114; 6% instances), nummod (89; 4% instances), conj (75; 4% instances), cc (51; 3% instances), obl (37; 2% instances), compound (33; 2% instances), appos (31; 2% instances), amod (28; 1% instances), nsubj (14; 1% instances), acl:relcl (13; 1% instances), cop (12; 1% instances), orphan (8; 0% instances), mark (4; 0% instances), nmod:poss (4; 0% instances), compound:affix (3; 0% instances), dep (3; 0% instances), parataxis (3; 0% instances), aux (1; 0% instances), list (1; 0% instances), obl:unmarked (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (523; 26% instances), NUM (352; 17% instances), PUNCT (300; 15% instances), PROPN (186; 9% instances), DET (170; 8% instances), NOUN (145; 7% instances), ADV (139; 7% instances), SYM (58; 3% instances), CCONJ (50; 2% instances), ADJ (37; 2% instances), PRON (33; 2% instances), VERB (16; 1% instances), AUX (7; 0% instances), SCONJ (4; 0% instances), X (1; 0% instances)

Treebank Statistics: UD_Hebrew-IAHLTwiki: POS Tags: NUM

Morphology

Relations

Treebank Statistics: UD_Hebrew-IAHLTwiki: POS Tags: `NUM`