home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTknesset: POS Tags: DET

There are 22 DET lemmas (0%), 27 DET types (0%) and 6476 DET tokens (10%). Out of 16 observed tags, the rank of DET is: 11 in number of lemmas, 12 in number of types and 5 in number of tokens.

The 10 most frequent DET lemmas: ה, כול, כמה, הרבה, שום, עוד, אף, איזה, איזשהו, מיני

The 10 most frequent DET types: ה, כל, כול, כמה, הרבה, שום, עוד, אף, איזה, איזשהו

The 10 most frequent ambiguous lemmas: ה (DET 5628, SCONJ 60, NUM 1, PRON 1), כול (DET 499, NOUN 37, ADV 2, ADJ 1), כמה (DET 77, ADV 2, SCONJ 1), הרבה (DET 71, ADV 18, NOUN 1), עוד (DET 48, ADV 47, ADP 1), אף (DET 27, ADV 8, CCONJ 5, NOUN 1), איזה (DET 26, PRON 9, NOUN 1), המון (DET 5, NOUN 2, ADV 1), מדי (ADV 9, DET 5), מספר (NOUN 22, DET 4, VERB 1)

The 10 most frequent ambiguous types: ה (DET 5628, PRON 240, SCONJ 60, ADP 1, NUM 1), כל (DET 418, NOUN 6, ADV 1), כול (DET 81, NOUN 31), כמה (DET 77, ADV 2, SCONJ 1), הרבה (DET 71, ADV 18, NOUN 1), עוד (DET 48, ADV 47, ADP 1), אף (DET 27, ADV 8, CCONJ 5), איזה (DET 19, PRON 3, NOUN 1), איזו (DET 6, PRON 2), מיני (DET 6, NOUN 6)

Morphology

The form / lemma ratio of DET is 1.227273 (the average of all parts of speech is 1.545540).

The 1st highest number of forms (3) was observed with the lemma “איזה”: איזה, איזו, אלו.

The 2nd highest number of forms (3) was observed with the lemma “איזשהו”: איזושהי, איזשהו, איזשהם.

The 3rd highest number of forms (2) was observed with the lemma “כול”: כול, כל.

DET occurs with 5 features: Definite (6324; 98% instances), PronType (6158; 95% instances), Gender (19; 0% instances), Number (18; 0% instances), Typo (2; 0% instances)

DET occurs with 13 feature-value pairs: Definite=Cons, Definite=Def, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PronType=Art, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot, Typo=Yes

DET occurs with 19 feature combinations. The most frequent feature combination is Definite=Def|PronType=Art (5628 tokens). Examples: ה

Relations

DET nodes are attached to their parents using 17 different relations: det (6360; 98% instances), advmod (43; 1% instances), fixed (18; 0% instances), nsubj (14; 0% instances), dep (7; 0% instances), conj (6; 0% instances), compound (5; 0% instances), mark (5; 0% instances), obj (5; 0% instances), obl (5; 0% instances), nmod:poss (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), compound:affix (1; 0% instances), nsubj:pass (1; 0% instances), obl:unmarked (1; 0% instances), parataxis (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (4130; 64% instances), ADJ (1084; 17% instances), PRON (552; 9% instances), PROPN (552; 9% instances), NUM (58; 1% instances), VERB (49; 1% instances), DET (24; 0% instances), ADV (16; 0% instances), ADP (5; 0% instances), SYM (3; 0% instances), X (3; 0% instances)

6355 (98%) DET nodes are leaves.

100 (2%) DET nodes have one child.

17 (0%) DET nodes have two children.

4 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.

Children of DET nodes are attached using 16 different relations: fixed (49; 33% instances), punct (23; 16% instances), det (16; 11% instances), advmod (13; 9% instances), nmod:poss (10; 7% instances), case (8; 5% instances), cc (6; 4% instances), acl:relcl (5; 3% instances), nmod (5; 3% instances), conj (4; 3% instances), amod (2; 1% instances), dep (2; 1% instances), compound (1; 1% instances), mark (1; 1% instances), nsubj (1; 1% instances), nsubj:outer (1; 1% instances)

Children of DET nodes belong to 11 different parts of speech: ADV (51; 35% instances), DET (24; 16% instances), PUNCT (23; 16% instances), PRON (17; 12% instances), ADP (8; 5% instances), CCONJ (7; 5% instances), VERB (6; 4% instances), NOUN (4; 3% instances), SCONJ (4; 3% instances), ADJ (2; 1% instances), NUM (1; 1% instances)