
This page pertains to UD version 2.

Treebank Statistics: UD_Hebrew-IAHLTknesset: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; one combination has been observed: Fem|Masc.

27741 tokens (41%) have a non-empty value of Gender. 5941 types (79%) occur at least once with a non-empty value of Gender. 3529 lemmas (76%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (12993; 19% instances), PRON (5548; 8% instances), VERB (4879; 7% instances), ADJ (3090; 5% instances), AUX (648; 1% instances), NUM (439; 1% instances), PROPN (107; 0% instances), DET (19; 0% instances), SYM (18; 0% instances).
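Per-tag counts like the ones above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: `SAMPLE` is a made-up four-token fragment, and `parse_feats` and `gender_by_upos` are hypothetical helper names. It assumes the standard ten tab-separated CoNLL-U columns, with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

# Made-up CoNLL-U fragment for illustration only (not a sentence from the treebank).
SAMPLE = "\n".join([
    "# text = hypothetical example",
    "1\tהוא\tהוא\tPRON\t_\tGender=Masc|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_",
    "2\tאמר\tאמר\tVERB\t_\tGender=Masc|Number=Sing|Person=3|Tense=Past\t0\troot\t_\t_",
    "3\tאחרת\tאחר\tADJ\t_\tGender=Fem|Number=Sing\t2\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Gender."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender


total, with_gender = gender_by_upos(SAMPLE)
for upos in total:
    share = 100 * with_gender[upos] / total[upos]
    print(f"{upos}: {with_gender[upos]} of {total[upos]} tokens ({share:.0f}%)")
```

Skipping multiword-token ranges is a design choice of this sketch: it counts syntactic words only, which may or may not match how the figures on this page were produced.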

NOUN

12993 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Definite=EMPTY (10285; 79%), Number=Sing (9629; 74%).
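An entry such as `Definite=EMPTY` reads as "the token has a Gender value but no Definite feature". The sketch below shows one way such co-occurrence counts could be derived. It is an illustration under assumptions, not the page generator: `cooccurrence` is a hypothetical helper, the token list is invented, and the set of features checked for emptiness is assumed to be every feature seen on Gender-bearing tokens of the given tag.

```python
from collections import Counter


def cooccurrence(tokens, upos, feature="Gender"):
    """Count other feature values on tokens of `upos` that carry `feature`.

    `tokens` is a list of (upos, feats_dict) pairs. A feature that is absent
    from a token is counted under the placeholder value 'EMPTY'.
    """
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    # Assumption: only features seen on at least one selected token are tracked.
    names = {name for feats in selected for name in feats if name != feature}
    counts = Counter()
    for feats in selected:
        for name in names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return len(selected), counts


# Invented tokens for illustration only.
tokens = [
    ("NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Gender": "Fem", "Number": "Sing", "Definite": "Cons"}),
    ("NOUN", {"Gender": "Fem", "Number": "Plur"}),
    ("NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("VERB", {"Gender": "Masc", "Number": "Sing"}),
]

n, counts = cooccurrence(tokens, "NOUN")
for value, count in counts.most_common():
    print(f"{value} ({count}; {100 * count / n:.0f}%)")
```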

NOUN tokens may have the following values of Gender:

Paradigm פניםFem,MascMascFem
Definite=Cons|Number=Plurפניפני
Number=Singפנים
Number=Plurפני, פניםפנים, פניפני

Gender seems to be a lexical feature of NOUN. 97% of lemmas (2051) occur with only one value of Gender.
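The "lexical feature" observation rests on how many lemmas are ever seen with more than one Gender value. A minimal sketch of that check follows. `single_valued_lemmas` is a hypothetical helper and the tokens are invented, although the lemma פנים does appear above with both values. A combined value such as `Fem,Masc` is treated here as a distinct value of its own, which is an assumption.

```python
from collections import defaultdict


def single_valued_lemmas(tokens, upos, feature="Gender"):
    """Return (lemmas with exactly one value, lemmas with any value).

    `tokens` is a list of (upos, lemma, feats_dict) triples.
    """
    values = defaultdict(set)
    for tag, lemma, feats in tokens:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Invented tokens for illustration only.
tokens = [
    ("NOUN", "פנים", {"Gender": "Masc"}),
    ("NOUN", "פנים", {"Gender": "Fem"}),
    ("NOUN", "ילד", {"Gender": "Masc"}),
    ("NOUN", "ילד", {"Gender": "Masc"}),
    ("ADJ", "אחר", {"Gender": "Fem"}),
]

single, total = single_valued_lemmas(tokens, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A high share of single-valued lemmas suggests the value is fixed per lexeme, as with nouns, rather than chosen by agreement, as with adjectives.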

PRON

5548 PRON tokens (87% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (4906; 88%), Case=EMPTY (4896; 88%), Definite=EMPTY (4819; 87%), PronType=Prs (4087; 74%), Number=Sing (3965; 71%), Person=3 (3436; 62%).

PRON tokens may have the following values of Gender:

Paradigm הואFem,MascMascFem
Case=Acc|Number=Sing|Person=1ני
Case=Acc|Number=Sing|Person=2אתה
Case=Acc|Number=Sing|Person=3ה
Case=Acc|Number=Plur|Person=1נו
Case=Acc|Number=Plur|Person=2כם
Case=Acc|Number=Plur|Person=3ן
Case=Gen|Definite=Def|Number=Sing|Person=1|Poss=Yesייי
Case=Gen|Definite=Def|Number=Sing|Person=2|Poss=Yesך
Case=Gen|Definite=Def|Number=Sing|Person=3|Poss=Yesו, הוה
Case=Gen|Definite=Def|Number=Plur|Person=1|Poss=Yesנונונו
Case=Gen|Definite=Def|Number=Plur|Person=2|Poss=Yesכם
Case=Gen|Definite=Def|Number=Plur|Person=3|Poss=Yesם, הם, נון, הן
Definite=Def|Number=Sing|Person=3ו
Number=Sing|Person=1אני, יאני, יאני, י
Number=Sing|Person=2ךאתה, ךאת, ך
Number=Sing|Person=3הוא, ו, יוהיא, ה
Number=Plur|Person=1אנחנו, נואנחנו, נו, אנונו, אנחנו
Number=Plur|Person=2אתם, כם
Number=Plur|Person=3הם, ם, המה, נוהן, ן
Number=Plur|Person=3|Typo=Yesה
Person=2ך

VERB

4879 VERB tokens (66% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Voice=Act (3937; 81%), Person=3 (3120; 64%), Number=Sing (3081; 63%), VerbForm=Part (2557; 52%), Tense=Pres (2497; 51%).

VERB tokens may have the following values of Gender:

Paradigm אמרFem,MascMascFem
Aspect=Prog|Number=Sing|Person=1|Tense=Past|VerbForm=Partאומר
Aspect=Prog|Number=Sing|Person=3|Tense=Past|VerbForm=Partאומר
Mood=Irr|Number=Sing|Person=3|VerbForm=Partאומר
Mood=Irr|Number=Plur|Person=3|Tense=Pres|VerbForm=Partאומרים
Number=Sing|Person=1|Tense=Fut|VerbForm=Partאומר
Number=Sing|Person=1|Tense=Futאומר
Number=Sing|Person=1|Tense=Pastאמרתיאמרתיאמרתי
Number=Sing|Person=1|Tense=Pres|VerbForm=Partאומראומרת
Number=Sing|Person=2|Tense=Futתאמר
Number=Sing|Person=2|Tense=Pastאמרתאמרתאמרת
Number=Sing|Person=2|Tense=Pres|VerbForm=Partאומראומרת
Number=Sing|Person=3|Tense=Futנאמר
Number=Sing|Person=3|Tense=Pastאמראמרה
Number=Sing|Person=3|Tense=Pres|VerbForm=Partאומראומרת
Number=Plur|Person=1|Tense=Pastאמרנואמרנו
Number=Plur|Person=1|Tense=Pres|VerbForm=Partאומרים
Number=Plur|Person=2|Tense=Pastאמרתם
Number=Plur|Person=3|Tense=Pastאמרואמרו
Number=Plur|Person=3|Tense=Pres|VerbForm=Partאומרים

ADJ

3090 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2339; 76%).

ADJ tokens may have the following values of Gender:

Paradigm אחר | Masc | Fem
Number=Sing | אחר | אחרת
Number=Plur | אחרים | אחרות

AUX

648 AUX tokens (91% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=EMPTY (472; 73%), Number=Sing (451; 70%), Tense=EMPTY (352; 54%), Person=EMPTY (330; 51%), HebBinyan=PAAL (328; 51%).

AUX tokens may have the following values of Gender:

Paradigm יכלFem,MascMascFem
HebBinyan=PAAL|Number=Sing|Person=1|Tense=Fut|VerbType=Modאוכל
HebBinyan=PAAL|Number=Sing|Person=1|Tense=Past|VerbType=Modיכולתייכולתי
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Fut|VerbType=Modיוכלתוכל
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Pres|VerbForm=Partיכול
HebBinyan=PAAL|Number=Sing|Person=3|Tense=Pres|VerbForm=Part|VerbType=Modיכולה
HebBinyan=PAAL|Number=Sing|Person=3|VerbForm=Partיכולה
HebBinyan=PAAL|Number=Sing|Person=3|VerbForm=Part|VerbType=Modיכוליכולה
HebBinyan=PAAL|Number=Sing|VerbForm=Partיכולה
HebBinyan=PAAL|Number=Sing|VerbForm=Part|VerbType=Modיכוליכולה
HebBinyan=PAAL|Number=Plur|Person=1|Tense=Fut|VerbType=Modנוכל
HebBinyan=PAAL|Number=Plur|Person=3|Tense=Fut|VerbType=Modיוכלו
HebBinyan=PAAL|Number=Plur|Person=3|Tense=Pres|VerbForm=Partיכולים
HebBinyan=PAAL|Number=Plur|Person=3|VerbForm=Partיכולים
HebBinyan=PAAL|Number=Plur|Person=3|VerbForm=Part|VerbType=Modיכולים
HebBinyan=PAAL|Number=Plur|Tense=Pres|VerbForm=Part|VerbType=Modיכולים
HebBinyan=PAAL|Number=Plur|VerbForm=Partיכולים
HebBinyan=PAAL|Number=Plur|VerbForm=Part|VerbType=Modיכולים
Number=Sing|Person=3|Tense=Fut|VerbType=Modיוכל
Number=Sing|Person=3|VerbForm=Partיכולה
Number=Sing|VerbForm=Partיכולה
Number=Sing|VerbForm=Part|VerbType=Modיכול
Number=Plurיכולות
Number=Plur|Person=1|Tense=Fut|VerbType=Modנוכלנוכל
Number=Plur|Person=3|Tense=Fut|VerbType=Modיוכלויוכלו
Number=Plur|VerbForm=Partיכולים
Number=Plur|VerbForm=Part|VerbType=Modיכוליםיכולות

NUM

439 NUM tokens (63% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Definite=EMPTY (361; 82%), NumType=EMPTY (269; 61%).

NUM tokens may have the following values of Gender:

Paradigm אחתMascFem
_אחת
Definite=Consאחת
Definite=Cons|Number=Singאחד
Definite=Cons|Number=Sing|NumType=Cardאחד
Definite=Cons|NumType=Cardאחת
Number=Singאחד
Number=Sing|NumType=Cardאחדאחת
NumType=Cardאחת

PROPN

107 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Definite=EMPTY (95; 89%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (69) occur with only one value of Gender.

DET

19 DET tokens (0% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (17; 89%), PronType=Ind (16; 84%).

DET tokens may have the following values of Gender:

Paradigm איזהMascFem
Definite=Cons|PronType=Indאיזו
Number=Sing|PronType=Indאיזו
Number=Sing|PronType=Intאיזו
Number=Plur|PronType=Int|Typo=Yesאלו

SYM

18 SYM tokens (45% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (2179; 100%), VERB –[nsubj]–> PRON (1425; 83%), NOUN –[compound]–> NOUN (1070; 52%), VERB –[nsubj]–> NOUN (887; 62%), VERB –[conj]–> VERB (649; 70%), NOUN –[acl:relcl]–> VERB (610; 71%), NOUN –[conj]–> NOUN (610; 68%), NOUN –[det]–> PRON (564; 96%), NOUN –[nmod]–> NOUN (528; 52%), NOUN –[nsubj]–> PRON (350; 80%).
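Agreement counts of this kind can be approximated by comparing the Gender of each node with that of its head. The sketch below is an illustration under assumptions, not the script behind this page. `agreement_by_relation` is a hypothetical helper and the sentence is invented. A pair is considered only when both nodes carry Gender, and the percentage is taken to be the share of such pairs whose values are identical.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """Count parent-child pairs that agree in Gender, keyed by relation.

    `sentence` is a list of dicts with keys id, head, upos, deprel, feats.
    Returns (agree, both): `both` counts pairs where parent and child both
    have Gender, and `agree` counts those whose Gender values are equal.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0, which is not a token
        parent_gender = parent["feats"].get("Gender")
        child_gender = child["feats"].get("Gender")
        if parent_gender is None or child_gender is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_gender == child_gender:
            agree[key] += 1
    return agree, both


# Invented dependency tree for illustration only.
sentence = [
    {"id": 1, "head": 3, "upos": "NOUN", "deprel": "nsubj", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Gender": "Masc"}},
    {"id": 4, "head": 3, "upos": "PRON", "deprel": "obj", "feats": {"Gender": "Masc"}},
]

agree, both = agreement_by_relation(sentence)
for (parent_upos, deprel, child_upos), n in both.items():
    share = 100 * agree[(parent_upos, deprel, child_upos)] / n
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[(parent_upos, deprel, child_upos)]}; {share:.0f}%)")
```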