This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-NYUAD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

477701 tokens (65%) have a non-empty value of Gender. 1 type (0) occurs at least once with a non-empty value of Gender. 4838 lemmas (96%) occur at least once with a non-empty value of Gender. The feature is used with 16 part-of-speech tags: NOUN (221645; 30% instances), ADJ (69179; 9% instances), VERB (55373; 7% instances), PROPN (54272; 7% instances), PRON (43070; 6% instances), ADV (19509; 3% instances), DET (6065; 1% instances), AUX (4101; 1% instances), NUM (3526; 0% instances), X (482; 0% instances), ADP (192; 0% instances), PUNCT (154; 0% instances), CCONJ (88; 0% instances), SCONJ (25; 0% instances), PART (17; 0% instances), INTJ (3; 0% instances).
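Per-tag counts like the ones above can be approximated from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `gender_counts_by_upos` and the toy sentence are hypothetical, and it assumes that only ordinary word lines (integer IDs) are counted.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Count tokens with a non-empty Gender value, grouped by UPOS tag."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, total

# Toy input (hypothetical); forms and lemmas are left as underscores.
sample = "\n".join([
    "# sent_id = toy-1",
    "1\t_\t_\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "2\t_\t_\tADJ\t_\tGender=Fem|Number=Sing\t1\tamod\t_\t_",
    "3\t_\t_\tADP\t_\t_\t1\tcase\t_\t_",
])
with_gender, total = gender_counts_by_upos(sample)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives a per-tag share comparable to the "of all NOUN tokens" percentages in the sections that follow.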

NOUN

221645 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (197322; 89%), Case=Gen (142652; 64%).
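The co-occurrence percentages appear to be relative to the tokens of the given tag that carry Gender (for example, 197322 of 221645 NOUN tokens is about 89%). A minimal sketch of that computation follows, assuming tokens are already available as `(upos, feats_dict)` pairs. The function name `cooccurrence` and the toy data are hypothetical, and the sketch does not model the `EMPTY` pseudo-value that this page reports for absent features.

```python
from collections import Counter

def cooccurrence(tokens, upos, feature="Gender"):
    """Count other feature=value pairs on tokens of `upos` that carry `feature`.

    tokens: iterable of (upos, feats) pairs, where feats is a dict.
    Returns (number of matching tokens, Counter of 'Name=Value' strings).
    """
    n = 0
    counts = Counter()
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return n, counts

# Toy data (hypothetical).
tokens = [
    ("NOUN", {"Gender": "Fem", "Number": "Sing", "Case": "Gen"}),
    ("NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Number": "Plur"}),                    # no Gender: ignored
    ("ADJ", {"Gender": "Fem", "Number": "Sing"}),    # other tag: ignored
]
n, counts = cooccurrence(tokens, "NOUN")
for pair, c in counts.most_common():
    print(pair, c, f"{round(100 * c / n)}%")

# Arithmetic check against the figures reported above for Number=Sing.
page_share = round(100 * 197322 / 221645)
```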

NOUN tokens may have the following values of Gender:

ADJ

69179 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (66123; 96%), Definite=Def (45840; 66%), Case=Gen (40733; 59%).

ADJ tokens may have the following values of Gender:

VERB

55373 VERB tokens (100% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (51943; 94%), Voice=Act (51452; 93%), Mood=Ind (50158; 91%), Number=Sing (49732; 90%), Aspect=Perf (28891; 52%).

VERB tokens may have the following values of Gender:

PROPN

54272 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (53581; 99%), Case=EMPTY (43512; 80%), Definite=Ind (40325; 74%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (4789) occur with only one value of Gender.
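One way to check whether a feature behaves lexically is to collect, for each lemma, the set of values it occurs with and to measure the share of lemmas with exactly one value. The sketch below assumes tokens are given as `(lemma, upos, feats_dict)` triples. The function name `single_value_lemma_share` and the toy lemmas are hypothetical, and the threshold at which the page labels a feature "lexical" is not stated here.

```python
from collections import defaultdict

def single_value_lemma_share(tokens, upos, feature="Gender"):
    """Return (count, percentage) of lemmas of `upos` seen with exactly one value of `feature`."""
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, 100.0 * single / len(values_by_lemma)

# Toy data (hypothetical lemmas A-D).
tokens = [
    ("A", "PROPN", {"Gender": "Fem"}),
    ("A", "PROPN", {"Gender": "Fem"}),
    ("B", "PROPN", {"Gender": "Masc"}),
    ("C", "PROPN", {"Gender": "Masc"}),
    ("C", "PROPN", {"Gender": "Fem"}),   # C varies, so it is not counted as single-valued
    ("D", "PROPN", {}),                  # no Gender: not considered at all
]
single, share = single_value_lemma_share(tokens, "PROPN")
print(single, f"{share:.0f}%")
```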

PRON

43070 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (36378; 84%), PronType=Prs (30458; 71%), Definite=Def (30207; 70%), Person=3 (29809; 69%).

PRON tokens may have the following values of Gender:

ADV

19509 ADV tokens (81% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (19509; 100%), Number=Sing (19508; 100%), Definite=Com (15109; 77%), Case=Acc (13032; 67%).

ADV tokens may have the following values of Gender:

DET

6065 DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=Ind (6055; 100%), Number=Sing (5881; 97%).

DET tokens may have the following values of Gender:

AUX

4101 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Voice=Act (4076; 99%), Person=3 (3924; 96%), Number=Sing (3824; 93%), Mood=Ind (3347; 82%).

AUX tokens may have the following values of Gender:

NUM

3526 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (3328; 94%), Number=Sing (3125; 89%), Definite=Com (2440; 69%), Case=Gen (2114; 60%).

NUM tokens may have the following values of Gender:

X

482 X tokens (52% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (424; 88%), Mood=EMPTY (285; 59%), Person=EMPTY (277; 57%), Voice=EMPTY (277; 57%).

X tokens may have the following values of Gender:

Gender seems to be a lexical feature of X. 93% of lemmas (25) occur with only one value of Gender.

ADP

192 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Prep (176; 92%).

ADP tokens may have the following values of Gender:

PUNCT

154 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

CCONJ

88 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

SCONJ

25 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

PART

17 PART tokens (1% of all PART tokens) have a non-empty value of Gender.

PART tokens may have the following values of Gender:

INTJ

3 INTJ tokens (5% of all INTJ tokens) have a non-empty value of Gender.

INTJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (44733; 82%), NOUN –[nmod:poss]–> NOUN (31172; 57%), NOUN –[obj]–> NOUN (26620; 59%), VERB –[obj]–> NOUN (18078; 55%), VERB –[nsubj]–> NOUN (16417; 88%), PROPN –[flat]–> PROPN (13240; 93%), NOUN –[nmod:poss]–> PRON (9184; 58%), NOUN –[nmod]–> NOUN (8141; 70%), VERB –[iobj]–> NOUN (7589; 55%), ADV –[nmod:poss]–> NOUN (7578; 70%).
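Agreement counts of this kind can be sketched by comparing the Gender value of each node with that of its head. The code below is an illustration under stated assumptions, not the page's own method: sentences are lists of dicts with the keys `id`, `head`, `upos`, `deprel` and `feats`, the function name `gender_agreement` is hypothetical, and the denominator is assumed to be the parent-child pairs in which both nodes carry Gender (the page does not say which denominator it uses).

```python
from collections import Counter

def gender_agreement(sentence):
    """Count parent-child pairs that agree in Gender, keyed by (parent UPOS, deprel, child UPOS).

    Returns (agree, both): pairs with equal Gender, and pairs where both nodes have Gender.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent token
        parent_gender = parent["feats"].get("Gender")
        child_gender = child["feats"].get("Gender")
        if parent_gender is None or child_gender is None:
            continue  # only pairs where both sides have a value are compared
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_gender == child_gender:
            agree[key] += 1
    return agree, both

# Toy sentence (hypothetical).
sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Masc"}},
    {"id": 4, "head": 1, "upos": "ADP", "deprel": "case", "feats": {}},
]
agree, both = gender_agreement(sentence)
for (parent_upos, deprel, child_upos), n in agree.most_common(10):
    pct = round(100 * n / both[(parent_upos, deprel, child_upos)])
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n}; {pct}%)")
```

Summing the two counters over all sentences of a treebank and sorting by the agreement count would give a ranking in the same shape as the list above.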