home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-IU: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

50784 tokens (41%) have a non-empty value of Gender. 23066 types (73%) occur at least once with a non-empty value of Gender. 13003 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (28890; 24% instances), ADJ (8213; 7% instances), VERB (3609; 3% instances), PROPN (3409; 3% instances), DET (2994; 2% instances), PRON (2770; 2% instances), AUX (482; 0% instances), NUM (406; 0% instances), X (11; 0% instances).

NOUN

28890 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (24711; 86%), Number=Sing (20762; 72%).

NOUN tokens may have the following values of Gender:

Paradigm головаMascFem
Animacy=Anim|Case=Acc|Number=SingГолову
Animacy=Anim|Case=Dat|Number=Singголові
Animacy=Anim|Case=Gen|Number=Singголови
Animacy=Anim|Case=Gen|Number=Plurголів
Animacy=Anim|Case=Ins|Number=Singголовою
Animacy=Anim|Case=Ins|Number=Plurголовами
Animacy=Anim|Case=Nom|Number=SingголоваГолова
Animacy=Inan|Case=Acc|Number=Singголову
Animacy=Inan|Case=Acc|Number=Plurголови
Animacy=Inan|Case=Gen|Number=Singголови
Animacy=Inan|Case=Ins|Number=Singголовою
Animacy=Inan|Case=Loc|Number=Singголові
Animacy=Inan|Case=Nom|Number=Singголова
Animacy=Inan|Case=Nom|Number=Plurголови

Gender seems to be lexical feature of NOUN. 100% lemmas (6895) occur only with one value of Gender.

ADJ

8213 ADJ tokens (68% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8213; 100%), Animacy=EMPTY (7662; 93%), Aspect=EMPTY (7460; 91%), VerbForm=EMPTY (7460; 91%), Voice=EMPTY (7460; 91%), Degree=EMPTY (5870; 71%).

ADJ tokens may have the following values of Gender:

Paradigm українськийMascFemNeut
Animacy=Inan|Case=Accукраїнський
Case=Accукраїнськуукраїнське
Case=DatукраїнськомуукраїнськійУкраїнському
Case=Genукраїнськогоукраїнськоїукраїнського
Case=Insукраїнськимукраїнською
Case=Locукраїнськомуукраїнськійукраїнському
Case=Nomукраїнськийукраїнськаукраїнське

VERB

3609 VERB tokens (28% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (3609; 100%), Number=Sing (3609; 100%), Person=EMPTY (3609; 100%), Tense=Past (3609; 100%), VerbForm=Fin (3609; 100%), Aspect=Perf (1963; 54%).

VERB tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
бувбулабуло

PROPN

3409 PROPN tokens (97% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3386; 99%), Uninflect=EMPTY (2902; 85%), Animacy=Anim (2010; 59%).

PROPN tokens may have the following values of Gender:

Paradigm І.MascFem
Case=Acc|NameType=GivІ
Case=Gen|NameType=GivІ
Case=Gen|NameType=PatІ
Case=Nom|NameType=GivІІ
Case=Nom|NameType=PatІ

Gender seems to be lexical feature of PROPN. 99% lemmas (1519) occur only with one value of Gender.

DET

2994 DET tokens (64% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2994; 100%), Animacy=EMPTY (2671; 89%), Reflex=EMPTY (2601; 87%), Person=EMPTY (2432; 81%), Poss=EMPTY (2163; 72%).

DET tokens may have the following values of Gender:

Paradigm якийMascFemNeut
Animacy=Anim|Case=Acc|PronType=Relякого
Animacy=Inan|Case=Acc|PronType=Indякий
Animacy=Inan|Case=Acc|PronType=Intякий
Animacy=Inan|Case=Acc|PronType=Relякий
Case=Acc|PronType=Relякуяке
Case=Dat|PronType=Relякомуякій
Case=Gen|PronType=Indбудь-якої
Case=Gen|PronType=IntЯкогоякої
Case=Gen|PronType=Relякогоякоїякого
Case=Ins|PronType=Indяким
Case=Ins|PronType=Relякимякоюяким
Case=Loc|PronType=Indякій
Case=Loc|PronType=Relякомуякійякому
Case=Nom|PronType=Indякийяка
Case=Nom|PronType=Intяка
Case=Nom|PronType=Relякийякаяке

PRON

2770 PRON tokens (55% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2770; 100%), Animacy=EMPTY (1386; 50%), Person=3 (1386; 50%), PronType=Prs (1386; 50%).

PRON tokens may have the following values of Gender:

Gender seems to be lexical feature of PRON. 100% lemmas (22) occur only with one value of Gender.

AUX

482 AUX tokens (46% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (482; 100%), Mood=Ind (482; 100%), Number=Sing (482; 100%), Person=EMPTY (482; 100%), Tense=Past (482; 100%), VerbForm=Fin (482; 100%).

AUX tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
бувбулабуло

NUM

406 NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (406; 100%), Case=Nom (210; 52%), Uninflect=EMPTY (209; 51%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Case=Accдва, двохдві, двохдва
Case=Genдвохдвохдвох
Case=Insдвомадвома
Case=Locдвохдвох
Case=Nomдвадвідва

X

11 X tokens (2% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (11; 100%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (11) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6486; 69%), NOUN –[det]–> DET (2072; 69%), PROPN –[flat:name]–> PROPN (530; 100%), VERB –[conj]–> VERB (481; 64%), ADJ –[conj]–> ADJ (382; 96%), NOUN –[flat:title]–> PROPN (364; 76%), ADJ –[nsubj]–> NOUN (317; 64%), VERB –[nsubj]–> PROPN (294; 65%), NOUN –[appos]–> NOUN (277; 57%), PROPN –[conj]–> PROPN (149; 76%).