home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

9006 tokens (47%) have a non-empty value of Gender. 5708 types (74%) occur at least once with a non-empty value of Gender. 3792 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (4892; 25% instances), ADJ (1274; 7% instances), PROPN (1101; 6% instances), VERB (811; 4% instances), PRON (446; 2% instances), DET (262; 1% instances), AUX (155; 1% instances), NUM (65; 0% instances).

NOUN

4892 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (4263; 87%), Number=Sing (3522; 72%).

NOUN tokens may have the following values of Gender:

Paradigm представительMascFem
Case=Gen|Number=Plurпредставителей
Case=Nom|Number=SingПредставительПредставитель
Case=Nom|Number=Plurпредставители

Gender seems to be lexical feature of NOUN. 100% lemmas (1903) occur only with one value of Gender.

ADJ

1274 ADJ tokens (61% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1273; 100%), Degree=Pos (1267; 99%).

ADJ tokens may have the following values of Gender:

Paradigm новыйMascFemNeut
Animacy=Inan|Case=Accновый
Case=Accновуюновое
Case=Datновому
Case=Genновогоновойнового
Case=Insновой
Case=LocНовой
Case=Nomновыйноваяновое

PROPN

1101 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1051; 95%), Animacy=Anim (577; 52%).

PROPN tokens may have the following values of Gender:

Paradigm ТрампMascFem
Case=AccТрампа
Case=DatТрампу
Case=GenТрампа
Case=InsТрампом
Case=LocТрампе
Case=NomТрампТрамп

Gender seems to be lexical feature of PROPN. 99% lemmas (729) occur only with one value of Gender.

VERB

811 VERB tokens (38% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (811; 100%), Person=EMPTY (811; 100%), Tense=Past (756; 93%), Mood=Ind (585; 72%), VerbForm=Fin (585; 72%), Aspect=Perf (576; 71%), Voice=Act (482; 59%).

VERB tokens may have the following values of Gender:

Paradigm статьMascFemNeut
Animacy=Anim|Case=Acc|VerbForm=Part|Voice=Midставшего
Case=Gen|VerbForm=Part|Voice=Midставшей
Mood=Ind|VerbForm=Fin|Voice=Actстал
Mood=Ind|VerbForm=Fin|Voice=Midсталсталастало

PRON

446 PRON tokens (56% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (446; 100%), Case=Nom (256; 57%), Person=EMPTY (226; 51%).

PRON tokens may have the following values of Gender:

Paradigm которыйMascFemNeut
Animacy=Inan|Case=Accкоторый
Case=Accкоторуюкоторое
Case=Datкоторому
Case=Genкоторогокоторой
Case=Insкоторым
Case=Locкоторомкоторойкотором
Case=Nomкоторыйкотораякоторое

Gender seems to be lexical feature of PRON. 93% lemmas (13) occur only with one value of Gender.

DET

262 DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (262; 100%).

DET tokens may have the following values of Gender:

Paradigm этотMascFemNeut
Animacy=Inan|Case=Accэтот
Case=Accэтуэто
Case=Datэтойэтому
Case=Genэтогоэтойэтого
Case=Locэтомэтой
Case=Nomэтотэтаэто

AUX

155 AUX tokens (52% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (155; 100%), Mood=Ind (155; 100%), Number=Sing (155; 100%), Person=EMPTY (155; 100%), Tense=Past (155; 100%), VerbForm=Fin (155; 100%), Voice=Act (155; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
былбылабыло

NUM

65 NUM tokens (22% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Animacy=EMPTY (52; 80%), Number=Sing (40; 62%).

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Anim|Case=Accодного
Case=Accодну
Case=Datодному
Case=Genодногооднойодного
Case=InsоднимоднойОдним
Case=Locодномодной
Case=Nomодиноднаодно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (962; 58%), NOUN –[det]–> DET (242; 52%), VERB –[nsubj]–> PROPN (132; 70%), VERB –[nsubj]–> PRON (127; 59%), PROPN –[flat:name]–> PROPN (102; 99%), NOUN –[flat:name]–> PROPN (92; 96%), VERB –[aux:pass]–> AUX (80; 88%), PROPN –[amod]–> ADJ (67; 97%), VERB –[nsubj:pass]–> NOUN (66; 55%), VERB –[conj]–> VERB (57; 52%).