This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

629742 tokens (42%) have a non-empty value of Gender. 110073 types (78%) occur at least once with a non-empty value of Gender. 42366 lemmas (81%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (358921; 24%), ADJ (95431; 6%), VERB (55693; 4%), PROPN (49816; 3%), PRON (35076; 2%), DET (24512; 2%), AUX (6196; 0%), NUM (4097; 0%).
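The token, type, and lemma counts above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, not the script that generated this page: the sample rows are invented for illustration, the function names are hypothetical, and it assumes that a "type" is a distinct case-sensitive word form.

```python
from collections import Counter

# Hypothetical CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
# Invented sample data, not taken from UD_Russian-SynTagRus.
ROWS = [
    ("1", "новый", "новый", "ADJ", "_", "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing", "2", "amod", "_", "_"),
    ("2", "дом", "дом", "NOUN", "_", "Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing", "3", "nsubj", "_", "_"),
    ("3", "стоял", "стоять", "VERB", "_", "Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|Tense=Past", "0", "root", "_", "_"),
    ("4", "там", "там", "ADV", "_", "Degree=Pos", "3", "advmod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for every basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_stats(text):
    """Count tokens, types, and lemmas that carry a non-empty Gender value."""
    tokens = list(read_tokens(text))
    with_gender = [tok for tok in tokens if "Gender" in tok[3]]
    return {
        "tokens": len(with_gender),
        "token_share": len(with_gender) / len(tokens),
        "types": len({form for form, _, _, _ in with_gender}),
        "lemmas": len({lemma for _, lemma, _, _ in with_gender}),
        "by_upos": Counter(upos for _, _, upos, _ in with_gender),
    }


stats = gender_stats(SAMPLE)
print(stats)
```

To run it on a real treebank, the contents of a `.conllu` file can be passed to `gender_stats` in place of `SAMPLE`.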

NOUN

358921 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (309383; 86%), Number=Sing (253320; 71%).
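A co-occurrence list of this kind can be approximated as follows. This is a sketch under stated assumptions: the input is an already-parsed list of `(upos, feats)` pairs, the function name `cooccurring_features` is hypothetical, and the sample tokens are invented. It does not model the `EMPTY` pseudo-value that this page reports for absent features.

```python
from collections import Counter


def cooccurring_features(tokens, upos, feature="Gender"):
    """Rank the other feature=value pairs seen on tokens of `upos` that carry `feature`.

    tokens: iterable of (upos, feats) pairs, where feats is a dict.
    Returns a list of (feature=value, count, share) sorted by descending count.
    """
    counts = Counter()
    total = 0
    for tok_upos, feats in tokens:
        if tok_upos != upos or feature not in feats:
            continue
        total += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return [(fv, n, n / total) for fv, n in counts.most_common()]


# Invented sample tokens, for illustration only.
tokens = [
    ("NOUN", {"Animacy": "Inan", "Case": "Nom", "Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Animacy": "Inan", "Case": "Acc", "Gender": "Fem", "Number": "Plur"}),
    ("NOUN", {"Animacy": "Anim", "Case": "Gen", "Gender": "Fem", "Number": "Sing"}),
    ("ADJ", {"Case": "Nom", "Degree": "Pos", "Gender": "Masc", "Number": "Sing"}),
]

ranking = cooccurring_features(tokens, "NOUN")
for feature_value, count, share in ranking:
    print(f"{feature_value} ({count}; {share:.0%})")
```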

NOUN tokens may have the following values of Gender:

Paradigm о (columns: Masc, Fem, Neut)
Case=Acc|Number=Sing: о
Case=Acc|Number=Plur: о.
Case=Nom|Number=Sing: О; о

Gender seems to be a lexical feature of NOUN: 100% of lemmas (18953) occur with only one value of Gender.
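The "lexical feature" test amounts to checking how many lemmas of a given part of speech are ever seen with more than one Gender value. A minimal sketch, assuming tokens are available as `(upos, lemma, feats)` triples; the function name and the sample data are hypothetical.

```python
from collections import defaultdict


def single_gender_lemmas(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for tok_upos, lemma, feats in tokens:
        if tok_upos == upos and "Gender" in feats:
            values[lemma].add(feats["Gender"])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Invented sample tokens, for illustration only.
tokens = [
    ("NOUN", "дом", {"Gender": "Masc"}),
    ("NOUN", "дом", {"Gender": "Masc"}),
    ("NOUN", "книга", {"Gender": "Fem"}),
    ("PROPN", "Саша", {"Gender": "Masc"}),
    ("PROPN", "Саша", {"Gender": "Fem"}),
    ("PROPN", "Москва", {"Gender": "Fem"}),
]

for upos in ("NOUN", "PROPN"):
    single, total = single_gender_lemmas(tokens, upos)
    print(f"{upos}: {single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that the feature is a property of the lemma rather than of the individual word form.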

ADJ

95431 ADJ tokens (65% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (95431; 100%), Degree=Pos (94911; 99%).

ADJ tokens may have the following values of Gender:

Paradigm новый (forms by Gender)
Animacy=Anim|Case=Acc|Degree=Pos: Masc нового
Animacy=Inan|Case=Acc|Degree=Pos: Masc новый
Case=Acc|Degree=Pos: Fem новую; Neut новое
Case=Acc|Degree=Sup: Fem новейшую; Neut новейшее
Case=Dat|Degree=Pos: Masc новому; Fem новой; Neut новому
Case=Gen|Degree=Pos: Masc нового; Fem новой; Neut нового
Case=Gen|Degree=Sup: новейшего
Case=Ins|Degree=Pos: Masc новым; Fem новой; Neut новым
Case=Ins|Degree=Sup: Fem новейшей; Neut новейшим
Case=Loc|Degree=Pos: Masc новом; Fem новой; Neut новом
Case=Loc|Degree=Sup: Masc новейшем; Fem новейшей
Case=Nom|Degree=Pos: Masc новый; Fem новая; Neut новое
Case=Nom|Degree=Sup: Fem новейшая; Neut новейшее
Degree=Pos|Variant=Short: Masc нов; Fem нова; Neut ново

VERB

55693 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (55693; 100%), Person=EMPTY (55693; 100%), Tense=Past (51982; 93%), Mood=Ind (40924; 73%), VerbForm=Fin (40924; 73%), Aspect=Perf (35091; 63%), Voice=Act (33371; 60%).

VERB tokens may have the following values of Gender:

Paradigm мочь (forms by Gender)
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Part: Fem могущую
Aspect=Imp|Case=Nom|Tense=Pres|VerbForm=Part: Neut могущее
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin: Masc мог; Fem могла; Neut могло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Fin: Masc смог; Fem смогла; Neut смогло

PROPN

49816 PROPN tokens (89% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (48942; 98%), Animacy=Anim (26589; 53%).

PROPN tokens may have the following values of Gender:

Paradigm GONGO (columns: Masc, Fem, Neut)
Case=Gen|Number=Sing: GONGO
Case=Ins|Number=Plur: GONGO
Case=Nom|Number=Plur: GONGO

Gender seems to be a lexical feature of PROPN: 98% of lemmas (8578) occur with only one value of Gender.

PRON

35076 PRON tokens (48% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (35073; 100%), Person=EMPTY (19190; 55%).

PRON tokens may have the following values of Gender:

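Paradigm который (forms by Gender)
Animacy=Anim|Case=Acc: Masc которого
Animacy=Inan|Case=Acc: Masc который
Case=Acc: Fem которую; Neut которое
Case=Dat: Masc которому; Fem которой; Neut которому
Case=Gen: Masc которого; Fem которой; Neut которого
Case=Ins: Masc которым; Fem которой; Neut которым
Case=Loc: Masc котором; Fem которой; Neut котором
Case=Nom: Masc который; Fem которая; Neut которое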

DET

24512 DET tokens (60% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (24510; 100%), Poss=EMPTY (20550; 84%).

DET tokens may have the following values of Gender:

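Paradigm этот (forms by Gender)
Animacy=Anim|Case=Acc: Masc этого
Animacy=Inan|Case=Acc: Masc этот
Case=Acc: Fem эту; Neut это
Case=Acc|PronType=Dem: Masc этот, этого, это; Fem эту; Neut это
Case=Dat: Masc этому; Fem этой; Neut этому
Case=Dat|PronType=Dem: Masc этому; Fem этой; Neut этому
Case=Gen: Masc этого; Fem этой; Neut этого
Case=Gen|PronType=Dem: Masc этого; Fem этой; Neut этого
Case=Ins: Masc этим; Fem этой; Neut этим
Case=Ins|PronType=Dem: Masc этим; Fem этой; Neut этим
Case=Loc: Masc этом; Fem этой; Neut этом
Case=Loc|PronType=Dem: Masc этом; Fem этой; Neut этом
Case=Nom: Masc этот; Fem эта; Neut это
Case=Nom|PronType=Dem: Masc этот; Fem эта; Neut это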

AUX

6196 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (6196; 100%), Number=Sing (6196; 100%), Person=EMPTY (6196; 100%), Tense=Past (6196; 100%), Voice=Act (6196; 100%), Mood=Ind (6191; 100%), VerbForm=Fin (6191; 100%).

AUX tokens may have the following values of Gender:

Paradigm быть (forms by Gender)
Case=Loc|VerbForm=Part: бывшем
Case=Nom|VerbForm=Part: Masc бывший
Mood=Ind|VerbForm=Fin: Masc был; Fem была; Neut было

NUM

4097 NUM tokens (21% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Animacy=EMPTY (3177; 78%), NumType=Card (2992; 73%).

NUM tokens may have the following values of Gender:

Paradigm один (forms by Gender)
Animacy=Anim|Case=Acc: Masc одного
Animacy=Anim|Case=Acc|NumType=Card: Masc одного
Animacy=Inan|Case=Acc: Masc один
Animacy=Inan|Case=Acc|Number=Sing: Masc один
Animacy=Inan|Case=Acc|NumType=Card: Masc один
Case=Acc: Fem одну; Neut одно
Case=Acc|Number=Sing: Masc один
Case=Acc|NumType=Card: Fem одну; Neut одно
Case=Dat: Masc одному; Fem одной; Neut одному
Case=Dat|NumType=Card: Masc одному; Fem одной; Neut одному
Case=Gen: Masc одного; Fem одной; Neut одного
Case=Gen|NumType=Card: Masc одного; Fem одной; Neut одного
Case=Ins: Masc одним; Fem одной, одною; Neut одним
Case=Ins|NumType=Card: Masc одним; Fem одной; Neut одним
Case=Loc: Masc одном; Fem одной; Neut одном
Case=Loc|NumType=Card: Masc одном; Fem одной; Neut одном
Case=Nom: Masc один; Fem одна; Neut одно
Case=Nom|NumType=Card: Masc один; Fem одна; Neut одно

Relations with Agreement in Gender

The 10 most frequent relations in which the parent and child nodes agree in Gender: NOUN –[amod]–> ADJ (70123; 65%), NOUN –[det]–> DET (20216; 59%), PROPN –[flat:name]–> PROPN (7056; 97%), VERB –[conj]–> VERB (5528; 55%), VERB –[nsubj]–> PROPN (4797; 66%), NOUN –[appos]–> PROPN (4552; 79%), ADJ –[nsubj]–> NOUN (3751; 62%), ADJ –[conj]–> ADJ (3393; 94%), NOUN –[amod]–> VERB (2669; 61%), PROPN –[amod]–> ADJ (2208; 92%).
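Agreement counts of this kind can be sketched by walking every dependency edge and comparing the Gender value of the parent and the child. The code below is an illustration only: the token dictionaries, the function name, and the sample sentences are hypothetical, and it assumes the percentage is taken over all edges of the same parent tag, relation, and child tag, which this page does not state explicitly.

```python
from collections import Counter


def agreement_by_relation(sentences, feature="Gender"):
    """Count edges whose parent and child share the same value of `feature`.

    sentences: list of sentences; each sentence is a list of dicts with the
    keys id, head, upos, deprel, feats.
    Returns {(parent_upos, deprel, child_upos): (agreeing_edges, share)}.
    """
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # the root token has head 0 and no parent node
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            parent_value = parent["feats"].get(feature)
            if parent_value is not None and parent_value == child["feats"].get(feature):
                agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total if agree[key]}


# Invented sample sentences, for illustration only.
sentences = [
    [
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Masc", "Number": "Sing"}},
        {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "feats": {"Gender": "Masc", "Number": "Sing"}},
        {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Gender": "Masc", "Number": "Sing"}},
    ],
    [
        # Plural adjectives carry no Gender value, so this edge cannot agree.
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Number": "Plur"}},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Masc", "Number": "Plur"}},
    ],
]

result = agreement_by_relation(sentences)
for (parent, deprel, child), (count, share) in sorted(result.items(), key=lambda kv: -kv[1][0]):
    print(f"{parent} -[{deprel}]-> {child} ({count}; {share:.0%})")
```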