home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

83109 tokens (49%) have a non-empty value of Gender. 25346 types (81%) occur at least once with a non-empty value of Gender. 10219 lemmas (82%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (37229; 22% instances), ADJ (13348; 8% instances), PROPN (12204; 7% instances), DET (8106; 5% instances), PRON (5388; 3% instances), VERB (5058; 3% instances), NUM (1512; 1% instances), AUX (264; 0% instances).

NOUN

37229 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (25892; 70%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 99% lemmas (3490) occur only with one value of Gender.

ADJ

13348 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (11982; 90%), Variant=EMPTY (11516; 86%), Number=Sing (9698; 73%).

ADJ tokens may have the following values of Gender:

Paradigm великийMascFemNeut
Animacy=Anim|Case=Acc|Number=Singвеликого, великаго, великог[о], великѡг[о]
Animacy=Anim|Case=Acc|Number=Plurвеликихъ
Case=Acc|Number=Singвеликии, великий, Великій, великой, великои, Великіивеликую, великии, великꙋювеликое
Case=Acc|Number=Sing|Variant=ShortВелики, великъвелику, вели]кувелико
Case=Acc|Number=Dual|Variant=Shortвелице
Case=Acc|Number=Plurвеликиевеликая, великаꙗ
Case=Dat|Number=Singвеликому, великомꙋвеликои, великойвеликому
Case=Dat|Number=Sing|Variant=Shortвеликувелику
Case=Dat|Number=Plurвеликимъ, великимвеликимъ
Case=Gen|Number=Singвеликого, великаго, великог[о], Вєликаг[о], вел[икого, великово, вєликог[о]великія, великия, великои, великие, великиє, великое, великой, великіе, великіивеликого, великог[о], великаго
Case=Gen|Number=Plurвеликихъ, великих, великыхвеликихъ, великихвеликих, Великых, великихъ
Case=Gen|Number=Plur|Typo=Yesвеликии
Case=Ins|Number=Singвеликим, великимъ, Великимь, великымвеликоювеликим, великимъ, великым
Case=Ins|Number=Plurвеликим, великими
Case=Loc|Number=Singвеликом, великомъ, Велицем, Велікомъвеликой, велицеивеликом, великомъ, велицем
Case=Loc|Number=Sing|Variant=Shortвелице
Case=Loc|Number=Plurвеликихвеликих, великихъ
Case=Nom|Number=Singвеликий, великии, великій, великой, великыи, великiй, великїи, Великиⸯ, Великіи, вели[кии]великая, великаꙗвеликое
Case=Nom|Number=Sing|Variant=Shortвелики, великы, велик, великъ, велик[и], великьвеликавелико
Case=Nom|Number=Plurвеликие, великии, великиие, великия, великіе, велицыи
Case=Nom|Number=Plur|Variant=Shortвелики, велицивелики

PROPN

12204 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (12052; 99%).

PROPN tokens may have the following values of Gender:

Paradigm ВасильевъMascFem
Animacy=Anim|Case=Acc|NameType=Pat|Number=SingВасильева
Animacy=Anim|Case=Acc|NameType=Sur|Number=SingВасильева
Case=Acc|NameType=Pat|Number=PlurВасильевы
Case=Dat|NameType=Pat|Number=SingВасильевѣ
Case=Dat|NameType=Sur|Number=SingВасильеву
Case=Gen|NameType=Pat|Number=PlurВасильевых
Case=Gen|NameType=Sur|Number=SingВасил(ь)ева, Васильева
Case=Ins|NameType=Pat|Number=SingВасильевым
Case=Nom|NameType=Pat|Number=SingВасильев
Case=Nom|NameType=Sur|Number=SingВасильев, Васильевъ, Васильєвъ

Gender seems to be lexical feature of PROPN. 99% lemmas (3175) occur only with one value of Gender.

DET

8106 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Reflex=EMPTY (7024; 87%), Number=Sing (5314; 66%), Poss=EMPTY (5286; 65%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Animacy=Anim|Case=Acc|Number=Singтого, тово, тог[о]того
Animacy=Anim|Case=Acc|Number=Plurтѣхъ, тех, тѣх, техъ
Case=Acc|Number=Singтот, тотъ, тои, тод, той, тыи, тѣиту, тое, тоѣ, тую, тою, тае, тоито, того, тое, тѡ
Case=Acc|Number=Plurтѣ, тете, тѣ, тата, тѣ, те
Case=Dat|Number=Singтому, томꙋтой, тои, тѡитому, темь, томꙋ
Case=Dat|Number=Plurтем, тѣмъ, тѣм, тѣмь, темъ, тымътем, тѣмъ, темъ, тымътѣмъ, тем, тѣм
Case=Gen|Number=Singтого, тово, тог[о], таво, того], тоетое, той, тои, тоя, то(й), тоей, тоѣтого, тово, тог[о], таво, таго, тѡгѡ
Case=Gen|Number=Plurтех, тѣхъ, тѣх, тихъ, тыхътех, тѣхъ, тѣхтех, тѣхъ, тѣх, техъ
Case=Ins|Number=Singтѣмъ, тем, тѣм, тѣмь, темъ, тимътою, тои, тойтем, тѣмъ, темъ, тѣм
Case=Ins|Number=Plurтѣми, теми, тымитѣми, теми, тѣнитеми, тѣми
Case=Loc|Number=Singтом, томътой, тои, тое, тоя, тоітом, томъ, то(м)
Case=Loc|Number=Plurтех, тѣхъ, тѣхтѣхъ, тех, тѣхтех, тѣх, тѣхъ
Case=Nom|Number=Singтот, тотъ, той, [тотъ, тыита, тая, те, т[а]то
Case=Nom|Number=Dualта
Case=Nom|Number=Plurте, тѣ, тии, тоете, тѣ, тата, те, тые

PRON

5388 PRON tokens (67% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (3878; 72%), PronType=Prs (3406; 63%), Person=3 (3405; 63%).

PRON tokens may have the following values of Gender:

Paradigm ониMascFemNeut
Animacy=Anim|Case=Acc|Number=Plurнихъ
Case=Acc|Number=Plurих, ихъ, нихъ, них, іхъ, iхъ, і[хъ]их, іхъіх
Case=Dat|Number=Singим
Case=Dat|Number=Plurим, имъ, ним, нимъним, нимъ, им, имъимъ
Case=Gen|Number=Plurих, ихъ, них, нихъ, нихь, іхъ, [и]хъ, [их], ихьих, них, ихъ, нихъ
Case=Ins|Number=Singим, нимъ
Case=Ins|Number=Plurними, ими, ни[ми], ним[и], німи, імиими
Case=Loc|Number=Plurних, нихъних, нихънихъ
Case=Nom|Number=Singани
Case=Nom|Number=Plurони, ани

VERB

5058 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (5058; 100%), Mood=EMPTY (5052; 100%), Tense=Past (4749; 94%), Number=Sing (4470; 88%), Variant=EMPTY (3360; 66%), Voice=Act (3008; 59%), Case=EMPTY (2849; 56%), VerbForm=PartRes (2836; 56%), Aspect=Perf (2679; 53%).

VERB tokens may have the following values of Gender:

Paradigm велѣтиMascFemNeut
Case=Loc|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actвелящих
Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвелѣно, велено, вѣленѡ
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвелел, велѣлъ, велѣл, велелъ, вѣлелвелела, велѣла

NUM

1512 NUM tokens (40% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1322; 87%), NumForm=Word (969; 64%), Case=Nom (760; 50%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Animacy=Anim|Case=Accдву, двухъ, двухдвух
Case=Accдва, дв[а]две, двѣ, [д]ведва, две
Case=Datдвум, двумъдвѣмъ
Case=Genдву, двою, двухъ, двꙋдву, дви, двою, двух, двухъдву
Case=Insдвема, двома, двумя, двѣмядвѣма, двема
Case=Locдву, двоюдву, двух, двухъдвою, дву, двухъ
Case=Nomдва, д[ва], дв[а]две, двѣ, два, дв[е], двидва

AUX

264 AUX tokens (18% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (264; 100%), Voice=Act (264; 100%), Analyt=EMPTY (256; 97%), Mood=EMPTY (256; 97%), Number=Sing (255; 97%), Tense=Past (252; 95%), VerbForm=PartRes (244; 92%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбылъбыло
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartResбыло
Animacy=Anim|Case=Acc|Number=Sing|Tense=Past|Variant=Short|VerbForm=Partбывша
Case=Acc|Number=Plur|Tense=Past|VerbForm=Partбывшіꙗ
Case=Acc|Number=Plur|Tense=Pres|VerbForm=Partсущаасущия
Case=Dat|Number=Sing|Tense=Past|Variant=Short|VerbForm=Partбывшу
Case=Dat|Number=Sing|Tense=Pres|Variant=Short|VerbForm=Partсущу, сушу, сꙋщꙋ
Case=Dat|Number=Sing|Tense=Pres|VerbForm=Partсущу
Case=Dat|Number=Plur|Tense=Past|Variant=Short|VerbForm=Partбывши
Case=Dat|Number=Plur|Tense=Pres|Variant=Short|VerbForm=Partсущемь
Case=Gen|Number=Plur|Tense=Past|VerbForm=Partбывших
Case=Gen|Number=Plur|Tense=Pres|VerbForm=Partсущих
Case=Nom|Number=Sing|Tense=Pres|Variant=Short|VerbForm=Partсущъ, сыи
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Partсущий
Case=Nom|Number=Plur|Tense=Past|Variant=Short|VerbForm=Partбывше
Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбыло
Number=Sing|Tense=Past|VerbForm=PartResбылъ, былбылабыло, былѡ, была

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (10616; 98%), NOUN –[det]–> DET (6449; 99%), NOUN –[conj]–> NOUN (3310; 59%), PROPN –[flat:name]–> PROPN (3282; 100%), NOUN –[appos]–> PROPN (2831; 90%), NOUN –[appos]–> NOUN (1024; 80%), PROPN –[conj]–> PROPN (797; 87%), ADJ –[conj]–> ADJ (623; 92%), PROPN –[appos]–> NOUN (509; 92%), PROPN –[amod]–> ADJ (422; 99%).