home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SSJ: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

121349 tokens (45%) have a non-empty value of Gender. 45177 types (93%) occur at least once with a non-empty value of Gender. 21537 lemmas (85%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (56865; 21% instances), ADJ (28426; 11% instances), VERB (11423; 4% instances), PROPN (10239; 4% instances), DET (7978; 3% instances), PRON (3964; 1% instances), AUX (1441; 1% instances), NUM (1013; 0% instances).

NOUN

56865 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (40401; 71%).

NOUN tokens may have the following values of Gender:

Paradigm deloMascNeut
Case=Acc|Number=Singdelo
Case=Acc|Number=Dualdeli
Case=Acc|Number=Plurdela
Case=Dat|Number=Singdelu
Case=Gen|Number=Singdeladela
Case=Gen|Number=Plurdel
Case=Ins|Number=Singdelom
Case=Ins|Number=Plurdeli
Case=Loc|Number=Singdelu
Case=Loc|Number=Plurdelih
Case=Nom|Number=Singdelo
Case=Nom|Number=Plurdela

Gender seems to be lexical feature of NOUN. 100% lemmas (8840) occur only with one value of Gender.

ADJ

28426 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (25970; 91%), VerbForm=EMPTY (24781; 87%), Definite=EMPTY (24330; 86%), Number=Sing (19341; 68%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Definite=Ind|Number=Singdrug
Case=Acc|Number=Singdrugegadrugodrugo
Case=Acc|Number=Plurdrugedrugedruga
Case=Dat|Number=Singdrugemudrugi
Case=Dat|Number=Plurdrugim
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugihdrugih
Case=Ins|Number=Singdrugimdrugodrugim
Case=Ins|Number=Plurdrugimidrugimidrugimi
Case=Loc|Number=Singdrugemdrugidrugem
Case=Loc|Number=Dualdrugihdrugih
Case=Loc|Number=Plurdrugihdrugihdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Dualdrugi
Case=Nom|Number=Plurdrugidrugedruga

VERB

11423 VERB tokens (46% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11423; 100%), Person=EMPTY (11423; 100%), Tense=EMPTY (11423; 100%), VerbForm=Part (11423; 100%), Number=Sing (7536; 66%), Aspect=Perf (6957; 61%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbilbilabilo, blo
Number=Dualbila, blabilibili
Number=Plurbilibile

PROPN

10239 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (9702; 95%), Case=Nom (5770; 56%).

PROPN tokens may have the following values of Gender:

Paradigm EUMascFem
Case=AccEU
Case=GenEU
Case=LocEU
Case=NomEUEU

Gender seems to be lexical feature of PROPN. 98% lemmas (4962) occur only with one value of Gender.

DET

7978 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (6502; 81%), Person=EMPTY (6502; 81%), Number=Sing (5692; 71%), Poss=EMPTY (5684; 71%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Dualti
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtemutejtemu
Case=Dat|Number=Plurtemtemtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Dualteh
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Dualtema
Case=Ins|Number=Plurtemitemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Dualteh
Case=Loc|Number=Plurtehtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualtati
Case=Nom|Number=Plurtiteta

PRON

3964 PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3964; 100%), Number=Sing (2985; 75%), PronType=Prs (2822; 71%), Person=3 (2772; 70%), Variant=Short (2162; 55%), Case=Acc (1988; 50%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singnjeganjo
Case=Acc|Number=Sing|Variant=Shortgajoga
Case=Acc|Number=Dualnjiju, onadvanjiju
Case=Acc|Number=Dual|Variant=Shortju, jihjuju
Case=Acc|Number=Plurnjih, nje
Case=Acc|Number=Plur|Variant=Shortjihjihjih
Case=Dat|Number=Singnjemunjej
Case=Dat|Number=Sing|Variant=Shortmujimu
Case=Dat|Number=Dualnjima
Case=Dat|Number=Dual|Variant=Shortjimajima
Case=Dat|Number=Plurnjimnjim
Case=Dat|Number=Plur|Variant=Shortjimjimjim
Case=Gen|Number=Singnjeganjenjega
Case=Gen|Number=Sing|Variant=Shortgajega
Case=Gen|Number=Dualnjiju
Case=Gen|Number=Dual|Variant=Shortju
Case=Gen|Number=Plurnjihnjihnjih
Case=Gen|Number=Plur|Variant=Shortjihjihjih
Case=Ins|Number=Singnjimnjonjim
Case=Ins|Number=Dualnjimanjima
Case=Ins|Number=Plurnjiminjiminjimi
Case=Loc|Number=Singnjemnjejnjem
Case=Loc|Number=Dualnjijunjima
Case=Loc|Number=Plurnjihnjihnjih
Case=Nom|Number=Singonona
Case=Nom|Number=Dualonadva
Case=Nom|Number=Pluroni

AUX

1441 AUX tokens (8% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1441; 100%), Person=EMPTY (1441; 100%), Polarity=EMPTY (1441; 100%), Tense=EMPTY (1441; 100%), VerbForm=Part (1441; 100%), Number=Sing (1129; 78%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbilbila, blabilo, blo
Number=Dualbilabilibili
Number=Plurbili, blibilebila

NUM

1013 NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (1013; 100%), NumType=Card (1007; 99%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enega, Engaenoeno
Case=Dat|Number=Singenemueni
Case=Gen|Number=Singenega, engaeneenega
Case=Gen|Number=Plurenih
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singenemenienem
Case=Loc|Number=Plurenih
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Plureni

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (21257; 99%), NOUN –[det]–> DET (4982; 88%), ADJ –[nsubj]–> NOUN (1636; 98%), NOUN –[nmod]–> PROPN (1633; 54%), PROPN –[flat:name]–> PROPN (1539; 99%), ADJ –[conj]–> ADJ (1216; 93%), VERB –[nsubj]–> PROPN (1065; 73%), VERB –[conj]–> VERB (1004; 69%), PROPN –[conj]–> PROPN (625; 77%), PROPN –[amod]–> ADJ (454; 99%).