
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
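As an illustration of how a layered feature such as Gender[psor] and a combined value such as Fem,Masc look in the FEATS column of a CoNLL-U file, here is a minimal sketch. The helper name `parse_feature` is hypothetical and is not part of any UD tool; the sketch assumes the UD v2 convention that multiple values of one feature are joined with commas.

```python
import re

# Feature name, optionally followed by a layer in square brackets.
_NAME_RE = re.compile(r"([A-Za-z0-9]+)(?:\[([a-z0-9]+)\])?")


def parse_feature(item):
    """Split one 'Name[layer]=V1,V2' item into (name, layer, values).

    `layer` is None for the default layer (plain 'Gender').
    Hypothetical helper written for illustration only.
    """
    name, _, value = item.partition("=")
    match = _NAME_RE.fullmatch(name)
    if match is None or not value:
        raise ValueError(f"not a feature item: {item!r}")
    return match.group(1), match.group(2), value.split(",")


plain = parse_feature("Gender=Fem,Masc")
layered = parse_feature("Gender[psor]=Masc")
```

Here `plain` carries no layer and two values, while `layered` carries the layer `psor` and a single value.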

18851 tokens (52%) have a non-empty value of Gender. 3556 types (78%) occur at least once with a non-empty value of Gender. 1592 lemmas (59%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (11062; 31% instances), ADJ (6637; 18% instances), DET (843; 2% instances), VERB (114; 0% instances), PRON (90; 0% instances), AUX (59; 0% instances), NUM (46; 0% instances).
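Counts of this kind can be approximated directly from a CoNLL-U file. The sketch below is one possible way to do it, not the script that produced this page: it counts, per part-of-speech tag, how many tokens carry a non-empty Gender. The sample sentence and the function names (`parse_feats`, `iter_tokens`, `gender_coverage`) are made up for illustration.

```python
from collections import Counter

# Tiny made-up CoNLL-U sample (not taken from UD_Czech-CLTT).
SAMPLE = """\
# text = uvedený rok a
1\tuvedený\tuvedený\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tamod\t_\t_
2\trok\trok\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_
3\ta\ta\tCCONJ\t_\t_\t2\tcc\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(conllu_text):
    """Map each UPOS tag to (tokens with Gender, all tokens)."""
    total = Counter()
    with_gender = Counter()
    for _form, _lemma, upos, feats in iter_tokens(conllu_text):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}


coverage = gender_coverage(SAMPLE)
for upos, (marked, total) in sorted(coverage.items()):
    print(f"{upos}: {marked} of {total} tokens have Gender")
```

To run this on a real treebank, read the file contents as UTF-8 text and pass them to `gender_coverage` in place of `SAMPLE`.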

NOUN

11062 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (7820; 71%), Animacy=EMPTY (6603; 60%).

NOUN tokens may have the following values of Gender:

Paradigm rok (Gender: Masc; Neut)
Animacy=Inan|Case=Acc|Number=Sing: rok
Animacy=Inan|Case=Gen|Number=Sing: roku
Animacy=Inan|Case=Ins|Number=Sing: rokem
Animacy=Inan|Case=Loc|Number=Sing: roce
Animacy=Inan|Case=Nom|Number=Sing: rok
Case=Gen|Number=Plur: let
Case=Loc|Number=Plur: letech

Gender seems to be a lexical feature of NOUN: 99% of lemmas (848) occur with only one value of Gender.
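The "lexical feature" test amounts to asking how many lemmas are ever seen with more than one value. A minimal sketch of that check follows, assuming the input is a list of (lemma, Gender) pairs already extracted from NOUN tokens. The observations and the function name `single_value_share` are invented for illustration.

```python
from collections import defaultdict

# Made-up (lemma, Gender) observations for NOUN tokens.
OBSERVATIONS = [
    ("rok", "Masc"),
    ("rok", "Neut"),
    ("zákon", "Masc"),
    ("zákon", "Masc"),
    ("osoba", "Fem"),
    ("právo", "Neut"),
]


def single_value_share(observations):
    """Return (lemmas seen with exactly one Gender value, all lemmas)."""
    values_by_lemma = defaultdict(set)
    for lemma, gender in observations:
        values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

In this toy data only rok appears with two values, which mirrors the rok paradigm listed above.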

ADJ

6637 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (6462; 97%), Degree=Pos (6247; 94%), Number=Sing (4127; 62%), Animacy=EMPTY (3966; 60%).

ADJ tokens may have the following values of Gender:

Paradigm uvedený (Gender: Fem,Masc; Fem,Neut; Masc; Fem; Neut)
Animacy=Anim|Case=Acc|Number=Plur|Polarity=Neg: neuvedené
Animacy=Inan|Case=Acc|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Acc|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Case=Dat|Number=Sing|Polarity=Pos: uvedenému
Animacy=Inan|Case=Gen|Number=Sing|Polarity=Pos: uvedeného
Animacy=Inan|Case=Gen|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Ins|Number=Sing|Polarity=Pos: uvedeným
Animacy=Inan|Case=Loc|Number=Sing|Polarity=Pos: uvedeném
Animacy=Inan|Case=Loc|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Nom|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Nom|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Number=Plur|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uvedeny
Case=Acc|Number=Sing|Polarity=Pos: uvedené
Case=Acc|Number=Plur|Polarity=Pos: uvedené, uvedená
Case=Dat|Number=Sing|Polarity=Pos: uvedené
Case=Dat|Number=Plur|Polarity=Pos: uvedeným
Case=Gen|Number=Sing|Polarity=Neg: neuvedené
Case=Gen|Number=Sing|Polarity=Pos: uvedené
Case=Gen|Number=Plur|Polarity=Pos: uvedených, uvedených
Case=Ins|Number=Sing|Polarity=Pos: uvedenou
Case=Ins|Number=Plur|Polarity=Pos: uvedenými
Case=Loc|Number=Sing|Polarity=Pos: uvedené
Case=Loc|Number=Plur|Polarity=Pos: uvedených
Case=Nom|Number=Sing|Polarity=Neg: neuvedená
Case=Nom|Number=Sing|Polarity=Pos: uvedená
Case=Nom|Number=Plur|Polarity=Pos: uvedené, uvedená
Number=Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uveden, uvedeno
Number=Plur,Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uvedena

DET

843 DET tokens (74% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (813; 96%), Person=EMPTY (813; 96%), Poss=EMPTY (792; 94%), Number=Sing (595; 71%).

DET tokens may have the following values of Gender:

Paradigm který (Gender: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Acc|Number=Sing: který
Animacy=Inan|Case=Nom|Number=Plur: které
Case=Acc|Number=Sing: kterou, které
Case=Acc|Number=Plur: které, které
Case=Dat|Number=Sing: kterému, které
Case=Gen|Number=Sing: kterého, které
Case=Ins|Number=Sing: kterým, kterou
Case=Loc|Number=Sing: kterém, které
Case=Nom|Number=Sing: který, která, které
Case=Nom|Number=Plur: které, která

VERB

114 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (114; 100%), Person=EMPTY (114; 100%), Voice=Act (114; 100%), Tense=Past (113; 99%), VerbForm=Part (113; 99%), Polarity=Pos (105; 92%).

VERB tokens may have the following values of Gender:

Paradigm moci (Gender: Fem,Neut; Masc; Neut)
Number=Sing: mohl, mohlo
Number=Plur,Sing: mohla

PRON

90 PRON tokens (14% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (90; 100%), Variant=EMPTY (89; 99%), Number=Sing (85; 94%), Person=EMPTY (54; 60%), PronType=Rel (46; 51%).

PRON tokens may have the following values of Gender:

Paradigm veškerý (Gender: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Nom|Number=Plur: veškeré
Case=Acc|Number=Sing: veškeré
Case=Acc|Number=Plur: veškeré, veškeré
Case=Gen|Number=Sing: veškerého
Case=Nom|Number=Plur: veškeré

AUX

59 AUX tokens (10% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (59; 100%), Mood=EMPTY (59; 100%), Person=EMPTY (59; 100%), Tense=Past (59; 100%), VerbForm=Part (59; 100%), Voice=Act (59; 100%), Polarity=Pos (47; 80%).

AUX tokens may have the following values of Gender:

Paradigm být (Gender: Fem,Masc; Fem,Neut; Masc; Neut)
Animacy=Inan|Number=Plur|Polarity=Neg: nebyly
Animacy=Inan|Number=Plur|Polarity=Pos: byly
Number=Sing|Polarity=Neg: nebyl
Number=Sing|Polarity=Pos: byl, bylo
Number=Plur,Sing|Polarity=Neg: nebyla
Number=Plur,Sing|Polarity=Pos: byla

NUM

46 NUM tokens (11% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (46; 100%), NumType=Card (46; 100%), Number=Sing (38; 83%).

NUM tokens may have the following values of Gender:

Paradigm jeden (Gender: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Acc: jeden
Case=Acc: jednu, jedno
Case=Dat: jednomu
Case=Gen: jednoho, jedné
Case=Ins: jedním, jednou
Case=Loc: jednom
Case=Nom: jeden

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (5825; 97%), NOUN –[conj]–> NOUN (983; 55%), ADJ –[conj]–> ADJ (183; 83%), NOUN –[appos]–> NOUN (45; 70%), NOUN –[xcomp]–> ADJ (15; 94%), DET –[nmod]–> NOUN (9; 82%), ADJ –[dep]–> NOUN (6; 75%), ADJ –[amod]–> ADJ (4; 80%), ADJ –[appos]–> ADJ (4; 100%), ADJ –[conj]–> NOUN (4; 57%).
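Agreement figures like these can be sketched as a count over parent-child pairs. The example below assumes, as a simplification, that a pair is counted only when both nodes have a Gender value, and that agreement means the two value strings are identical. Whether the page's percentages use exactly this denominator is an assumption. The tokens and the function name `gender_agreement` are made up for illustration.

```python
from collections import Counter

# Made-up tokens of one sentence: (id, upos, gender, head, deprel).
# gender is None when the token has no Gender feature; head 0 is the root.
TOKENS = [
    (1, "ADJ", "Masc", 2, "amod"),
    (2, "NOUN", "Masc", 0, "root"),
    (3, "NOUN", "Fem", 2, "conj"),
    (4, "ADJ", "Fem", 3, "amod"),
    (5, "ADP", None, 3, "case"),
]


def gender_agreement(tokens):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, all pairs).

    Only pairs where both parent and child carry Gender are counted.
    """
    by_id = {tok[0]: tok for tok in tokens}
    agree = Counter()
    both = Counter()
    for _tok_id, upos, gender, head, deprel in tokens:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


stats = gender_agreement(TOKENS)
for (parent_pos, deprel, child_pos), (ok, seen) in sorted(stats.items()):
    print(f"{parent_pos} -[{deprel}]-> {child_pos}: {ok} of {seen} agree")
```

Combined values such as Fem,Masc would need a set-overlap comparison instead of string equality; the sketch leaves that out.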