Treebank Statistics: UD_Czech-Poetry: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
This is a layered feature with the following layers: Gender, Gender[psor].
2746 tokens (44%) have a non-empty value of Gender
.
2009 types (75%) occur at least once with a non-empty value of Gender
.
1380 lemmas (72%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (1466; 23% instances), ADJ (595; 9% instances), VERB (240; 4% instances), DET (223; 4% instances), PRON (110; 2% instances), PROPN (84; 1% instances), AUX (18; 0% instances), NUM (10; 0% instances).
NOUN
1466 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (1047; 71%), Animacy=EMPTY (830; 57%).
NOUN
tokens may have the following values of Gender
:
Fem
(598; 41% of non-emptyGender
): duše, duši, chvíli, země, duší, lásky, ruka, tvář, vůní, zářMasc
(636; 43% of non-emptyGender
): den, svět, boha, bože, bůh, květ, člověk, bohy, oheň, senNeut
(232; 16% of non-emptyGender
): žití, oči, srdce, štěstí, nebes, těla, dítě, jaro, moře, očima
Paradigm oko | Fem | Neut |
---|---|---|
Case=Acc|Number=Plur | oči | |
Case=Gen|Number=Plur | očí | očí |
Case=Ins|Number=Sing | okem | |
Case=Ins|Number=Dual | očima | |
Case=Loc|Number=Sing | oku | |
Case=Nom|Number=Sing | oko | |
Case=Nom|Number=Plur | oči |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (727) occur only with one value of Gender
.
ADJ
595 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Polarity=Pos (572; 96%), Degree=Pos (560; 94%), Aspect=EMPTY (534; 90%), Voice=EMPTY (494; 83%), VerbForm=EMPTY (493; 83%), Number=Sing (425; 71%), Animacy=EMPTY (365; 61%).
ADJ
tokens may have the following values of Gender
:
Fem
(260; 44% of non-emptyGender
): bílé, jiné, nové, tmavou, věčné, černá, Vyloupena, Zsinalá, bledá, kalnáMasc
(248; 42% of non-emptyGender
): celý, plný, bílý, kamenném, tvrdém, věrni, Mnohý, Pozdní, divoké, jiníNeut
(87; 15% of non-emptyGender
): nesmírném, pozlátkové, smilnící, tichém, umdlená, věčné, Astartiných, Lepší, Oněmlé, bledéEMPTY
(2): aj, marně
Paradigm tichý | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Gen|Number=Plur | tichých | ||
Animacy=Inan|Case=Acc|Number=Sing | tichý | ||
Animacy=Inan|Case=Nom|Number=Sing | tichý | ||
Case=Acc|Number=Sing | tichou | ||
Case=Ins|Number=Sing | tichou | ||
Case=Loc|Number=Sing | tiché | tichém | |
Case=Nom|Number=Sing | tiché | ||
Case=Nom|Number=Plur | tiché |
VERB
240 VERB tokens (32% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (239; 100%), Person=EMPTY (239; 100%), Voice=Act (236; 98%), Tense=Past (233; 97%), VerbForm=Part (232; 97%), Polarity=Pos (226; 94%), Number=Sing (191; 80%), Aspect=Imp (129; 54%).
VERB
tokens may have the following values of Gender
:
Fem
(62; 26% of non-emptyGender
): Rozhučela, povylétla, rozlívala, vzrostla, Krákorala, Spadaly, Zavedla, Zřídila, bila, blysklaMasc
(168; 70% of non-emptyGender
): měl, chtěl, poznali, viděl, ctil, šel, cítil, dal, klečel, kmitalNeut
(10; 4% of non-emptyGender
): nezaneslo, nezřely, přešlo, přihodilo, sila, svitlo, vyschla, zklamalo, zrálo, zůstalyEMPTY
(503): jdou, letí, chce, jde, zdá, chcem, hledá, mám, stojí, vím
Paradigm zůstat | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim | zůstali | ||
zůstaly | zůstaly |
Gender
seems to be lexical feature of VERB
. 93% lemmas (165) occur only with one value of Gender
.
DET
223 DET tokens (77% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (177; 79%), Number[psor]=EMPTY (177; 79%), Person=EMPTY (176; 79%), Animacy=EMPTY (172; 77%), Reflex=EMPTY (172; 77%), Poss=EMPTY (128; 57%).
DET
tokens may have the following values of Gender
:
Fem
(76; 34% of non-emptyGender
): tvé, své, té, svou, ta, moje, která, mou, naší, tvouMasc
(78; 35% of non-emptyGender
): náš, ten, ti, každý, svůj, sám, které, všechny, žádné, jejíNeut
(69; 31% of non-emptyGender
): to, tom, ty, své, vše, vším, svá, svého, tvé, tímEMPTY
(66): jeho, jejich, svých, jich, tolika, těm, Ti, svoje, tolik, tvých
Paradigm ten | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Dat|Number=Plur | těm | ||
Animacy=Anim|Case=Gen|Number=Sing | toho | ||
Animacy=Anim|Case=Nom|Number=Plur | ti | ||
Animacy=Inan|Case=Acc|Number=Sing | ten, sěn | ||
Case=Acc|Number=Sing | tu | to | |
Case=Acc|Number=Plur | ty | ta | |
Case=Dat|Number=Sing | té | tomu | |
Case=Gen|Number=Sing | té | ||
Case=Ins|Number=Sing | tím | ||
Case=Loc|Number=Sing | té | tom | |
Case=Nom|Number=Sing | ten | ta | to |
Case=Nom|Number=Plur | Ty | ty |
PRON
110 PRON tokens (29% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (110; 100%), Variant=EMPTY (100; 91%), Number=Sing (86; 78%), Animacy=EMPTY (63; 57%), Person=EMPTY (56; 51%).
PRON
tokens may have the following values of Gender
:
Fem
(37; 34% of non-emptyGender
): jež, jí, ji, níž, ni, Vlasti, je, již, nich, níMasc
(66; 60% of non-emptyGender
): jenž, mu, jež, ho, jej, kdo, ním, Němu, jemu, nikdoNeut
(7; 6% of non-emptyGender
): jež, všecko, ním, něco, všeckaEMPTY
(271): se, co, mi, si, já, ty, tě, tobě, nás, ti
Paradigm on | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing|Person=3|PrepCase=Npr | jej | ||
Animacy=Anim|Case=Acc|Number=Sing|Person=3|Variant=Short | ho | ||
Animacy=Anim|Case=Acc|Number=Sing|PrepCase=Pre | něj | ||
Animacy=Anim|Case=Dat|Number=Sing|Person=3|PrepCase=Npr | jemu, mu | ||
Animacy=Anim|Case=Dat|Number=Sing|Person=3|PrepCase=Pre | Němu | ||
Animacy=Anim|Case=Dat|Number=Sing|Person=3|Variant=Short | mu | ||
Animacy=Anim|Case=Ins|Number=Sing|Person=3|PrepCase=Pre | ním | ||
Animacy=Anim|Case=Nom|Number=Sing|Person=3 | On | ||
Animacy=Inan|Case=Gen|Number=Sing|Person=3|PrepCase=Pre | něho | ||
Animacy=Inan|Case=Loc|Number=Sing|Person=3|PrepCase=Pre | něm | ||
Case=Acc|Number=Sing|Person=3|PrepCase=Npr | jej | ji | |
Case=Acc|Number=Sing|Person=3|PrepCase=Pre | něj | ni | |
Case=Acc|Number=Sing|Person=3|Variant=Short | ho | ||
Case=Acc|Number=Plur|Person=3|PrepCase=Npr | je | ||
Case=Dat|Number=Sing|Person=3|PrepCase=Npr | jí | ||
Case=Dat|Number=Sing|Person=3|PrepCase=Pre | Němu | ||
Case=Dat|Number=Sing|Person=3|Variant=Short | mu | ||
Case=Gen|Number=Plur|Person=3|PrepCase=Pre | nich | ||
Case=Ins|Number=Sing|Person=3|PrepCase=Npr | Jí | ||
Case=Ins|Number=Sing|Person=3|PrepCase=Pre | ním | ní | ním |
Case=Nom|Number=Sing|Person=3 | on |
PROPN
84 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (73; 87%), Animacy=Anim (44; 52%).
PROPN
tokens may have the following values of Gender
:
Fem
(27; 32% of non-emptyGender
): Magdaleno, Maria, Svitava, Vltavy, Evy, Francii, Golgatu, Lůnou, Madonny, PannoMasc
(55; 65% of non-emptyGender
): Armand, Sion, Angelico, Armandovi, Azték, Bajušáku, Baudelaira, Chodováci, Diderota, DudákoviNeut
(2; 2% of non-emptyGender
): Labe
Gender
seems to be lexical feature of PROPN
. 100% lemmas (69) occur only with one value of Gender
.
AUX
18 AUX tokens (13% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Aspect=Imp (18; 100%), Mood=EMPTY (18; 100%), Person=EMPTY (18; 100%), Tense=Past (18; 100%), VerbForm=Part (18; 100%), Voice=Act (18; 100%), Polarity=Pos (17; 94%), Number=Sing (15; 83%).
AUX
tokens may have the following values of Gender
:
Fem
(4; 22% of non-emptyGender
): Nebyla, byla, byly, bývalyMasc
(9; 50% of non-emptyGender
): byl, byli, jsiNeut
(5; 28% of non-emptyGender
): byloEMPTY
(118): je, by, jsem, jste, jest, jsi, jsou, budeš, bude, bych
Paradigm být | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Number=Sing|Polarity=Pos | byl | ||
Animacy=Anim|Number=Plur|Polarity=Pos | byli | ||
Number=Sing|Polarity=Neg | Nebyla | ||
Number=Sing|Polarity=Pos | byl, jsi | byla | bylo |
Number=Plur|Polarity=Pos | byly |
NUM
10 NUM tokens (56% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumForm=Word (10; 100%), NumType=Card (10; 100%), Number=Sing (9; 90%), Case=Nom (6; 60%).
NUM
tokens may have the following values of Gender
:
Fem
(3; 30% of non-emptyGender
): jedna, jednou, jednéMasc
(6; 60% of non-emptyGender
): jeden, dvaNeut
(1; 10% of non-emptyGender
): jednomEMPTY
(8): dvé, 80, Deset, dvacet, obou, tisícem, šesti
Paradigm jeden | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Nom | jeden | ||
Case=Gen | jedné | ||
Case=Ins | jednou | ||
Case=Loc | jednom | ||
Case=Nom | jeden | jedna |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (409; 96%),
NOUN –[det]–> DET (142; 74%),
ADJ –[conj]–> ADJ (41; 95%),
VERB –[conj]–> VERB (34; 59%),
VERB –[nsubj]–> PROPN (14; 74%),
ADJ –[nsubj]–> NOUN (9; 100%),
PROPN –[amod]–> ADJ (9; 100%),
PROPN –[flat]–> PROPN (9; 90%),
ADJ –[nsubj:pass]–> NOUN (8; 100%),
NOUN –[dep]–> ADJ (8; 100%).