Treebank Statistics: UD_Slovak-SNK: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
This is a layered feature with the following layers: Gender, Gender[psor].
52554 tokens (50%) have a non-empty value of Gender
.
23376 types (90%) occur at least once with a non-empty value of Gender
.
11904 lemmas (84%) occur at least once with a non-empty value of Gender
.
The feature is used with 9 part-of-speech tags: NOUN (21661; 20% instances), ADJ (9466; 9% instances), VERB (9002; 8% instances), PROPN (4569; 4% instances), DET (4391; 4% instances), PRON (2081; 2% instances), AUX (730; 1% instances), NUM (620; 1% instances), ADV (34; 0% instances).
NOUN
21661 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (16426; 76%), Animacy=EMPTY (12499; 58%).
NOUN
tokens may have the following values of Gender
:
Fem
(8725; 40% of non-emptyGender
): vláda, chvíľu, mama, tvár, oblasti, vlády, chvíli, ruku, časť, izbyMasc
(9162; 42% of non-emptyGender
): roku, deň, rokov, ľudí, život, života, rokoch, čas, človek, pocitNeut
(3774; 17% of non-emptyGender
): oči, storočia, meno, deti, miesto, obdobie, mesta, slová, svetlo, slovo
Paradigm zákon | Masc | Neut |
---|---|---|
Animacy=Inan|Case=Acc|Number=Sing | zákon | |
Animacy=Inan|Case=Acc|Number=Plur | zákony | |
Animacy=Inan|Case=Gen|Number=Sing | zákona | |
Animacy=Inan|Case=Gen|Number=Plur | zákonov | |
Animacy=Inan|Case=Ins|Number=Sing | zákonom | |
Animacy=Inan|Case=Ins|Number=Plur | zákonmi | |
Animacy=Inan|Case=Loc|Number=Plur | zákonoch | |
Animacy=Inan|Case=Nom|Number=Sing | zákon | |
Animacy=Inan|Case=Nom|Number=Plur | zákony | |
Case=Gen|Number=Sing | zákona |
Gender
seems to be lexical feature of NOUN
. 100% lemmas (5090) occur only with one value of Gender
.
ADJ
9466 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=Pos (8737; 92%), Polarity=EMPTY (8276; 87%), VerbForm=EMPTY (8276; 87%), Voice=EMPTY (8276; 87%), Number=Sing (6932; 73%), Animacy=EMPTY (5500; 58%).
ADJ
tokens may have the following values of Gender
:
Fem
(3843; 41% of non-emptyGender
): druhej, veľkej, prvá, slovenskej, verejných, štátnej, Makovej, európskej, prvej, veľkúMasc
(3966; 42% of non-emptyGender
): celý, prvý, druhý, veľký, ďalší, nový, veľkého, jediný, prvým, novéhoNeut
(1657; 18% of non-emptyGender
): veľké, celé, ľudské, možné, ďalšie, známe, jasné, nové, prvé, maléEMPTY
(8): VYSYPANÉ, požadovaných, rôznych, tradovaných, väčšími, zachytených, zamračení, šetriacich
Paradigm veľký | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Degree=Pos|Number=Sing | veľkého | ||
Animacy=Anim|Case=Acc|Degree=Sup|Number=Sing | najväčšieho | ||
Animacy=Anim|Case=Gen|Degree=Pos|Number=Sing | Veľkého | ||
Animacy=Anim|Case=Ins|Degree=Pos|Number=Sing | veľkým | ||
Animacy=Anim|Case=Ins|Degree=Pos|Number=Plur | veľkými | ||
Animacy=Anim|Case=Ins|Degree=Sup|Number=Sing | najväčším | ||
Animacy=Anim|Case=Nom|Degree=Pos|Number=Sing | veľký | ||
Animacy=Anim|Case=Nom|Degree=Cmp|Number=Sing | väčší | ||
Animacy=Inan|Case=Acc|Degree=Pos|Number=Sing | veľký | ||
Animacy=Inan|Case=Acc|Degree=Cmp|Number=Sing | väčší | ||
Animacy=Inan|Case=Acc|Degree=Cmp|Number=Plur | väčšie | ||
Animacy=Inan|Case=Dat|Degree=Pos|Number=Plur | veľkým | ||
Animacy=Inan|Case=Gen|Degree=Pos|Number=Sing | veľkého | ||
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plur | veľkých | ||
Animacy=Inan|Case=Gen|Degree=Cmp|Number=Sing | väčšieho | ||
Animacy=Inan|Case=Gen|Degree=Cmp|Number=Plur | väčších | ||
Animacy=Inan|Case=Ins|Degree=Pos|Number=Sing | veľkým | ||
Animacy=Inan|Case=Ins|Degree=Pos|Number=Plur | veľkými | ||
Animacy=Inan|Case=Ins|Degree=Cmp|Number=Sing | väčším | ||
Animacy=Inan|Case=Ins|Degree=Sup|Number=Sing | najväčším | ||
Animacy=Inan|Case=Loc|Degree=Pos|Number=Sing | veľkom | ||
Animacy=Inan|Case=Nom|Degree=Pos|Number=Sing | veľký | ||
Animacy=Inan|Case=Nom|Degree=Pos|Number=Plur | veľké | ||
Animacy=Inan|Case=Nom|Degree=Cmp|Number=Sing | väčší | ||
Animacy=Inan|Case=Nom|Degree=Sup|Number=Sing | najväčší | ||
Case=Acc|Degree=Pos|Number=Sing | veľkú | veľké | |
Case=Acc|Degree=Pos|Number=Plur | veľké | veľké | |
Case=Acc|Degree=Cmp|Number=Sing | väčšiu | ||
Case=Acc|Degree=Sup|Number=Sing | najväčšiu | najväčšie | |
Case=Dat|Degree=Pos|Number=Sing | veľkej | ||
Case=Gen|Degree=Pos|Number=Sing | veľkej | veľkého | |
Case=Gen|Degree=Pos|Number=Plur | veľkých | ||
Case=Gen|Degree=Sup|Number=Plur | najväčších | ||
Case=Ins|Degree=Pos|Number=Sing | veľkou | veľkým | |
Case=Ins|Degree=Pos|Number=Plur | veľkými | veľkými | |
Case=Ins|Degree=Sup|Number=Sing | najväčšou | najväčším | |
Case=Ins|Degree=Sup|Number=Plur | Najväčšími | ||
Case=Loc|Degree=Pos|Number=Sing | veľkej | veľkom | |
Case=Loc|Degree=Cmp|Number=Sing | väčšom | ||
Case=Nom|Degree=Pos|Number=Sing | veľká | veľké | |
Case=Nom|Degree=Pos|Number=Plur | veľké | ||
Case=Nom|Degree=Cmp|Number=Sing | väčšia | ||
Case=Nom|Degree=Cmp|Number=Plur | Väčšie | ||
Case=Nom|Degree=Sup|Number=Sing | najväčšia | Najväčšie |
VERB
9002 VERB tokens (64% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (9001; 100%), Person=EMPTY (9001; 100%), Tense=Past (9000; 100%), VerbForm=Part (9000; 100%), Polarity=Pos (8312; 92%), Number=Sing (7792; 87%), Aspect=Perf (5438; 60%).
VERB
tokens may have the following values of Gender
:
Fem
(3007; 33% of non-emptyGender
): povedala, mala, bola, odvetila, zvolala, spýtala, chcela, začala, nemala, pozrelaMasc
(5112; 57% of non-emptyGender
): mal, povedal, bol, odvetil, spýtal, začal, stal, chcel, prišiel, vedelNeut
(883; 10% of non-emptyGender
): bolo, stalo, malo, podarilo, došlo, nestalo, zdalo, trvalo, išlo, napadloEMPTY
(5066): je, má, ide, môže, mám, majú, musí, musím, sú, chcem
Paradigm mať | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Number=Sing|Polarity=Neg | nemal | ||
Animacy=Anim|Number=Sing|Polarity=Pos | mal | ||
Animacy=Anim|Number=Plur|Polarity=Neg | nemali | ||
Animacy=Anim|Number=Plur|Polarity=Pos | mali | ||
Animacy=Inan|Number=Sing|Polarity=Neg | nemal | ||
Animacy=Inan|Number=Sing|Polarity=Pos | mal | ||
Animacy=Inan|Number=Plur|Polarity=Pos | mali | ||
Number=Sing|Polarity=Neg | nemala | nemalo | |
Number=Sing|Polarity=Pos | mala | malo | |
Number=Plur|Polarity=Neg | nemali | ||
Number=Plur|Polarity=Pos | mali | mali |
PROPN
4569 PROPN tokens (95% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (4406; 96%), Case=Nom (2783; 61%), Animacy=Anim (2570; 56%).
PROPN
tokens may have the following values of Gender
:
Fem
(1283; 28% of non-emptyGender
): Maja, Lori, Jazmína, Blythe, Amy, Marga, Irma, Maju, Makulienka, DelinaMasc
(2995; 66% of non-emptyGender
): Chris, Winston, Aladin, Mauglí, Vilko, Herkules, Abu, Bush, Baghíra, FerdoNeut
(291; 6% of non-emptyGender
): Uhorska, Slovensku, Nemecka, Nemecku, Nemecko, Slovenska, Slovensko, Slnka, Slnko, TalianskaEMPTY
(261): J, SR, USA, EÚ, P, A, N, V, B, C
Paradigm maja | Masc | Fem |
---|---|---|
Animacy=Anim|Case=Dat | Maji | |
Case=Acc | Maju | |
Case=Dat | Maji | |
Case=Gen | Maji | |
Case=Ins | Majou | |
Case=Loc | Maji | |
Case=Nom | Maja |
Gender
seems to be lexical feature of PROPN
. 100% lemmas (1682) occur only with one value of Gender
.
DET
4391 DET tokens (100% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Gender[psor]=EMPTY (3913; 89%), Number[psor]=EMPTY (3464; 79%), Person=EMPTY (3464; 79%), Number=Sing (3447; 79%), Poss=EMPTY (3138; 71%), Animacy=EMPTY (2870; 65%).
DET
tokens may have the following values of Gender
:
Fem
(1259; 29% of non-emptyGender
): jeho, ktorá, jej, ktoré, tejto, svojej, tá, táto, ich, tejMasc
(1521; 35% of non-emptyGender
): ktorý, jeho, ten, jej, môj, tento, každý, ktoré, všetci, ktoríNeut
(1611; 37% of non-emptyGender
): to, jeho, toho, všetko, ktoré, tom, tomu, toto, jej, svojeEMPTY
(13): ktoré, jeho, ta, Vaše, ich, toľko
Paradigm ktorý | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing | ktorého | ||
Animacy=Anim|Case=Acc|Number=Plur | ktorých | ||
Animacy=Anim|Case=Dat|Number=Sing | ktorému, ktorým | ||
Animacy=Anim|Case=Dat|Number=Plur | ktorým | ||
Animacy=Anim|Case=Gen|Number=Sing | ktorého | ||
Animacy=Anim|Case=Gen|Number=Plur | ktorých | ||
Animacy=Anim|Case=Ins|Number=Sing | ktorým | ||
Animacy=Anim|Case=Ins|Number=Plur | ktorými | ||
Animacy=Anim|Case=Nom|Number=Sing | ktorý | ||
Animacy=Anim|Case=Nom|Number=Plur | ktorí | ||
Animacy=Anim|Case=Nom|Number=Plur|Typo=Yes | ktorý | ||
Animacy=Inan|Case=Acc|Number=Sing | ktorý | ||
Animacy=Inan|Case=Acc|Number=Plur | ktoré | ||
Animacy=Inan|Case=Dat|Number=Sing | ktorému, ktorým | ||
Animacy=Inan|Case=Gen|Number=Sing | ktorého | ||
Animacy=Inan|Case=Gen|Number=Plur | ktorých | ||
Animacy=Inan|Case=Ins|Number=Sing | ktorým | ||
Animacy=Inan|Case=Ins|Number=Plur | ktorými | ||
Animacy=Inan|Case=Loc|Number=Sing | ktorom | ||
Animacy=Inan|Case=Loc|Number=Plur | ktorých | ||
Animacy=Inan|Case=Nom|Number=Sing | ktorý | ||
Animacy=Inan|Case=Nom|Number=Plur | ktoré | ||
Animacy=Inan|Case=Nom|Number=Plur|Typo=Yes | ktorý | ||
Case=Acc|Number=Sing | ktorú | ktoré | |
Case=Acc|Number=Plur | ktoré | ktoré | |
Case=Dat|Number=Sing | ktorej | ktorému | |
Case=Gen|Number=Sing | ktorej | ktorého | |
Case=Gen|Number=Plur | ktorých | ||
Case=Ins|Number=Sing | ktorou | ktorým | |
Case=Ins|Number=Plur | ktorými | ktorými | |
Case=Loc|Number=Sing | ktorej | ktorom | |
Case=Loc|Number=Plur | ktorých | ktorých | |
Case=Nom|Number=Sing | ktorá | ktoré | |
Case=Nom|Number=Plur | ktoré | ktoré | |
Case=Nom|Number=Plur|Typo=Yes | ktorí |
PRON
2081 PRON tokens (32% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (2081; 100%), Number=Sing (1784; 86%), PronType=Prs (1466; 70%), Person=3 (1407; 68%), Animacy=EMPTY (1056; 51%).
PRON
tokens may have the following values of Gender
:
Fem
(509; 24% of non-emptyGender
): ju, jej, nej, ona, ňou, ich, ňu, nich, im, neMasc
(1025; 49% of non-emptyGender
): ho, mu, ich, neho, im, kto, nich, ním, nikto, onNeut
(547; 26% of non-emptyGender
): čo, niečo, nič, ho, ich, čosi, ňom, nich, čím, všetkoEMPTY
(4354): sa, si, mi, ma, ja, mňa, nás, mne, ťa, ty
Paradigm on | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing|Person=3 | ho, Jeho, neho | ||
Animacy=Anim|Case=Acc|Number=Sing | neho | ||
Animacy=Anim|Case=Acc|Number=Plur|Person=3 | ich, nich | ||
Animacy=Anim|Case=Dat|Number=Sing|Person=3 | mu, nemu, jemu | ||
Animacy=Anim|Case=Dat|Number=Plur|Person=3 | im, nim | ||
Animacy=Anim|Case=Gen|Number=Sing|Person=3 | neho, ho | ||
Animacy=Anim|Case=Gen|Number=Sing | neho | ||
Animacy=Anim|Case=Gen|Number=Plur|Person=3 | nich | ||
Animacy=Anim|Case=Ins|Number=Sing|Person=3 | ním | ||
Animacy=Anim|Case=Ins|Number=Plur|Person=3 | nimi | ||
Animacy=Anim|Case=Loc|Number=Sing|Person=3 | ňom | ||
Animacy=Anim|Case=Loc|Number=Plur|Person=3 | nich | ||
Animacy=Anim|Case=Nom|Number=Sing|Person=3 | on | ||
Animacy=Anim|Case=Nom|Number=Plur|Person=3 | oni | ||
Animacy=Inan|Case=Acc|Number=Sing|Person=3 | ho, neho | ||
Animacy=Inan|Case=Acc|Number=Sing | neho | ||
Animacy=Inan|Case=Acc|Number=Plur|Person=3 | ich, ne | ||
Animacy=Inan|Case=Dat|Number=Sing|Person=3 | nemu, mu | ||
Animacy=Inan|Case=Dat|Number=Plur|Person=3 | im, nim | ||
Animacy=Inan|Case=Gen|Number=Sing|Person=3 | neho, ho | ||
Animacy=Inan|Case=Gen|Number=Sing | neho | ||
Animacy=Inan|Case=Gen|Number=Plur|Person=3 | nich, ich | ||
Animacy=Inan|Case=Ins|Number=Sing|Person=3 | ním | ||
Animacy=Inan|Case=Ins|Number=Plur|Person=3 | nimi | ||
Animacy=Inan|Case=Loc|Number=Sing|Person=3 | ňom | ||
Animacy=Inan|Case=Loc|Number=Plur|Person=3 | nich | ||
Case=Acc|Number=Sing|Person=3 | ju, ňu | ho | |
Case=Acc|Number=Plur|Person=3 | ich, ne | ich | |
Case=Dat|Number=Sing|Person=3 | jej, nej | mu | |
Case=Dat|Number=Plur|Person=3 | im | ||
Case=Loc|Number=Sing|Person=3 | ňom |
AUX
730 AUX tokens (20% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Aspect=Imp (730; 100%), Mood=EMPTY (730; 100%), Person=EMPTY (730; 100%), Tense=Past (730; 100%), VerbForm=Part (730; 100%), Polarity=Pos (675; 92%), Number=Sing (614; 84%).
AUX
tokens may have the following values of Gender
:
Fem
(240; 33% of non-emptyGender
): bola, boli, nebola, neboli, bývali, bývalaMasc
(341; 47% of non-emptyGender
): bol, boli, nebol, neboli, býval, bývaliNeut
(149; 20% of non-emptyGender
): bolo, nebolo, boliEMPTY
(2993): som, je, sme, by, sú, bude, si, ste, byť, budú
Paradigm byť | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Number=Sing|Polarity=Neg | nebol | ||
Animacy=Anim|Number=Sing|Polarity=Pos | bol | ||
Animacy=Anim|Number=Plur|Polarity=Neg | neboli | ||
Animacy=Anim|Number=Plur|Polarity=Pos | boli | ||
Animacy=Inan|Number=Sing|Polarity=Neg | nebol | ||
Animacy=Inan|Number=Sing|Polarity=Pos | bol | ||
Animacy=Inan|Number=Plur|Polarity=Neg | neboli | ||
Animacy=Inan|Number=Plur|Polarity=Pos | boli | ||
Number=Sing|Polarity=Neg | nebola | nebolo | |
Number=Sing|Polarity=Pos | bola | bolo | |
Number=Plur|Polarity=Neg | neboli | ||
Number=Plur|Polarity=Pos | boli | boli |
NUM
620 NUM tokens (39% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumForm=EMPTY (620; 100%), Animacy=EMPTY (362; 58%), Number=Sing (323; 52%).
NUM
tokens may have the following values of Gender
:
Fem
(163; 26% of non-emptyGender
): dve, jednej, jedna, jednou, jednu, tri, dvoch, obe, troch, obidvochMasc
(258; 42% of non-emptyGender
): jeden, dva, jedného, obaja, tri, dvoch, jedným, štyri, dvaja, jednomNeut
(199; 32% of non-emptyGender
): veľa, jedno, mnoho, päť, zopár, trochu, viac, desať, dvadsať, viaceroEMPTY
(953): II, 1, 11, 2, I, 2004, 4, 20, 10, III
Paradigm jeden | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing | jedného | ||
Animacy=Anim|Case=Dat|Number=Sing | jednému | ||
Animacy=Anim|Case=Ins|Number=Sing | jedným | ||
Animacy=Anim|Case=Loc|Number=Sing | jednom | ||
Animacy=Anim|Case=Nom|Number=Sing | jeden | ||
Animacy=Inan|Case=Acc|Number=Sing | jeden | ||
Animacy=Inan|Case=Gen|Number=Sing | jedného | ||
Animacy=Inan|Case=Ins|Number=Sing | jedným | ||
Animacy=Inan|Case=Loc|Number=Sing | jednom | ||
Animacy=Inan|Case=Nom|Number=Sing | jeden | ||
Animacy=Inan|Case=Nom|Number=Plur | jedny | ||
Case=Acc|Number=Sing | jednu | jedno | |
Case=Gen|Number=Sing | jednej | jedného | |
Case=Ins|Number=Sing | jednou | jedným | |
Case=Loc|Number=Sing | jednej | jednom | |
Case=Nom|Number=Sing | jedna | jedno, jeden |
ADV
34 ADV tokens (1% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: Degree=EMPTY (34; 100%), PronType=EMPTY (34; 100%).
ADV
tokens may have the following values of Gender
:
Masc
(34; 100% of non-emptyGender
): raz, ráz, razyEMPTY
(4411): veľmi, potom, tu, tam, kde, tak, opäť, vtedy, ako, nikdy
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (7697; 100%),
NOUN –[det]–> DET (2456; 99%),
VERB –[nsubj]–> NOUN (2089; 67%),
VERB –[nsubj]–> PROPN (1457; 88%),
VERB –[conj]–> VERB (1001; 73%),
NOUN –[conj]–> NOUN (555; 51%),
PROPN –[nmod]–> PROPN (435; 75%),
VERB –[nsubj]–> DET (394; 66%),
PROPN –[nmod]–> NOUN (354; 83%),
VERB –[advcl]–> VERB (320; 54%).