Treebank Statistics: UD_Icelandic-PUD: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
7484 tokens (40%) have a non-empty value of Gender.
4656 types (71%) occur at least once with a non-empty value of Gender.
3330 lemmas (69%) occur at least once with a non-empty value of Gender.
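The token, type and lemma counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `gender_coverage`) and the three-token sample are hypothetical, and it assumes types are compared case-sensitively, which the page does not state.

```python
def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|") if "=" in pair)


def read_tokens(conllu_text):
    """Yield (form, lemma, feats) for each syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], parse_feats(cols[5])


def percent(part, whole):
    """Rounded percentage, 0 when the denominator is empty."""
    return round(100 * part / whole) if whole else 0


def gender_coverage(conllu_text, feature="Gender"):
    """Count tokens, types and lemmas seen at least once with the feature."""
    tokens = list(read_tokens(conllu_text))
    marked = [t for t in tokens if feature in t[2]]
    result = {}
    for name, index in (("types", 0), ("lemmas", 1)):
        every = {t[index] for t in tokens}
        hit = {t[index] for t in marked}
        result[name] = (len(hit), percent(len(hit), len(every)))
    result["tokens"] = (len(marked), percent(len(marked), len(tokens)))
    return result


# Hypothetical three-token sample, not taken from the treebank.
ROWS = [
    ["1", "hún", "hún", "PRON", "_", "Case=Nom|Gender=Fem|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "sagði", "segja", "VERB", "_", "Mood=Ind|Tense=Past", "0", "root", "_", "_"],
    ["3", "sögu", "saga", "NOUN", "_", "Case=Acc|Gender=Fem|Number=Sing", "2", "obj", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)
print(gender_coverage(SAMPLE))
```

Each entry of the returned dict is a `(count, percentage)` pair, matching the "N (P%)" shape used in the statements above.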
The feature is used with 11 part-of-speech tags: NOUN (4074; 22% instances), PRON (1249; 7% instances), ADJ (1150; 6% instances), PROPN (616; 3% instances), VERB (241; 1% instances), NUM (118; 1% instances), DET (22; 0% instances), ADV (10; 0% instances), SCONJ (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances).
NOUN
4074 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Definite=Ind (2933; 72%), Number=Sing (2792; 69%).
NOUN tokens may have the following values of Gender:
Fem (1333; 33% of non-empty Gender): öld, sögu, milljónir, stjórn, hendi, leið, notkun, menntun, ríkisstjórnin, sögunnar
Masc (1301; 32% of non-empty Gender): hluta, október, stað, fjölda, áratugnum, maður, apríl, tíma, janúar, júní
Neut (1440; 35% of non-empty Gender): árið, ár, árum, ára, fólk, áhrif, sinn, borð, kjölfar, efni
EMPTY (27): hafi, FSLN, Frú, Metropolitan, Norður-Ontario, Papworth-, alvarlegust, ati, dala, dr.
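The "% of non-empty Gender" figures are each value's share of the tokens that carry the feature at all, so the EMPTY tokens are left out of the denominator. A small worked check using the NOUN counts above (the function name `value_shares` is hypothetical):

```python
def value_shares(counts):
    """Map each value to (count, rounded percentage of the non-empty total)."""
    total = sum(counts.values())
    return {
        value: (n, round(100 * n / total) if total else 0)
        for value, n in counts.items()
    }


# Counts copied from the NOUN value list above; EMPTY (27) is excluded on purpose.
noun_gender = {"Fem": 1333, "Masc": 1301, "Neut": 1440}
print(sum(noun_gender.values()))
print(value_shares(noun_gender))
```

The three counts sum to 4074, the number of NOUN tokens with a non-empty value, and the rounded shares come out as 33, 32 and 35.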
Paradigm ríki | Fem | Neut |
---|---|---|
Case=Acc|Definite=Ind|Number=Sing | ríki | |
Case=Dat|Definite=Def|Number=Sing | ríkinu | |
Case=Dat|Definite=Ind|Number=Sing | ríki | |
Case=Dat|Definite=Ind|Number=Plur | ríkjum | |
Case=Gen|Definite=Def|Number=Sing | ríkisins | |
Case=Gen|Definite=Def|Number=Plur | ríkjanna | |
Case=Gen|Definite=Ind|Number=Sing | ríkis | |
Case=Gen|Definite=Ind|Number=Plur | ríkja | |
Case=Nom|Definite=Def|Number=Sing | ríkið | |
Case=Nom|Definite=Ind|Number=Sing | ríki |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (2207) occur only with one value of Gender.
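One way to test whether a feature behaves lexically for a part of speech is to collect, per lemma, the set of values it occurs with and count the lemmas that have exactly one. The sketch below assumes that this is what the statement above measures; the function name and the toy rows are hypothetical, not treebank data.

```python
from collections import defaultdict


def single_value_lemma_share(rows, upos):
    """rows: iterable of (lemma, upos, gender), with gender None when empty.

    Returns (lemmas with one value, lemmas with any value, rounded percentage).
    """
    values = defaultdict(set)
    for lemma, tag, gender in rows:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    share = round(100 * single / len(values)) if values else 0
    return single, len(values), share


# Toy rows for illustration only.
TOY_ROWS = [
    ("öld", "NOUN", "Fem"),
    ("öld", "NOUN", "Fem"),
    ("maður", "NOUN", "Masc"),
    ("ríki", "NOUN", "Neut"),
    ("ríki", "NOUN", "Fem"),
    ("hún", "PRON", "Fem"),
    ("FSLN", "NOUN", None),
]
print(single_value_lemma_share(TOY_ROWS, "NOUN"))
```

A lemma that appears with two values, like the toy `ríki` rows here, lowers the share. With thousands of lemmas, a handful of such cases can still round to 100%.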
PRON
1249 PRON tokens (91% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (967; 77%), PronType=Prs (686; 55%).
PRON tokens may have the following values of Gender:
Fem (253; 20% of non-empty Gender): hún, hennar, sinni, þær, sína, henni, sér, þeirra, hana, þessi
Masc (433; 35% of non-empty Gender): hann, hans, þeir, sig, honum, þeirra, sér, þeim, sínum, einn
Neut (563; 45% of non-empty Gender): það, því, þetta, þess, þau, allt, hvað, sitt, engu, sér
EMPTY (121): ég, við, okkar, okkur, mig, mér, þið, þú, hvort, mín
Paradigm þessi | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | þennan | þessa | þetta |
Case=Acc|Number=Plur | þessa | þessar | þessi |
Case=Dat|Number=Sing | þessum | þessari | þessu, þessum |
Case=Dat|Number=Plur | þessum | þessum | |
Case=Gen|Number=Sing | þessa | þessarar | þessa |
Case=Gen|Number=Plur | þessara | þessara | |
Case=Nom|Number=Sing | Þessi | þessi | þetta |
Case=Nom|Number=Plur | Þessar | þessi, Þessir |
ADJ
1150 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (917; 80%), Number=Sing (775; 67%), Definite=Ind (724; 63%).
ADJ tokens may have the following values of Gender:
Fem (321; 28% of non-empty Gender): eigin, fyrstu, meiri, nýja, nýjar, síðustu, fyrri, lifandi, mikla, bestu
Masc (350; 30% of non-empty Gender): margir, minnsta, þriðja, fyrsti, gjaldþrota, hefðbundna, hærri, marga, næsta, síðasta
Neut (479; 42% of non-empty Gender): hægt, fyrsta, meira, miklu, mikið, mörg, kleift, nýja, ótrúlegt, besta
EMPTY (17): Fyrst, fremst, alfarið, fæst, fúnir, meira, nóg, ný, rétt, samhliða
Paradigm mikill | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Def|Degree=Pos|Number=Sing | mikla | ||
Case=Acc|Definite=Def|Degree=Cmp|Number=Sing | meiri | meira | |
Case=Acc|Definite=Def|Degree=Sup|Number=Sing | mesta | ||
Case=Acc|Definite=Def|Degree=Sup|Number=Plur | mestu | ||
Case=Acc|Definite=Ind|Degree=Pos|Number=Sing | mikinn | mikla | mikið |
Case=Acc|Definite=Ind|Degree=Pos|Number=Plur | miklar | mikil | |
Case=Dat|Definite=Def|Degree=Cmp|Number=Sing | meiri | meiri | |
Case=Dat|Definite=Def|Degree=Sup|Number=Sing | mestu | mesta | |
Case=Dat|Definite=Ind|Degree=Pos|Number=Sing | miklu | ||
Case=Dat|Definite=Ind|Degree=Sup|Number=Sing | mestu | ||
Case=Gen|Definite=Def|Degree=Pos|Number=Sing | mikla | ||
Case=Gen|Definite=Ind|Degree=Pos|Number=Sing | mikils | mikillar | |
Case=Nom|Definite=Def|Degree=Pos|Number=Sing | mikla | ||
Case=Nom|Definite=Def|Degree=Cmp|Number=Sing | meiri | meira | |
Case=Nom|Definite=Def|Degree=Sup|Number=Sing | mesta | mesta | |
Case=Nom|Definite=Ind|Degree=Pos|Number=Sing | mikill | mikil | mikið |
PROPN
616 PROPN tokens (42% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (506; 82%).
PROPN tokens may have the following values of Gender:
Fem (179; 29% of non-empty Gender): Evrópu, Ítalíu, Albaníu, Asíu, Suður-Ameríku, Makedóníu, Ástralíu, Þrakíu, Afríku, Ameríku
Masc (248; 40% of non-empty Gender): Krist, Breta, Kínverjar, Akkemenída, Balkanskaga, Evrópubúar, Frakka, Kristófer, Kínverja, Kólumbus
Neut (189; 31% of non-empty Gender): Bretlandi, Kína, Bandaríkin, Bandaríkjanna, Bretland, Bandaríkjunum, Frakklandi, Grikklandi, Kyrrahafi, Miðjarðarhaf
EMPTY (848): the, Trump, of, Hong, Kong, de, Clinton, Disney, Rafferty, a
Paradigm Heródes | Masc | Fem |
---|---|---|
Heródesar | Heródesar |
Gender seems to be a lexical feature of PROPN. 99% of lemmas (428) occur only with one value of Gender.
VERB
241 VERB tokens (12% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (241; 100%), Person=EMPTY (241; 100%), Case=Nom (232; 96%), Tense=Past (218; 90%), VerbForm=Part (218; 90%), Voice=Act (215; 89%), Number=Sing (184; 76%).
VERB tokens may have the following values of Gender:
Fem (57; 24% of non-empty Gender): orðin, notuð, birt, birtar, gerð, lögð, ræktaðar, afturkölluð, dvöldu, fluttar
Masc (57; 24% of non-empty Gender): lýstur, byggður, haldnir, orðnir, rekinn, sagður, sýndur, afturkallaði, bornir, bundinn
Neut (127; 53% of non-empty Gender): komið, notað, gert, farið, greint, litið, sagt, sett, stofnað, talið
EMPTY (1757): sagði, fór, varð, segir, fá, hafa, gera, kom, koma, nota
Paradigm segja | Masc | Fem | Neut |
---|---|---|---|
Number=Sing | sagður | sögð | sagt |
Number=Plur | sagðir |
NUM
118 NUM tokens (27% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (95; 81%).
NUM tokens may have the following values of Gender:
Fem (33; 28% of non-empty Gender): tvær, sex, tveggja, þremur, einnar, átta, ein, einni, fimm, fjórar
Masc (48; 41% of non-empty Gender): einn, tvo, tíu, fimm, fjórir, tveimur, 9., fjóra, sex, tveggja
Neut (37; 31% of non-empty Gender): eitt, tveggja, fjögur, tveimur, tvö, þrjú, sex, tíu, þrjátíu, 10.000
EMPTY (322): 8., I, 1, 100, 1492, 2010, 2012, 2014, 2015, 2017
Paradigm tveir | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | tvo | tvær | |
Case=Dat | tveimur | tveimur | tveimur |
Case=Gen | tveggja | tveggja | tveggja |
Case=Nom | Tveir | tvær | tvö |
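A paradigm table like the one above can be assembled by grouping the forms of one lemma: the remaining features form the row key and the Gender value picks the column. This is only a sketch of that grouping under stated assumptions. `build_paradigm` is a hypothetical name, and the toy tokens carry only Case and Gender, whereas real treebank tokens would normally have more features.

```python
from collections import defaultdict


def build_paradigm(tokens, lemma, feature="Gender"):
    """tokens: iterable of (form, lemma, feats_dict).

    Returns {row_key: {feature_value: sorted list of forms}} for one lemma.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # Row key: all other features, sorted by name and joined FEATS-style.
        row = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[row][feats[feature]].add(form)
    return {
        row: {value: sorted(forms) for value, forms in cols.items()}
        for row, cols in sorted(table.items())
    }


# Toy tokens with simplified feature sets, for illustration only.
TOY_TOKENS = [
    ("tvo", "tveir", {"Case": "Acc", "Gender": "Masc"}),
    ("tvær", "tveir", {"Case": "Acc", "Gender": "Fem"}),
    ("Tveir", "tveir", {"Case": "Nom", "Gender": "Masc"}),
    ("tvær", "tveir", {"Case": "Nom", "Gender": "Fem"}),
    ("tvö", "tveir", {"Case": "Nom", "Gender": "Neut"}),
    ("einn", "einn", {"Case": "Nom", "Gender": "Masc"}),
]
paradigm = build_paradigm(TOY_TOKENS, "tveir")
for row, cols in paradigm.items():
    print(row, cols)
```

Cells stay absent when a combination is not attested in the data, which is why a corpus-derived paradigm can have gaps.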
DET
22 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (15; 68%).
DET tokens may have the following values of Gender:
Fem (5; 23% of non-empty Gender): Hin, aðrar, hinnar, hinni, hinum
Masc (8; 36% of non-empty Gender): hinn, hinna, hins, hinum
Neut (9; 41% of non-empty Gender): hið, hinna, hins, hinu, þetta
Paradigm hinn | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | hinn | hið | |
Case=Dat|Number=Sing | hinni | hinu | |
Case=Dat|Number=Plur | hinum | hinum | |
Case=Gen|Number=Sing | hins | hinnar | hins |
Case=Gen|Number=Plur | hinna | hinna | |
Case=Nom|Number=Sing | hinn | Hin | hið |
ADV
10 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Fem (1; 10% of non-empty Gender): óvenju
Masc (4; 40% of non-empty Gender): mun, gríðarlega, þá
Neut (5; 50% of non-empty Gender): allt, loks, meira, vafningalaust, því
EMPTY (1204): ekki, þar, svo, á, fram, til, upp, einnig, enn, líka
SCONJ
2 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.
SCONJ tokens may have the following values of Gender:
Neut (2; 100% of non-empty Gender): því
EMPTY (722): sem, að, þegar, ef, en, þótt, hvort, meðan, Og, Eða
ADP
1 ADP token (0% of all ADP tokens) has a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Neut (1; 100% of non-empty Gender): við
EMPTY (2341): í, á, til, við, um, með, fyrir, af, frá, eftir
AUX
1 AUX token (0% of all AUX tokens) has a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=EMPTY (1; 100%), VerbForm=EMPTY (1; 100%), Voice=EMPTY (1; 100%).
AUX tokens may have the following values of Gender:
Neut (1; 100% of non-empty Gender): verði
EMPTY (973): er, var, voru, eru, hefur, verið, hafa, vera, hafði, höfðu
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (753; 93%),
NOUN –[det]–> PRON (242; 94%),
NOUN –[nmod:poss]–> PRON (120; 61%),
NOUN –[nsubj]–> NOUN (58; 51%),
ADJ –[nsubj]–> NOUN (47; 70%),
ADJ –[nsubj]–> PRON (30; 88%),
ADJ –[conj]–> ADJ (18; 86%),
ADJ –[det]–> PRON (14; 93%),
NOUN –[acl:relcl]–> ADJ (12; 67%),
ADJ –[expl]–> PRON (11; 100%).
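Agreement figures of this kind can be gathered by walking every dependency edge and comparing the Gender of the parent and child nodes. The sketch below assumes the percentage is taken over edges of the same pattern where both nodes have a non-empty Gender; the page does not state its denominator, so treat that as a guess. The function name, the dict-based token layout and the toy sentences are all hypothetical.

```python
from collections import Counter


def gender_agreement(sentences):
    """sentences: list of sentences, each a list of token dicts with the keys
    id, head, upos, deprel and gender (None when the feature is empty).

    Returns {(parent_upos, deprel, child_upos): (agreeing edges, rounded %)}.
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])  # None for the root (head 0)
            if parent is None or child["gender"] is None or parent["gender"] is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if parent["gender"] == child["gender"]:
                agree[key] += 1
    return {
        key: (agree[key], round(100 * agree[key] / total[key]))
        for key in total
        if agree[key]
    }


# Toy sentences for illustration; the second one disagrees on purpose.
TOY_SENTENCES = [
    [
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    ],
    [
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Neut"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    ],
]
print(gender_agreement(TOY_SENTENCES))
```

Sorting the result by the agreeing-edge count in descending order and keeping the first ten entries would give a list shaped like the one above.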