Treebank Statistics: UD_Icelandic-Modern: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
27967 tokens (35%) have a non-empty value of Gender
.
7728 types (76%) occur at least once with a non-empty value of Gender
.
4349 lemmas (74%) occur at least once with a non-empty value of Gender
.
The feature is used with 10 part-of-speech tags: NOUN (12915; 16% instances), PRON (4843; 6% instances), ADJ (3580; 4% instances), DET (3497; 4% instances), PROPN (1994; 2% instances), VERB (744; 1% instances), NUM (224; 0% instances), ADV (131; 0% instances), AUX (36; 0% instances), X (3; 0% instances).
NOUN
12915 NOUN tokens (95% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Definite=Ind (10003; 77%), Number=Sing (9176; 71%).
NOUN
tokens may have the following values of Gender
:
Fem
(3867; 30% of non-emptyGender
): leið, raun, ræðu, ríkisstjórn, umræðu, klukkan, upplýsingar, veru, vinnu, aðgerðirMasc
(4411; 34% of non-emptyGender
): forseti, menn, þingmaður, ráðherra, tíma, herra, dag, stað, vegar, þingmanniNeut
(4637; 36% of non-emptyGender
): mál, fólk, máli, málið, ár, ára, ári, sæti, dæmis, áriðEMPTY
(729): m, Frú, móti,, gr., nefnd, kr., stundum, k., allsherjar-
Paradigm lið | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Def|Number=Sing | liðið | ||
Case=Acc|Definite=Ind|Number=Sing | lið | ||
Case=Acc|Definite=Ind|Number=Plur | lið | lið | |
Case=Dat|Definite=Def|Number=Sing | liðinu | ||
Case=Dat|Definite=Def|Number=Plur | liðunum | ||
Case=Dat|Definite=Ind|Number=Sing | lið | liði | |
Case=Dat|Definite=Ind|Number=Plur | liðum | ||
Case=Gen|Definite=Def|Number=Sing | liðsins | ||
Case=Gen|Definite=Def|Number=Plur | liðanna | ||
Case=Gen|Definite=Ind|Number=Sing | liðs | ||
Case=Gen|Definite=Ind|Number=Plur | liða | ||
Case=Nom|Definite=Def|Number=Sing | Liðin | liðið | |
Case=Nom|Definite=Def|Number=Plur | liðin | ||
Case=Nom|Definite=Ind|Number=Sing | lið | ||
Case=Nom|Definite=Ind|Number=Plur | lið | ||
Case=Nom|Number=Sing|VerbForm=Part|Voice=Act | liðið |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (2701) occur only with one value of Gender
.
PRON
4843 PRON tokens (63% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Person=EMPTY (4843; 100%), Number=Sing (4266; 88%), PronType=Prs (4136; 85%).
PRON
tokens may have the following values of Gender
:
Fem
(565; 12% of non-emptyGender
): hún, þær, hana, sér, sinni, henni, hennar, minni, mín, aðrarMasc
(811; 17% of non-emptyGender
): hann, þeir, sér, sig, hans, sínum, honum, annars, öðrum, þeimNeut
(3467; 72% of non-emptyGender
): það, því, þess, hvað, þau, annað, sér, hverju, sig, annarsEMPTY
(2891): ég, við, mér, okkur, mig, því, okkar, maður, annars, það
Paradigm það | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing|PronType=Dem | það | ||
Case=Acc|Number=Sing|PronType=Prs | það | ||
Case=Acc|Number=Plur|PronType=Dem | þau | ||
Case=Acc|Number=Plur|PronType=Prs | þau | ||
Case=Dat|Number=Sing|PronType=Dem | því | ||
Case=Dat|Number=Sing|PronType=Prs | því | ||
Case=Dat|Number=Plur|PronType=Dem | þeim | þeim | |
Case=Dat|Number=Plur|PronType=Prs | þeim | þeim | þeim |
Case=Gen|Number=Sing|PronType=Dem | þess | þess | |
Case=Gen|Number=Sing|PronType=Prs | þess | ||
Case=Gen|Number=Plur|PronType=Prs | þeirra | þeirra | |
Case=Nom|Number=Sing | það | ||
Case=Nom|Number=Sing|PronType=Dem | það | ||
Case=Nom|Number=Sing|PronType=Prs | það | ||
Case=Nom|Number=Plur|PronType=Dem | þau | ||
Case=Nom|Number=Plur|PronType=Prs | þau |
ADJ
3580 ADJ tokens (83% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=Pos (2892; 81%), Number=Sing (2798; 78%), Definite=Ind (2272; 63%), Case=Nom (1891; 53%).
ADJ
tokens may have the following values of Gender
:
Fem
(847; 24% of non-emptyGender
): góð, fyrri, síðustu, næstu, betri, fyrstu, ánægð, mikla, sammála, góðaMasc
(995; 28% of non-emptyGender
): virðulegi, sammála, minnsta, besta, minni, nýjan, vinstri, fatlaðra, síðustu, vissNeut
(1738; 49% of non-emptyGender
): hægt, gott, rétt, miklu, fyrsta, mikilvægt, sjálfsögðu, ljóst, síðasta, erfittEMPTY
(737): hv., hæstv., sama, 2., 1., 3., 5., 8., 9., m.
Paradigm háttvirtur | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Ind|Number=Sing | háttvirtan | ||
Case=Acc|Definite=Ind|Number=Plur | háttvirt | ||
Case=Dat|Definite=Ind|Number=Plur | háttvirtum | ||
Case=Gen|Definite=Ind|Number=Sing | háttvirts | ||
Case=Nom|Definite=Def|Number=Sing | hv. | ||
Case=Nom|Definite=Ind|Number=Sing | háttvirtur, hv. | ||
Case=Nom|Definite=Ind|Number=Plur | háttvirtar |
DET
3497 DET tokens (94% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Definite=EMPTY (3119; 89%), Degree=EMPTY (3119; 89%), Number=Sing (2575; 74%), PronType=Dem (1841; 53%).
DET
tokens may have the following values of Gender
:
Fem
(681; 19% of non-emptyGender
): þá, þessa, þessari, sú, þessar, þeirri, þær, þessi, þeim, hvaðaMasc
(841; 24% of non-emptyGender
): þeim, allir, meiri, þann, hins, einhvern, þeir, alla, sá, enginnNeut
(1975; 56% of non-emptyGender
): þetta, það, þessu, allt, eitthvað, ekkert, því, þessi, þau, þeimEMPTY
(205): meira, eitt, mikið, 1, einn, ein, svolítið, einu, þetta, einum
Paradigm þessi | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | þennan | þessa | þetta |
Case=Acc|Number=Plur | þessa | þessar | þessi |
Case=Dat|Number=Sing | þessum | þessari | þessu |
Case=Dat|Number=Plur | þessum | þessum | þessum |
Case=Gen|Number=Sing | þessa | þessarar | þessa |
Case=Gen|Number=Plur | þessara | þessara | þessara |
Case=Nom|Number=Sing | þessi | þessi | þetta |
Case=Nom|Number=Plur | þessir | þessar | þessi |
PROPN
1994 PROPN tokens (73% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (1755; 88%), Definite=Ind (1701; 85%).
PROPN
tokens may have the following values of Gender
:
Fem
(512; 26% of non-emptyGender
): Hrafnhildur, Bryndís, Evrópu, Chusovitina, Rún, Danmörku, Lúthersdóttir, Brasilíu, Grótta, ParísMasc
(1054; 53% of non-emptyGender
): Ólympíuleikunum, Blöndal, Íslendingar, Ólympíuleikum, Þór, Jón, Pétur, Arnar, Forseti, ValurNeut
(428; 21% of non-emptyGender
): Íslands, Ríó, Ísland, Alþingi, Íslandi, Frakklandi, Alþingis, Evrópusambandinu, Evrópusambandið, EvrópumótinuEMPTY
(749): þm., RÚV, EM, London, H., HM, Collins, KSÍ, United, KR
Paradigm hrafnhildur | Masc | Fem |
---|---|---|
Case=Acc | Hrafnhildi | Hrafnhildi |
Case=Dat | Hrafnhildi | Hrafnhildi, Hrafnhildur |
Case=Gen | Hrafnhildar | |
Case=Nom | Hrafnhildur |
Gender
seems to be lexical feature of PROPN
. 98% lemmas (620) occur only with one value of Gender
.
VERB
744 VERB tokens (8% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (744; 100%), Person=EMPTY (744; 100%), Tense=EMPTY (744; 100%), VerbForm=Part (734; 99%), Voice=Act (724; 97%), Number=Sing (599; 81%).
VERB
tokens may have the following values of Gender
:
Fem
(116; 16% of non-emptyGender
): orðin, farin, komin, tekin, teknar, samþykkt, settar, skráð, felld, gerðarMasc
(137; 18% of non-emptyGender
): kominn, settir, sýndur, farinn, haldnir, komnir, orðinn, valinn, fluttur, gefinnNeut
(491; 66% of non-emptyGender
): gert, farið, keppt, sagt, tekið, haldið, komið, sett, miðað, lagtEMPTY
(8551): fara, gera, hringir, held, koma, taka, þakka, kemur, á, segja
Paradigm koma | Masc | Fem | Neut |
---|---|---|---|
Degree=Pos|Number=Plur | komandi | ||
Number=Sing|VerbForm=Part|Voice=Act | kominn | komin | komið |
Number=Sing|VerbForm=Part|Voice=Mid | komist | ||
Number=Plur|VerbForm=Part|Voice=Act | komnir | komnar | komin |
NUM
224 NUM tokens (21% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (223; 100%), Number=Plur (217; 97%).
NUM
tokens may have the following values of Gender
:
Fem
(40; 18% of non-emptyGender
): tvær, þrjár, fimm, þúsund, sjö, þremur, níu, sex, tveggja, fjögurraMasc
(73; 33% of non-emptyGender
): þrír, fjóra, tveimur, átta, fimm, sex, tvo, fjórir, þrjá, fjórumNeut
(111; 50% of non-emptyGender
): tvö, tveimur, fjögur, þrjú, fjórum, tíu, tveggja, þriggja, þremur, fjögurraEMPTY
(824): 100, 2, 200, 0, 50, 2012, 3, 16, 18, 20
Paradigm tveir | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | tvo | tvær | tvö |
Case=Dat | tveimur, tveim | tveimur | tveimur |
Case=Gen | tveggja | tveggja | |
Case=Nom | tveir | tvær | tvö |
ADV
131 ADV tokens (2% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: Degree=Pos (96; 73%).
ADV
tokens may have the following values of Gender
:
Fem
(28; 21% of non-emptyGender
): svona, fleiri, þannig, mikla, fallega, gríðarlega, kvitt, margar, meiri, mikilMasc
(24; 18% of non-emptyGender
): svona, meiri, mikinn, margir, eins, fleiri, hugsanlega, marga, miklu, sammálaNeut
(79; 60% of non-emptyGender
): rétt, svona, meira, mikið, mikil, mörgum, skýrt, ekkert, ytra, öðruvísiEMPTY
(6829): ekki, þá, svo, hér, bara, eins, þar, nú, þannig, mjög
Paradigm svona | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | svona | svona | |
Case=Acc|Number=Plur | svona | svona | |
Case=Dat|Number=Sing | svona | svona | |
Case=Dat|Number=Plur | svona | svona | svona |
Case=Nom|Number=Sing | svona | svona | |
Case=Nom|Number=Plur | svona |
AUX
36 AUX tokens (1% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (36; 100%), Number=Sing (36; 100%), Person=EMPTY (36; 100%), Tense=EMPTY (36; 100%), VerbForm=Part (36; 100%), Voice=Act (36; 100%).
AUX
tokens may have the following values of Gender
:
Neut
(36; 100% of non-emptyGender
): verið, haftEMPTY
(5268): er, var, eru, sé, verið, hefur, hafa, vera, hafi, væri
X
3 X tokens (3% of all X
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which X
and Gender
co-occurred: Foreign=EMPTY (3; 100%).
X
tokens may have the following values of Gender
:
Fem
(1; 33% of non-emptyGender
): skyttunarMasc
(1; 33% of non-emptyGender
): final-fourNeut
(1; 33% of non-emptyGender
): nýafstöðuEMPTY
(88): Molde, 2016, Eidur, FK, að, i, se, your, 22, 3
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (1828; 79%),
NOUN –[det]–> DET (1172; 94%),
NOUN –[amod]–> DET (633; 95%),
NOUN –[conj]–> NOUN (331; 53%),
NOUN –[nmod:poss]–> PRON (276; 67%),
PROPN –[flat:name]–> PROPN (246; 65%),
ADJ –[nsubj]–> PRON (188; 54%),
NOUN –[det]–> PRON (136; 98%),
ADJ –[nsubj]–> NOUN (123; 87%),
ADJ –[conj]–> NOUN (118; 86%).