Treebank Statistics: UD_Icelandic-GC: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
39478 tokens (40%) have a non-empty value of Gender
.
16804 types (84%) occur at least once with a non-empty value of Gender
.
11319 lemmas (83%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (20376; 20% instances), PRON (6328; 6% instances), ADJ (5405; 5% instances), PROPN (4964; 5% instances), NUM (1336; 1% instances), ADV (933; 1% instances), VERB (91; 0% instances), DET (45; 0% instances).
NOUN
20376 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Definite=EMPTY (15591; 77%), Number=Sing (14204; 70%).
NOUN
tokens may have the following values of Gender
:
Fem
(6779; 33% of non-emptyGender
): leið, ákvörðun, upplýsingar, konur, stjórn, sögn, þjónustu, frétt, tilkynningu, lögregluMasc
(6621; 32% of non-emptyGender
): maður, stað, tíma, menn, leik, þátt, sigur, formaður, forsætisráðherra, hlutiNeut
(6976; 34% of non-emptyGender
): ára, fólk, sæti, málið, mál, áhrif, liðið, landsins, ráð, ogEMPTY
(188): mars, 2017, apríl, ágúst, desember, febrúar, júlí, júní, nóvember, október
Paradigm ár | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | ár | ||
Case=Acc|Definite=Def|Number=Sing | árið | ||
Case=Acc|Definite=Def|Number=Plur | árarnar | árin | |
Case=Acc|Number=Sing | ár | ||
Case=Acc|Number=Plur | ára | árar | ár |
Case=Dat|Definite=Def|Number=Sing | árinu | ||
Case=Dat|Definite=Def|Number=Plur | árunum | ||
Case=Dat|Number=Sing | ári | ár | ári, árum |
Case=Dat|Number=Plur | árum | ||
Case=Gen|Definite=Def|Number=Sing | ársins | ársins | |
Case=Gen|Definite=Def|Number=Plur | áranna | ||
Case=Gen|Number=Sing | árs, ára | árs | |
Case=Gen|Number=Plur | ára | ||
Case=Nom|Definite=Def|Number=Sing | árið | ||
Case=Nom|Definite=Def|Number=Plur | árin | ||
Case=Nom|Number=Sing | ár | ||
Case=Nom|Number=Plur | ár |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (6892) occur only with one value of Gender
.
PRON
6328 PRON tokens (81% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (4808; 76%), Person=EMPTY (3757; 59%), PronType=EMPTY (3715; 59%).
PRON
tokens may have the following values of Gender
:
Fem
(1215; 19% of non-emptyGender
): hún, þær, hennar, henni, sinni, sína, sú, þeim, þá, hanaMasc
(2037; 32% of non-emptyGender
): hann, þeir, hans, þeim, þeirra, sínum, honum, allir, þess, þannNeut
(3076; 49% of non-emptyGender
): það, því, þetta, þess, þau, þessu, allt, hvað, þeirra, þeimEMPTY
(1474): ég, við, sér, sig, mér, okkur, okkar, mig, þú, þér
Paradigm sá | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | þann | þá, það | það |
Case=Acc|Number=Sing|Person=3|PronType=Prs | það | ||
Case=Acc|Number=Plur | þá | þær | þau |
Case=Acc|Number=Plur|Person=3|PronType=Prs | þau | ||
Case=Dat|Number=Sing | þeim | þeirri | því |
Case=Dat|Number=Sing|Person=3|PronType=Prs | því | ||
Case=Dat|Number=Plur | þeim | þeim | þeim |
Case=Dat|Number=Plur|Person=3|PronType=Prs | þeim | ||
Case=Gen|Number=Sing | þess | þeirrar | þess |
Case=Gen|Number=Sing|Person=3|PronType=Prs | þess | ||
Case=Gen|Number=Plur | þeirra | þeirra | þeirra |
Case=Gen|Number=Plur|Person=3|PronType=Prs | þeirra | þeirra | |
Case=Nom|Number=Sing | sá, þess, þessi | sú | það |
Case=Nom|Number=Sing|Person=3|PronType=Prs | sá | það | |
Case=Nom|Number=Plur | þeir | þær | þau |
Case=Nom|Number=Plur|Person=3|PronType=Prs | þeir | þau |
ADJ
5405 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=EMPTY (4207; 78%), Number=Sing (3779; 70%).
ADJ
tokens may have the following values of Gender
:
Fem
(1521; 28% of non-emptyGender
): síðustu, fyrstu, mikil, næstu, Sameinuðu, meiri, frekari, mikla, ný, góðarMasc
(1730; 32% of non-emptyGender
): margir, síðustu, fleiri, fyrri, mikill, fyrrverandi, fyrstu, fyrsti, seinni, fyrstaNeut
(2154; 40% of non-emptyGender
): hægt, mikið, ljóst, gott, síðasta, meira, miklu, íslenska, erfitt, fyrstaEMPTY
(46): hægt, sammála, spennandi, Suðlæg, Svört, aðgerðalaus, beyglaður, brotinn, eftirfarandi, eldri
Paradigm mikill | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Degree=Pos|Number=Sing | mikla | mikla | mikið, mikla |
Case=Acc|Degree=Pos|Number=Plur | mikil | ||
Case=Acc|Degree=Cmp|Number=Sing | meiri | meiri | meira |
Case=Acc|Degree=Cmp|Number=Plur | meiri | meiri | |
Case=Acc|Degree=Sup|Number=Sing | mestan | mesta | mest |
Case=Acc|Degree=Sup|Number=Plur | mestar | mestu | |
Case=Acc|Number=Sing | mikinn, mikla | mikla | mikið |
Case=Acc|Number=Plur | miklar | mikil, miklu | |
Case=Dat|Degree=Pos|Number=Sing | miklum | miklu | |
Case=Dat|Degree=Pos|Number=Plur | miklum | ||
Case=Dat|Degree=Cmp|Number=Sing | meiri | meira | |
Case=Dat|Degree=Cmp|Number=Plur | meiri | ||
Case=Dat|Degree=Sup|Number=Sing | mestum | mestu | mestu, mesta |
Case=Dat|Number=Sing | miklum | mikilli | miklu, meiru |
Case=Dat|Number=Plur | miklum, miklu | miklum | miklum |
Case=Gen|Degree=Pos|Number=Sing | mikils | ||
Case=Gen|Degree=Cmp|Number=Sing | meira | ||
Case=Gen|Number=Sing | mikils, mikla | mikillar | mikils |
Case=Gen|Number=Plur | mikilla | ||
Case=Nom|Degree=Pos|Number=Sing | mikill | mikið | |
Case=Nom|Degree=Pos|Number=Plur | mikil | ||
Case=Nom|Degree=Cmp|Number=Sing | meiri | meiri | meira |
Case=Nom|Degree=Cmp|Number=Plur | meiri | meiri | |
Case=Nom|Degree=Sup|Number=Sing | mest, mesta | mesta, mest | |
Case=Nom|Degree=Sup|Number=Plur | mestir | Mestar | |
Case=Nom|Number=Sing | mikill, mikli | mikil, mikla | mikið |
Case=Nom|Number=Plur | miklir | miklar | mikil |
PROPN
4964 PROPN tokens (81% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(1423; 29% of non-emptyGender
): Reykjavík, Katrín, Reykjavíkur, Evrópu, 2, Akureyri, Sigríður, Anna, Danmörku, ErlaMasc
(2577; 52% of non-emptyGender
): Trump, Jón, þór, Guðmundur, Sigurður, Ólafur, Bjarni, Björn, Davíð, DonaldNeut
(964; 19% of non-emptyGender
): Íslands, Íslandi, Ísland, Bandaríkjunum, Bandaríkjanna, Morgunblaðinu, RÚV, Alþingi, Alþingis, ESBEMPTY
(1161): Icelandair, Arsenal, Facebook, New, United, WOW, York, air, City, Group
Paradigm Trump | Masc | Fem |
---|---|---|
Case=Acc | Trump | Trump |
Case=Acc|Number=Sing | Trump | |
Case=Dat | Trump | |
Case=Gen | Trump, Trumps | |
Case=Gen|Number=Sing | Trump | |
Case=Nom | Trump | |
Case=Nom|Number=Sing | Trump |
Gender
seems to be lexical feature of PROPN
. 98% lemmas (2595) occur only with one value of Gender
.
NUM
1336 NUM tokens (71% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: Number=Plur (1149; 86%), NumType=EMPTY (752; 56%).
NUM
tokens may have the following values of Gender
:
Fem
(293; 22% of non-emptyGender
): tvær, milljónir, tveggja, tveimur, þremur, fjórar, tíu, fimm, milljónum, prósentMasc
(420; 31% of non-emptyGender
): tveir, tvo, sjö, einn, átta, þrír, fimm, tveggja, tveimur, tíuNeut
(623; 47% of non-emptyGender
): fimm, tvö, prósent, þrjú, eitt, sex, þúsund, tíu, tveggja, fjögurEMPTY
(554): 2017, 2016, 0, 2, 2012, 2014, 1, prósent, 2013, 3
Paradigm tveir | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | tvo | tvær | tvö |
Case=Acc|NumType=Card | tveggja | ||
Case=Dat | tveimur | tveimur | tveimur |
Case=Gen | tveggja | tveggja | tveggja |
Case=Nom | tveir | tvær | tvö |
ADV
933 ADV tokens (9% of all ADV
tokens) have a non-empty value of Gender
.
ADV
tokens may have the following values of Gender
:
Fem
(176; 19% of non-emptyGender
): viku, byrjun, nótt, mínútu, vikur, mínútur, vikum, helgi, leið, helginaMasc
(409; 44% of non-emptyGender
): dag, mánuði, daga, daginn, hálfleik, tíma, mánuðum, dagana, laugardag, laugardaginnNeut
(348; 37% of non-emptyGender
): ár, ári, árum, fyrra, sumar, lok, því, árið, kvöld, þaðEMPTY
(9366): ekki, þar, þá, í, á, fram, svo, upp, til, út
Paradigm ár | Fem | Neut |
---|---|---|
Case=Acc|Definite=Def|Number=Sing | árið | |
Case=Acc|Definite=Def|Number=Plur | árin | |
Case=Acc|Number=Sing | ár | |
Case=Acc|Number=Plur | ár | |
Case=Dat|Definite=Def|Number=Sing | árinu | |
Case=Dat|Definite=Def|Number=Plur | árunum | |
Case=Dat|Number=Sing | ári | |
Case=Dat|Number=Plur | árum | |
Case=Gen|Number=Plur | ára | |
Case=Nom|Number=Sing | ár |
Gender
seems to be lexical feature of ADV
. 97% lemmas (163) occur only with one value of Gender
.
VERB
91 VERB tokens (1% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (91; 100%), Person=EMPTY (91; 100%), Voice=EMPTY (91; 100%), Tense=EMPTY (90; 99%), VerbForm=Part (89; 98%), Case=Nom (87; 96%), Number=Sing (71; 78%).
VERB
tokens may have the following values of Gender
:
Fem
(19; 21% of non-emptyGender
): kynnt, sögð, afhenta, birt, búin, farin, flutta, gerð, hafin, hafnarMasc
(29; 32% of non-emptyGender
): hafinn, orðinn, handteknir, bundinn, byggður, búsettur, eigandi, endurgreiddir, gagnrýndir, handteknaNeut
(43; 47% of non-emptyGender
): búið, farið, gert, kveðið, stefnt, talið, þekkt, Fjallað, Skrifað, SýntEMPTY
(12749): er, segir, var, eru, sagði, hafa, kemur, gera, verið, koma
Paradigm segja | Fem | Neut |
---|---|---|
sögð | sagt |
DET
45 DET tokens (100% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Definite=Def (45; 100%), Number=Sing (36; 80%).
DET
tokens may have the following values of Gender
:
Fem
(11; 24% of non-emptyGender
): hin, hina, hinar, hinnar, hinni, hinumMasc
(13; 29% of non-emptyGender
): hinn, hinum, HinirNeut
(21; 47% of non-emptyGender
): hið, hin, hinu, WOW-ið, hinum, volume-ið
Paradigm hinn | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | hina | hið | |
Case=Acc|Number=Plur | hinar | hin | |
Case=Dat|Number=Sing | hinum | hinni | hinu |
Case=Dat|Number=Plur | hinum | hinum | |
Case=Gen|Number=Sing | hinnar | ||
Case=Nom|Number=Sing | hinn | hin | hið |
Case=Nom|Number=Plur | Hinir | hin |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (3260; 96%),
NOUN –[nmod]–> PRON (1713; 95%),
PROPN –[flat]–> PROPN (1140; 100%),
NOUN –[nummod]–> NUM (690; 85%),
NOUN –[flat]–> NOUN (324; 100%),
ADJ –[nsubj]–> NOUN (317; 95%),
PROPN –[obl]–> NOUN (219; 52%),
ADV –[amod]–> ADJ (209; 80%),
ADJ –[nsubj]–> PRON (184; 75%),
PROPN –[conj]–> PROPN (157; 57%).