home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

7484 tokens (40%) have a non-empty value of Gender. 4656 types (71%) occur at least once with a non-empty value of Gender. 3330 lemmas (69%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: NOUN (4074; 22% instances), PRON (1249; 7% instances), ADJ (1150; 6% instances), PROPN (616; 3% instances), VERB (241; 1% instances), NUM (118; 1% instances), DET (22; 0% instances), ADV (10; 0% instances), SCONJ (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances).

NOUN

4074 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Definite=Ind (2933; 72%), Number=Sing (2792; 69%).

NOUN tokens may have the following values of Gender:

Paradigm ríkiFemNeut
Case=Acc|Definite=Ind|Number=Singríki
Case=Dat|Definite=Def|Number=Singríkinu
Case=Dat|Definite=Ind|Number=Singríki
Case=Dat|Definite=Ind|Number=Plurríkjum
Case=Gen|Definite=Def|Number=Singríkisins
Case=Gen|Definite=Def|Number=Plurríkjanna
Case=Gen|Definite=Ind|Number=Singríkis
Case=Gen|Definite=Ind|Number=Plurríkja
Case=Nom|Definite=Def|Number=Singríkið
Case=Nom|Definite=Ind|Number=Singríki

Gender seems to be lexical feature of NOUN. 100% lemmas (2207) occur only with one value of Gender.

PRON

1249 PRON tokens (91% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (967; 77%), PronType=Prs (686; 55%).

PRON tokens may have the following values of Gender:

Paradigm þessiMascFemNeut
Case=Acc|Number=Singþennanþessaþetta
Case=Acc|Number=Plurþessaþessarþessi
Case=Dat|Number=Singþessumþessariþessu, þessum
Case=Dat|Number=Plurþessumþessum
Case=Gen|Number=Singþessaþessararþessa
Case=Gen|Number=Plurþessaraþessara
Case=Nom|Number=SingÞessiþessiþetta
Case=Nom|Number=PlurÞessarþessi, Þessir

ADJ

1150 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (917; 80%), Number=Sing (775; 67%), Definite=Ind (724; 63%).

ADJ tokens may have the following values of Gender:

Paradigm mikillMascFemNeut
Case=Acc|Definite=Def|Degree=Pos|Number=Singmikla
Case=Acc|Definite=Def|Degree=Cmp|Number=Singmeirimeira
Case=Acc|Definite=Def|Degree=Sup|Number=Singmesta
Case=Acc|Definite=Def|Degree=Sup|Number=Plurmestu
Case=Acc|Definite=Ind|Degree=Pos|Number=Singmikinnmiklamikið
Case=Acc|Definite=Ind|Degree=Pos|Number=Plurmiklarmikil
Case=Dat|Definite=Def|Degree=Cmp|Number=Singmeirimeiri
Case=Dat|Definite=Def|Degree=Sup|Number=Singmestumesta
Case=Dat|Definite=Ind|Degree=Pos|Number=Singmiklu
Case=Dat|Definite=Ind|Degree=Sup|Number=Singmestu
Case=Gen|Definite=Def|Degree=Pos|Number=Singmikla
Case=Gen|Definite=Ind|Degree=Pos|Number=Singmikilsmikillar
Case=Nom|Definite=Def|Degree=Pos|Number=Singmikla
Case=Nom|Definite=Def|Degree=Cmp|Number=Singmeirimeira
Case=Nom|Definite=Def|Degree=Sup|Number=Singmestamesta
Case=Nom|Definite=Ind|Degree=Pos|Number=Singmikillmikilmikið

PROPN

616 PROPN tokens (42% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (506; 82%).

PROPN tokens may have the following values of Gender:

Paradigm HeródesMascFem
HeródesarHeródesar

Gender seems to be lexical feature of PROPN. 99% lemmas (428) occur only with one value of Gender.

VERB

241 VERB tokens (12% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (241; 100%), Person=EMPTY (241; 100%), Case=Nom (232; 96%), Tense=Past (218; 90%), VerbForm=Part (218; 90%), Voice=Act (215; 89%), Number=Sing (184; 76%).

VERB tokens may have the following values of Gender:

Paradigm segjaMascFemNeut
Number=Singsagðursögðsagt
Number=Plursagðir

NUM

118 NUM tokens (27% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (95; 81%).

NUM tokens may have the following values of Gender:

Paradigm tveirMascFemNeut
Case=Acctvotvær
Case=Dattveimurtveimurtveimur
Case=Gentveggjatveggjatveggja
Case=NomTveirtværtvö

DET

22 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (15; 68%).

DET tokens may have the following values of Gender:

Paradigm hinnMascFemNeut
Case=Acc|Number=Singhinnhið
Case=Dat|Number=Singhinnihinu
Case=Dat|Number=Plurhinumhinum
Case=Gen|Number=Singhinshinnarhins
Case=Gen|Number=Plurhinnahinna
Case=Nom|Number=SinghinnHinhið

ADV

10 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

SCONJ

2 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

ADP

1 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

1 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=EMPTY (1; 100%), VerbForm=EMPTY (1; 100%), Voice=EMPTY (1; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (753; 93%), NOUN –[det]–> PRON (242; 94%), NOUN –[nmod:poss]–> PRON (120; 61%), NOUN –[nsubj]–> NOUN (58; 51%), ADJ –[nsubj]–> NOUN (47; 70%), ADJ –[nsubj]–> PRON (30; 88%), ADJ –[conj]–> ADJ (18; 86%), ADJ –[det]–> PRON (14; 93%), NOUN –[acl:relcl]–> ADJ (12; 67%), ADJ –[expl]–> PRON (11; 100%).