home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-GC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

39478 tokens (40%) have a non-empty value of Gender. 16804 types (84%) occur at least once with a non-empty value of Gender. 11319 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (20376; 20% instances), PRON (6328; 6% instances), ADJ (5405; 5% instances), PROPN (4964; 5% instances), NUM (1336; 1% instances), ADV (933; 1% instances), VERB (91; 0% instances), DET (45; 0% instances).

NOUN

20376 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Definite=EMPTY (15591; 77%), Number=Sing (14204; 70%).

NOUN tokens may have the following values of Gender:

Paradigm árMascFemNeut
Case=Accár
Case=Acc|Definite=Def|Number=Singárið
Case=Acc|Definite=Def|Number=Plurárarnarárin
Case=Acc|Number=Singár
Case=Acc|Number=Pluráraárarár
Case=Dat|Definite=Def|Number=Singárinu
Case=Dat|Definite=Def|Number=Plurárunum
Case=Dat|Number=Singáriárári, árum
Case=Dat|Number=Plurárum
Case=Gen|Definite=Def|Number=Singársinsársins
Case=Gen|Definite=Def|Number=Pluráranna
Case=Gen|Number=Singárs, áraárs
Case=Gen|Number=Plurára
Case=Nom|Definite=Def|Number=Singárið
Case=Nom|Definite=Def|Number=Plurárin
Case=Nom|Number=Singár
Case=Nom|Number=Plurár

Gender seems to be lexical feature of NOUN. 98% lemmas (6892) occur only with one value of Gender.

PRON

6328 PRON tokens (81% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4808; 76%), Person=EMPTY (3757; 59%), PronType=EMPTY (3715; 59%).

PRON tokens may have the following values of Gender:

Paradigm MascFemNeut
Case=Acc|Number=Singþannþá, þaðþað
Case=Acc|Number=Sing|Person=3|PronType=Prsþað
Case=Acc|Number=Plurþáþærþau
Case=Acc|Number=Plur|Person=3|PronType=Prsþau
Case=Dat|Number=Singþeimþeirriþví
Case=Dat|Number=Sing|Person=3|PronType=Prsþví
Case=Dat|Number=Plurþeimþeimþeim
Case=Dat|Number=Plur|Person=3|PronType=Prsþeim
Case=Gen|Number=Singþessþeirrarþess
Case=Gen|Number=Sing|Person=3|PronType=Prsþess
Case=Gen|Number=Plurþeirraþeirraþeirra
Case=Gen|Number=Plur|Person=3|PronType=Prsþeirraþeirra
Case=Nom|Number=Singsá, þess, þessiþað
Case=Nom|Number=Sing|Person=3|PronType=Prsþað
Case=Nom|Number=Plurþeirþærþau
Case=Nom|Number=Plur|Person=3|PronType=Prsþeirþau

ADJ

5405 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (4207; 78%), Number=Sing (3779; 70%).

ADJ tokens may have the following values of Gender:

Paradigm mikillMascFemNeut
Case=Acc|Degree=Pos|Number=Singmiklamiklamikið, mikla
Case=Acc|Degree=Pos|Number=Plurmikil
Case=Acc|Degree=Cmp|Number=Singmeirimeirimeira
Case=Acc|Degree=Cmp|Number=Plurmeirimeiri
Case=Acc|Degree=Sup|Number=Singmestanmestamest
Case=Acc|Degree=Sup|Number=Plurmestarmestu
Case=Acc|Number=Singmikinn, miklamiklamikið
Case=Acc|Number=Plurmiklarmikil, miklu
Case=Dat|Degree=Pos|Number=Singmiklummiklu
Case=Dat|Degree=Pos|Number=Plurmiklum
Case=Dat|Degree=Cmp|Number=Singmeirimeira
Case=Dat|Degree=Cmp|Number=Plurmeiri
Case=Dat|Degree=Sup|Number=Singmestummestumestu, mesta
Case=Dat|Number=Singmiklummikillimiklu, meiru
Case=Dat|Number=Plurmiklum, miklumiklummiklum
Case=Gen|Degree=Pos|Number=Singmikils
Case=Gen|Degree=Cmp|Number=Singmeira
Case=Gen|Number=Singmikils, miklamikillarmikils
Case=Gen|Number=Plurmikilla
Case=Nom|Degree=Pos|Number=Singmikillmikið
Case=Nom|Degree=Pos|Number=Plurmikil
Case=Nom|Degree=Cmp|Number=Singmeirimeirimeira
Case=Nom|Degree=Cmp|Number=Plurmeirimeiri
Case=Nom|Degree=Sup|Number=Singmest, mestamesta, mest
Case=Nom|Degree=Sup|Number=PlurmestirMestar
Case=Nom|Number=Singmikill, miklimikil, miklamikið
Case=Nom|Number=Plurmiklirmiklarmikil

PROPN

4964 PROPN tokens (81% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Paradigm TrumpMascFem
Case=AccTrumpTrump
Case=Acc|Number=SingTrump
Case=DatTrump
Case=GenTrump, Trumps
Case=Gen|Number=SingTrump
Case=NomTrump
Case=Nom|Number=SingTrump

Gender seems to be lexical feature of PROPN. 98% lemmas (2595) occur only with one value of Gender.

NUM

1336 NUM tokens (71% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (1149; 86%), NumType=EMPTY (752; 56%).

NUM tokens may have the following values of Gender:

Paradigm tveirMascFemNeut
Case=Acctvotværtvö
Case=Acc|NumType=Cardtveggja
Case=Dattveimurtveimurtveimur
Case=Gentveggjatveggjatveggja
Case=Nomtveirtværtvö

ADV

933 ADV tokens (9% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Paradigm árFemNeut
Case=Acc|Definite=Def|Number=Singárið
Case=Acc|Definite=Def|Number=Plurárin
Case=Acc|Number=Singár
Case=Acc|Number=Plurár
Case=Dat|Definite=Def|Number=Singárinu
Case=Dat|Definite=Def|Number=Plurárunum
Case=Dat|Number=Singári
Case=Dat|Number=Plurárum
Case=Gen|Number=Plurára
Case=Nom|Number=Singár

Gender seems to be lexical feature of ADV. 97% lemmas (163) occur only with one value of Gender.

VERB

91 VERB tokens (1% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (91; 100%), Person=EMPTY (91; 100%), Voice=EMPTY (91; 100%), Tense=EMPTY (90; 99%), VerbForm=Part (89; 98%), Case=Nom (87; 96%), Number=Sing (71; 78%).

VERB tokens may have the following values of Gender:

Paradigm segjaFemNeut
sögðsagt

DET

45 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=Def (45; 100%), Number=Sing (36; 80%).

DET tokens may have the following values of Gender:

Paradigm hinnMascFemNeut
Case=Acc|Number=Singhinahið
Case=Acc|Number=Plurhinarhin
Case=Dat|Number=Singhinumhinnihinu
Case=Dat|Number=Plurhinumhinum
Case=Gen|Number=Singhinnar
Case=Nom|Number=Singhinnhinhið
Case=Nom|Number=PlurHinirhin

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3260; 96%), NOUN –[nmod]–> PRON (1713; 95%), PROPN –[flat]–> PROPN (1140; 100%), NOUN –[nummod]–> NUM (690; 85%), NOUN –[flat]–> NOUN (324; 100%), ADJ –[nsubj]–> NOUN (317; 95%), PROPN –[obl]–> NOUN (219; 52%), ADV –[amod]–> ADJ (209; 80%), ADJ –[nsubj]–> PRON (184; 75%), PROPN –[conj]–> PROPN (157; 57%).