home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

95232 tokens (32%) have a non-empty value of Gender. 21495 types (69%) occur at least once with a non-empty value of Gender. 14947 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (55466; 18% instances), ADJ (15206; 5% instances), DET (11269; 4% instances), PRON (10383; 3% instances), PROPN (2829; 1% instances), NUM (75; 0% instances), VERB (4; 0% instances).

NOUN

55466 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (40908; 74%), Definite=Ind (33351; 60%).

NOUN tokens may have the following values of Gender:

Paradigm lovMascFemNeut
_lova, lovi
Definite=Indlovlovlov
Definite=Ind|Number=Plurlover
Number=Plurlovene

Gender seems to be lexical feature of NOUN. 99% lemmas (11852) occur only with one value of Gender.

ADJ

15206 ADJ tokens (52% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (15205; 100%), Number=EMPTY (15203; 100%), Degree=Pos (12486; 82%), VerbForm=EMPTY (12484; 82%).

ADJ tokens may have the following values of Gender:

Paradigm openFem,MascMascFemNeut
openopenopaope

DET

11269 DET tokens (75% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=EMPTY (11269; 100%), PronType=Art (5930; 53%).

DET tokens may have the following values of Gender:

Paradigm allFem,MascMascFemNeut
alleallallalt

PRON

10383 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Animacy=EMPTY (10381; 100%), Number=EMPTY (10379; 100%), PronType=Prs (10351; 100%), Person=3 (9223; 89%), Case=EMPTY (8096; 78%).

PRON tokens may have the following values of Gender:

Paradigm sinMascFemNeut
sinsisitt

PROPN

2829 PROPN tokens (16% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (395) occur only with one value of Gender.

NUM

75 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (75; 100%), Number=EMPTY (75; 100%).

NUM tokens may have the following values of Gender:

Paradigm nokoMascFemNeut
nokonokoNoko

VERB

4 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (4; 100%), Tense=Pres (4; 100%), VerbForm=Fin,Part (4; 100%).

VERB tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (9642; 76%), NOUN –[nmod:poss]–> PRON (1094; 74%), ADJ –[expl]–> PRON (825; 90%), ADJ –[conj]–> ADJ (526; 74%), DET –[nmod]–> NOUN (144; 69%), PRON –[acl:relcl]–> ADJ (71; 76%), PRON –[det]–> DET (28; 67%), ADJ –[xcomp]–> ADJ (26; 52%), ADJ –[amod]–> ADJ (24; 60%), DET –[conj]–> DET (23; 85%).