home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Low_Saxon-LSDC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 4 combinations have been observed: Fem|Masc, Fem|Masc|Neut, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

6951 tokens (31%) have a non-empty value of Gender. 2222 types (46%) occur at least once with a non-empty value of Gender. 1627 lemmas (50%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (2816; 12% instances), DET (2152; 10% instances), PRON (1314; 6% instances), ADJ (630; 3% instances), PROPN (29; 0% instances), NUM (10; 0% instances).

NOUN

2816 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2284; 81%).

NOUN tokens may have the following values of Gender:

Paradigm tydFem,MascMascFem
Case=Acc,Dat|Number=Singtyd, tydentydtyd
Case=Acc,Dat|Number=Plurtyden, tyd
Case=Acc|Number=Singtydtyd
Case=Acc|Number=Plurtyden
Case=Dat|Number=Singtyd
Case=Dat|Number=Plurtyden
Case=Nom|Number=Singtydtyd

Gender seems to be lexical feature of NOUN. 95% lemmas (1264) occur only with one value of Gender.

DET

2152 DET tokens (97% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (1860; 86%), Number=Sing (1839; 85%), PronType=Art (1680; 78%), Definite=Def (1343; 62%).

DET tokens may have the following values of Gender:

Paradigm enFem,MascFem,NeutMascMasc,NeutFemNeut
Case=Acc,Dat|Definite=Def|Number=Sing|PronType=Artden, enen
Case=Acc,Dat|Definite=Def|Number=Plur|PronType=Artden
Case=Acc,Dat|Definite=Ind|Number=Sing|PronType=Artenen, nenen, eyne, neen
Case=Acc,Dat|Definite=Ind|Number=Plur|PronType=Dem'ne
Case=Acc|Definite=Def|Number=Sing|PronType=Artden, eynenenen
Case=Acc|Definite=Def|Number=Sing|PronType=Prseynen
Case=Acc|Definite=Ind|Number=Singen
Case=Acc|Definite=Ind|Number=Sing|PronType=Arten, neenen, eynen, ne, nen, e, eyneen, ne, eyne, een, Eyn, e
Case=Acc|Definite=Ind|Number=Plur|PronType=Arten, ne
Case=Acc|Number=Sing|PronType=Artenne
Case=Acc|Number=Sing|PronType=Prsen
Case=Dat|Definite=Def|Number=Sing|PronType=Artenen
Case=Dat|Definite=Ind|Number=Sing|PronType=Arteynemeynereynem
Case=Gen|Definite=Def|Number=Sing|PronType=Artenen
Case=Nom|Definite=Def|Number=Sing|PronType=Artneneen
Case=Nom|Definite=Ind|Number=Sing|PronType=Arten, ne, eyn, nenenen, ne, eyneen, eyn, e
Case=Nom|Definite=Ind|Number=Sing|PronType=Prsne
Case=Nom|Number=Sing|PronType=Arten
Definite=Ind|Number=Singen

PRON

1314 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1298; 99%), Case=Nom (872; 66%), Person=3 (781; 59%), PronType=Prs (776; 59%).

PRON tokens may have the following values of Gender:

Paradigm deeFem,MascMascFemNeut
Case=Acc,Dat|Number=Sing|PronType=Demden
Case=Acc,Dat|Number=Sing|PronType=Reldendee
Case=Acc|Number=Sing|PronType=DemDeeden, deanDeedee
Case=Acc|Number=Sing|PronType=Reldendee
Case=Acc|Number=Plur|PronType=Reldee
Case=Dat|Number=Sing|PronType=Demdem, deane
Case=Dat|Number=Sing|PronType=Reldem
Case=Nom|Number=Sing|Person=3|PronType=Demdee
Case=Nom|Number=Sing|Person=3|PronType=Reldeedee
Case=Nom|Number=Sing|PronType=Demdeedeedeedee
Case=Nom|Number=Sing|PronType=Prsdee
Case=Nom|Number=Sing|PronType=Reldeedee, dendee
Case=Nom|Number=Plur|PronType=Demdee
Case=Nom|Number=Plur|PronType=Reldeedee

ADJ

630 ADJ tokens (48% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (545; 87%), Number=Sing (518; 82%).

ADJ tokens may have the following values of Gender:

Paradigm oldFem,MascMascMasc,NeutFemNeut
Case=Acc,Dat|Degree=Pos|Number=Singoldenolde
Case=Acc,Dat|Degree=Pos|Number=Plurolden
Case=Acc|Degree=Pos|Number=Singolde, oldenoldeold
Case=Acc|Number=Singold
Case=Acc|Number=Plurolden
Case=Dat|Degree=Pos|Number=Pluroldenolden
Case=Dat|Number=Singolden
Case=Gen|Degree=Pos|Number=Singöldesten
Case=Nom|Degree=Pos|Number=Singoldeolde, old, ol, olden, olderolde, oldold, olde
Case=Nom|Degree=Pos|Number=Pluroldeoldeolde, olden
Case=Nom|Number=Singoldeolde

PROPN

29 PROPN tokens (6% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (29; 100%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (26) occur only with one value of Gender.

NUM

10 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm eynMascFemNeut
Case=Acc,Dat|Number=Singeyneneyn
Case=Acc,Dat|NumType=Cardeyn
Case=Acc|NumType=Cardeynen
Case=Dat|Number=Singeyneeynen
Case=Nom|NumType=Cardeyn

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1978; 97%), NOUN –[amod]–> ADJ (532; 94%), ADJ –[det]–> DET (54; 96%), PRON –[det]–> DET (18; 86%), NOUN –[det:poss]–> DET (14; 64%), ADJ –[conj]–> ADJ (9; 90%), NOUN –[flat]–> NOUN (8; 89%), NOUN –[orphan]–> NOUN (6; 60%), NOUN –[det]–> PRON (4; 67%), PRON –[conj]–> PRON (4; 67%).