This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

14275 tokens (27%) have a non-empty value of Gender. 4771 types (66%) occur at least once with a non-empty value of Gender. 3149 lemmas (66%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags, listed here with token counts and their share of all tokens in the treebank: NOUN (11234; 21%), PROPN (1375; 3%), PRON (1198; 2%), ADJ (377; 1%), NUM (91; 0%).
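The page does not say how these counts are produced. As a minimal sketch, counts of this kind could be derived from a CoNLL-U file roughly as follows. The function names and the sample rows are hypothetical, and the sketch assumes that only syntactic words are counted (multiword token ranges and empty nodes are skipped).

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return two Counters keyed by UPOS: tokens with Gender, and all tokens."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

# Hypothetical toy rows, not taken from the treebank.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "iaith", "iaith", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    ["2", "Bangor", "Bangor", "PROPN", "_", "Gender=Masc|Number=Sing", "1", "nmod", "_", "_"],
    ["3", "da", "da", "ADJ", "_", "Degree=Pos", "1", "amod", "_", "_"],
    ["4", "ieithoedd", "iaith", "NOUN", "_", "Number=Plur", "1", "conj", "_", "_"],
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    share = 100 * with_gender[upos] / total[upos]
    print(upos, with_gender[upos], f"{share:.0f}%")
```

Dividing by the per-tag total gives figures of the "70% of all NOUN tokens" kind reported in the sections below. Dividing by the sum of all totals instead gives the share of all tokens.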

NOUN

11234 NOUN tokens (70% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (11226; 100%), Number=Sing (8883; 79%), Mutation=EMPTY (8263; 74%).

NOUN tokens may have the following values of Gender:

Paradigm iaith (columns: Masc, Fem)
Mutation=AM|Number=Sing: hiaith (Masc), hiaith (Fem)
Mutation=AM|Number=Plur: hieithoedd
Number=Sing: Iaith (Masc), iaith (Fem)
Number=Plur: ieithoedd

PROPN

1375 PROPN tokens (68% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1348; 98%), Mutation=EMPTY (1155; 84%).

PROPN tokens may have the following values of Gender:

Paradigm Bangor (columns: Masc, Fem)
Mutation=NM: Mangor
Mutation=SM: Fangor (Masc), Fangor (Fem)
(no other features): Bangor

Gender seems to be a lexical feature of PROPN: 93% of lemmas (606) occur with only one value of Gender.
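A check of this kind can be sketched as follows: group the observed Gender values by lemma and count the lemmas that have exactly one value. The function name and the toy pairs are hypothetical, and the page does not state which lemmas enter the denominator. Here it is assumed to be every lemma seen at least once with a Gender value.

```python
from collections import defaultdict

def single_gender_share(lemma_gender_pairs):
    """Return (n_single, n_lemmas, share) over lemmas seen with a Gender value."""
    values = defaultdict(set)
    for lemma, gender in lemma_gender_pairs:
        values[lemma].add(gender)
    n_single = sum(1 for seen in values.values() if len(seen) == 1)
    n_lemmas = len(values)
    share = n_single / n_lemmas if n_lemmas else 0.0
    return n_single, n_lemmas, share

# Hypothetical toy observations: (lemma, Gender value) for one part of speech.
PAIRS = [
    ("Bangor", "Masc"),
    ("Bangor", "Fem"),
    ("Cymru", "Fem"),
    ("Cymru", "Fem"),
    ("Caerdydd", "Fem"),
]

n_single, n_lemmas, share = single_gender_share(PAIRS)
print(f"{n_single} of {n_lemmas} lemmas ({share:.0%}) occur with only one Gender value")
```

A share close to 100% suggests that Gender is fixed per lemma rather than inflectional for that part of speech.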

PRON

1198 PRON tokens (34% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1092; 91%), PronType=Prs (1071; 89%), Person=3 (1063; 89%), Poss=EMPTY (864; 72%).

PRON tokens may have the following values of Gender:

Paradigm hwy (columns: Masc, Fem)
Number=Sing: 'w (Masc), 'w (Fem)
Number=Plur|Poss=Yes: eu

ADJ

377 ADJ tokens (10% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (364; 97%), Number=Sing (347; 92%), Mutation=EMPTY (257; 68%).

ADJ tokens may have the following values of Gender:

Paradigm da (columns: Masc, Fem)
Degree=Pos|Mutation=SM: well
Degree=Pos: gwell
Degree=Cmp: gwell
Degree=Equ: cystal

Gender seems to be a lexical feature of ADJ: 93% of lemmas (177) occur with only one value of Gender.

NUM

91 NUM tokens (13% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (91; 100%), NumForm=Word (89; 98%), Mutation=EMPTY (49; 54%).

NUM tokens may have the following values of Gender:

Paradigm dau (columns: Masc, Fem)
Mutation=SM: ddau (Masc), ddwy (Fem)
(no other features): dau (Masc); dwy, dyw (Fem)

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (1978; 57%), NOUN –[conj]–> NOUN (449; 58%), NOUN –[det]–> PRON (83; 53%), NOUN –[appos]–> NOUN (67; 62%), PROPN –[conj]–> PROPN (48; 51%), PRON –[compound:redup]–> PRON (35; 100%), PROPN –[appos]–> NOUN (17; 52%), PROPN –[nmod]–> NOUN (16; 57%), NOUN –[fixed]–> NOUN (15; 100%), NOUN –[acl:relcl]–> NOUN (14; 54%).
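Agreement counts like these can be approximated by walking every dependency edge and comparing the Gender values of the parent and child nodes. The sketch below is an assumption about the method, not the page's actual procedure. The function names and the toy sentence are hypothetical, and the denominator used here (edges where both nodes carry a Gender value) may differ from the one behind the percentages above.

```python
from collections import Counter

def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None

def agreement_by_relation(sentence_rows):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    sentence_rows holds the 10-column CoNLL-U rows of one sentence.
    Returns (agree, both): edges whose nodes share a Gender value, and
    edges where both nodes have any Gender value.
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, both = Counter(), Counter()
    for row in sentence_rows:
        parent = by_id.get(row[6])
        if parent is None:
            continue  # root (head 0) has no parent node
        child_gender = gender_of(row[5])
        parent_gender = gender_of(parent[5])
        if child_gender is None or parent_gender is None:
            continue
        key = (parent[3], row[7], row[3])
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, both

# Hypothetical toy sentence, not taken from the treebank.
ROWS = [
    ["1", "iaith", "iaith", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    ["2", "gwlad", "gwlad", "NOUN", "_", "Gender=Fem|Number=Sing", "1", "nmod", "_", "_"],
    ["3", "dyn", "dyn", "NOUN", "_", "Gender=Masc|Number=Sing", "1", "conj", "_", "_"],
    ["4", "da", "da", "ADJ", "_", "Degree=Pos", "1", "amod", "_", "_"],
]

agree, both = agreement_by_relation(ROWS)
for (parent_upos, deprel, child_upos), n in both.items():
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[(parent_upos, deprel, child_upos)]} of {n} agree")
```

Summing the two counters over all sentences of a treebank and sorting by the agreeing count would give a ranked list in the same shape as the one above.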