home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

158460 tokens (37%) have a non-empty value of Gender. 20671 types (45%) occur at least once with a non-empty value of Gender. 14570 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (70453; 16% instances), DET (56083; 13% instances), ADJ (15416; 4% instances), VERB (7450; 2% instances), PRON (4453; 1% instances), PROPN (3418; 1% instances), X (518; 0% instances), AUX (338; 0% instances), NUM (209; 0% instances), SYM (122; 0% instances).

NOUN

70453 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (50632; 72%).

NOUN tokens may have the following values of Gender:

Paradigm parteMascFem
Number=Singparteparte
Number=Plurpartes

Gender seems to be lexical feature of NOUN. 97% lemmas (8767) occur only with one value of Gender.

DET

56083 DET tokens (92% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (51186; 91%), Number=Sing (44758; 80%), Definite=Def (43530; 78%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Definite=Def|Number=Singella, l'
Definite=Def|Number=Sing|Typo=Yesal, ena, al
Definite=Def|Number=Plurloslas
Number=Sing|Typo=Yesal, ena

ADJ

15416 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10981; 71%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
Number=Singprimer, primeroprimera
Number=Sing|NumType=Ordprimer, primeroprimera
Number=Plurprimerosprimeras
Number=Plur|NumType=Ordprimerosprimeras

VERB

7450 VERB tokens (20% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7450; 100%), Person=EMPTY (7447; 100%), VerbForm=Part (6783; 91%), Number=Sing (5988; 80%), Tense=EMPTY (4374; 59%).

VERB tokens may have the following values of Gender:

Paradigm tenerMascFem
Number=Sing|Tense=Past|VerbForm=Parttenido
Number=Sing|VerbForm=Fintengo, tuvo
Number=Plur|Tense=Past|VerbForm=Parttenidostenidas
Number=Plur|VerbForm=Fintienes
VerbForm=Fintenéis

PRON

4453 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (4448; 100%), Number=Sing (3347; 75%), PronType=Prs (2883; 65%), Person=3 (2820; 63%), PrepCase=EMPTY (2269; 51%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc,Nom|Number=Singél, elloella
Case=Acc,Nom|Number=Plurellosellas
Case=Acc|Number=Sing|PrepCase=Nprlola
Case=Acc|Number=Plur|PrepCase=Nprloslas
Case=Dat|Number=Sing|PrepCase=Npr|Typo=Yesla
Case=Nom|Number=Singél

PROPN

3418 PROPN tokens (9% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2936; 86%).

PROPN tokens may have the following values of Gender:

Paradigm theMascFem
thethe

Gender seems to be lexical feature of PROPN. 99% lemmas (2029) occur only with one value of Gender.

X

518 X tokens (28% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (413; 80%).

X tokens may have the following values of Gender:

Paradigm 'sMascFem
_'s's
Number=Sing's's
Number=Sing|Person=3's

Gender seems to be lexical feature of X. 96% lemmas (369) occur only with one value of Gender.

AUX

338 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (338; 100%), Person=EMPTY (337; 100%), Number=Sing (332; 98%), VerbForm=Part (269; 80%), Tense=Past (268; 79%).

AUX tokens may have the following values of Gender:

Paradigm haberMascFem
Number=Singhaber, han
Number=Plur|Person=3has
habéis

NUM

209 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (209; 100%), Number=Sing (177; 85%), NumForm=Word (169; 81%).

NUM tokens may have the following values of Gender:

Paradigm unoMascFem
un, unouna

SYM

122 SYM tokens (7% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Paradigm $MascFem
Number=Sing$$
Number=Sing|VerbForm=Part$
Number=Plur|VerbForm=Part$

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (42537; 84%), NOUN –[amod]–> ADJ (11103; 58%), NOUN –[conj]–> NOUN (2928; 53%), NOUN –[acl]–> VERB (1938; 81%), VERB –[nsubj:pass]–> NOUN (697; 86%), PRON –[nmod]–> NOUN (500; 68%), ADJ –[nsubj]–> NOUN (466; 56%), ADJ –[conj]–> ADJ (448; 54%), NOUN –[nsubj]–> NOUN (422; 51%), NOUN –[det]–> PRON (186; 70%).