home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

64570 tokens (38%) have a non-empty value of Gender. 9713 types (51%) occur at least once with a non-empty value of Gender. 6649 lemmas (52%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (29447; 18% instances), DET (24018; 14% instances), ADJ (5441; 3% instances), PRON (2766; 2% instances), VERB (2267; 1% instances), NUM (569; 0% instances), AUX (62; 0% instances).

NOUN

29447 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21248; 72%).

NOUN tokens may have the following values of Gender:

Paradigm filhoMascFem
Number=Singfilhofilha
Number=Plurfilhosfilhas

Gender seems to be lexical feature of NOUN. 97% lemmas (4562) occur only with one value of Gender.

DET

24018 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (21173; 88%), Number=Sing (19605; 82%), Definite=Def (18892; 79%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Number=Singoa
Number=Plurosas

ADJ

5441 ADJ tokens (64% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (4518; 83%), Number=Sing (3812; 70%).

ADJ tokens may have the following values of Gender:

Paradigm novoMascFem
Number=Singnovonova
Number=Plurnovosnovas

PRON

2766 PRON tokens (43% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2181; 79%), Person=3 (1821; 66%), Case=EMPTY (1649; 60%).

PRON tokens may have the following values of Gender:

Paradigm eleMascFem
Number=Singeleela
Number=Plureleselas

VERB

2267 VERB tokens (13% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2267; 100%), Person=EMPTY (2267; 100%), Tense=EMPTY (2267; 100%), VerbForm=Part (2267; 100%), Voice=Pass (1728; 76%), Number=Sing (1606; 71%).

VERB tokens may have the following values of Gender:

Paradigm terMascFem
Number=Singtido
Number=Plur|Voice=Passtidas

NUM

569 NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (559; 98%).

NUM tokens may have the following values of Gender:

Paradigm umMascFem
umuma

AUX

62 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (62; 100%), Number=Sing (62; 100%), Person=EMPTY (62; 100%), Tense=EMPTY (62; 100%), VerbForm=Part (62; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (18905; 93%), NOUN –[amod]–> ADJ (3748; 62%), NOUN –[conj]–> NOUN (774; 50%), VERB –[nsubj:pass]–> NOUN (471; 92%), ADJ –[nsubj]–> NOUN (225; 54%), PRON –[amod]–> ADJ (142; 55%), NUM –[nmod]–> NOUN (122; 56%), PRON –[nmod]–> NOUN (99; 57%), PRON –[det]–> DET (75; 63%), PRON –[nsubj]–> NOUN (49; 67%).