home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Italian-VIT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

116420 tokens (42%) have a non-empty value of Gender. 12733 types (54%) occur at least once with a non-empty value of Gender. 8515 lemmas (54%) occur at least once with a non-empty value of Gender. The feature is used with 14 part-of-speech tags: NOUN (54622; 19% instances), DET (37733; 13% instances), ADJ (12473; 4% instances), VERB (7833; 3% instances), PRON (2754; 1% instances), AUX (668; 0% instances), ADV (156; 0% instances), NUM (98; 0% instances), X (59; 0% instances), ADP (8; 0% instances), SCONJ (8; 0% instances), CCONJ (6; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances).

NOUN

54622 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (34459; 63%).

NOUN tokens may have the following values of Gender:

Paradigm fineMascFem
Number=Singfinefine, fin
Number=Plurfini

Gender seems to be lexical feature of NOUN. 99% lemmas (5507) occur only with one value of Gender.

DET

37733 DET tokens (86% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (34990; 93%), Definite=Def (30875; 82%), Number=Sing (25954; 69%).

DET tokens may have the following values of Gender:

Paradigm ilMascFem
Definite=Def|Number=Sing|PronType=Artil, lo, gli, i, l'la
Definite=Def|Number=Plur|PronType=Arti, gli, ille, il, i
Number=Singlo
Number=Plurile

ADJ

12473 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (7816; 63%).

ADJ tokens may have the following values of Gender:

Paradigm altroMascFem
Number=Singaltroaltra
Number=Sing|PronType=Demaltro
Number=Sing|PronType=Indaltroaltra
Number=Pluraltrialtre

VERB

7833 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7833; 100%), Person=EMPTY (7833; 100%), Tense=Past (7772; 99%), VerbForm=Part (7772; 99%), Number=Sing (5633; 72%).

VERB tokens may have the following values of Gender:

Paradigm fareMascFem
Number=Singfatto, salvofatta
Number=Plurfattifatte

PRON

2754 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Clitic=EMPTY (2154; 78%), Number=Sing (1987; 72%), Person=EMPTY (1926; 70%).

PRON tokens may have the following values of Gender:

Paradigm quelloMascFem
Number=Singquello, quelquella
Number=Plurquelliquelle

AUX

668 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (668; 100%), Person=EMPTY (668; 100%), Tense=Past (667; 100%), VerbForm=Part (667; 100%), Number=Sing (504; 75%).

AUX tokens may have the following values of Gender:

Paradigm essereMascFem
Number=Sing|PronType=Rel|Tense=Past|VerbForm=Partstata
Number=Sing|Tense=Past|VerbForm=Partstatostata
Number=Pluressere
Number=Plur|Tense=Past|VerbForm=Partstatistate

ADV

156 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: PronType=EMPTY (154; 99%).

ADV tokens may have the following values of Gender:

Paradigm moltoMascFem
Number=Singmolto
Number=PlurMolte

Gender seems to be lexical feature of ADV. 97% lemmas (71) occur only with one value of Gender.

NUM

98 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (98; 100%).

NUM tokens may have the following values of Gender:

Paradigm unoMascFem
un, unoun', una

X

59 X tokens (15% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (59; 100%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (27) occur only with one value of Gender.

ADP

8 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

SCONJ

8 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

CCONJ

6 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

INTJ

1 INTJ tokens (1% of all INTJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which INTJ and Gender co-occurred: Polarity=EMPTY (1; 100%).

INTJ tokens may have the following values of Gender:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (30335; 83%), NOUN –[amod]–> ADJ (9788; 61%), NOUN –[conj]–> NOUN (2595; 58%), NOUN –[advcl]–> VERB (1307; 76%), VERB –[nsubj:pass]–> NOUN (1075; 93%), NOUN –[det:poss]–> DET (961; 77%), VERB –[conj]–> VERB (339; 51%), NOUN –[det:predet]–> DET (331; 95%), NOUN –[nsubj]–> NOUN (162; 53%), ADJ –[amod]–> ADJ (137; 58%).