Treebank Statistics: UD_Italian-Valico: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
2860 tokens (42%) have a non-empty value of Gender
.
732 types (55%) occur at least once with a non-empty value of Gender
.
587 lemmas (61%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (1012; 15% instances), DET (1008; 15% instances), VERB (384; 6% instances), ADJ (240; 4% instances), PRON (194; 3% instances), AUX (20; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances).
NOUN
1012 NOUN tokens (98% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (878; 87%).
NOUN
tokens may have the following values of Gender
:
Fem
(399; 39% of non-emptyGender
): donna, ragazza, terra, spalle, borsa, città, spalla, cosa, situazione, casaMasc
(613; 61% of non-emptyGender
): uomo, parco, ragazzo, giornale, amore, momento, banco, giorno, marito, fidanzatoEMPTY
(18): amante, delinquente, sol, OCCHIALI, Parco, X, affari, diem, discaount, giornale
Paradigm amico | Masc | Fem |
---|---|---|
Number=Sing | amico | amica |
Number=Plur | amici |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (286) occur only with one value of Gender
.
DET
1008 DET tokens (99% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (919; 91%), Poss=EMPTY (905; 90%), PronType=Art (830; 82%), Definite=Def (605; 60%).
DET
tokens may have the following values of Gender
:
Fem
(387; 38% of non-emptyGender
): la, una, le, sua, un’, l’, mia, questa, altra, delleMasc
(621; 62% of non-emptyGender
): il, un, l’, suo, i, mio, questo, altro, lo, gliEMPTY
(15): che, ogni, qualche, alcun, loro, tutti
Paradigm suo | Masc | Fem |
---|---|---|
Number=Sing|Poss=Yes|PronType=Prs | suo | sua |
Number=Plur | sui | |
Number=Plur|Poss=Yes|PronType=Prs | suoi | sue, suoe |
VERB
384 VERB tokens (40% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (384; 100%), Person=EMPTY (384; 100%), Tense=Past (384; 100%), VerbForm=Part (384; 100%), Number=Sing (380; 99%).
VERB
tokens may have the following values of Gender
:
Fem
(31; 8% of non-emptyGender
): andata, salvata, arrivata, alzata, arrabbiata, arrabiata, caduta, chiamata, chiusa, comprataMasc
(353; 92% of non-emptyGender
): detto, visto, fatto, pensato, sentito, seduto, cominciato, andato, gridato, salvatoEMPTY
(583): era, portava, aveva, leggendo, sembrava, leggeva, fare, andare, gridava, leggere
Paradigm vedere | Masc | Fem |
---|---|---|
visto, veduto | vista |
ADJ
240 ADJ tokens (77% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (218; 91%).
ADJ
tokens may have the following values of Gender
:
Fem
(89; 37% of non-emptyGender
): bella, contenta, bionda, arrabiata, strana, arraviata, brutta, cattiva, destra, furiosaMasc
(151; 63% of non-emptyGender
): brutto, simpatico, bel, bello, carino, muscoloso, timido, improviso, nuovo, robustoEMPTY
(70): grande, forte, felice, giovane, gentile, normale, incredibile, interessante, triste, SPLENDENTE
Paradigm brutto | Masc | Fem |
---|---|---|
brutto | brutta |
PRON
194 PRON tokens (38% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (172; 89%), PronType=Prs (149; 77%), Person=3 (144; 74%), Clitic=Yes (99; 51%).
PRON
tokens may have the following values of Gender
:
Fem
(80; 41% of non-emptyGender
): la, lei, le, l’, questa, quella, altre, molte, queste, unaMasc
(114; 59% of non-emptyGender
): lui, lo, l’, gli, quello, li, questo, nessuno, tutti, entrambiEMPTY
(315): che, mi, si, me, niente, c’, io, qualcosa, se, ne
Paradigm quello | Masc | Fem |
---|---|---|
quello | quella |
AUX
20 AUX tokens (3% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (20; 100%), Person=EMPTY (20; 100%), Tense=Past (20; 100%), VerbForm=Part (20; 100%), Number=Sing (19; 95%).
AUX
tokens may have the following values of Gender
:
Fem
(3; 15% of non-emptyGender
): stataMasc
(17; 85% of non-emptyGender
): stato, dovuto, potuto, stati, viuto, volutoEMPTY
(572): ha, ho, era, è, sono, aveva, stava, ero, avevo, essere
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing | stato | stata |
Number=Plur | stati |
ADV
1 ADV tokens (0% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: PronType=EMPTY (1; 100%).
ADV
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): perEMPTY
(389): non, molto, Ieri, poi, come, più, anche, così, invece, subito
NUM
1 NUM tokens (6% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=EMPTY (1; 100%).
NUM
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): diecisetteEMPTY
(17): due, 1, 10, 28, 30, 700, 9, 95, 99, catordieci
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (847; 96%),
NOUN –[amod]–> ADJ (129; 76%),
NOUN –[det:poss]–> DET (95; 93%),
VERB –[conj]–> VERB (60; 51%),
ADJ –[nsubj]–> NOUN (20; 69%),
ADJ –[conj]–> ADJ (15; 60%),
NOUN –[nsubj]–> NOUN (8; 73%),
ADJ –[det]–> DET (7; 100%),
VERB –[ccomp]–> NOUN (6; 67%),
VERB –[obl]–> ADJ (6; 67%).