Treebank Statistics: UD_Italian-PUD: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
12701 tokens (54%) have a non-empty value of Gender.
4689 types (77%) occur at least once with a non-empty value of Gender.
3977 lemmas (85%) occur at least once with a non-empty value of Gender.
The feature is used with 11 part-of-speech tags (token count; percentage of all tokens in the treebank): NOUN (4384; 18%), DET (3783; 16%), PROPN (1739; 7%), ADJ (1599; 7%), PRON (609; 3%), VERB (449; 2%), AUX (60; 0%), ADP (54; 0%), NUM (20; 0%), ADV (2; 0%), CCONJ (2; 0%).
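Counts of this kind can be reproduced from a CoNLL-U file by checking the FEATS column of every token. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is hand-made for illustration, and the helper names `parse_feats`, `read_tokens` and `gender_by_upos` are hypothetical.

```python
from collections import Counter

# Tiny hand-made CoNLL-U fragment (illustrative only, not taken from the treebank).
SAMPLE = """\
1\tLa\til\tDET\tRD\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tcittà\tcittà\tNOUN\tS\tGender=Fem\t3\tnsubj\t_\t_
3\tcresce\tcrescere\tVERB\tV\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
4\t.\t.\tPUNCT\tFS\t_\t3\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_by_upos(text):
    """Return (total token count, Counter of tokens with Gender per UPOS)."""
    total = 0
    with_gender = Counter()
    for _form, _lemma, upos, feats in read_tokens(text):
        total += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender


total, counts = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos} ({n}; {round(100 * n / total)}% instances)")
```

The percentage here is taken over all tokens, which is an assumption based on the figures above (the per-tag counts sum to 12701).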
NOUN
4384 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3086; 70%).
NOUN tokens may have the following values of Gender:
Fem (1897; 43% of non-empty Gender): parte, città, persone, volta, guerra, vita, popolazione, regione, storia, crescita
Masc (2487; 57% of non-empty Gender): anni, anno, governo, secolo, stato, tempo, giorno, mondo, numero, periodo
EMPTY (8): fronte, Più, Qualunque, chiunque, milione, post, verificar
Paradigm fine | Masc | Fem |
---|---|---|
Number=Sing | fine | fine |
Number=Plur | fini |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (1785) occur with only one value of Gender.
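The "lexical feature" judgement rests on how many lemmas are seen with a single value of Gender. A rough sketch of that calculation follows. The observations are toy data, the function name `single_value_share` is hypothetical, and the page does not state what threshold it applies.

```python
from collections import defaultdict

# Toy (lemma, Gender) observations for NOUN tokens; not real treebank counts.
OBSERVATIONS = [
    ("città", "Fem"), ("città", "Fem"),
    ("anno", "Masc"), ("anno", "Masc"),
    ("governo", "Masc"),
    ("fine", "Masc"), ("fine", "Fem"),  # a lemma attested with both values
]


def single_value_share(observations):
    """Return (lemmas with one Gender value, all lemmas, their ratio)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), single / len(values)


single, total, share = single_value_share(OBSERVATIONS)
print(f"{round(100 * share)}% of lemmas ({single}) occur with only one value of Gender")
```

A lemma such as fine, which is attested with both Masc and Fem, counts against the share. This is why the figure stays below 100% even for a largely lexical feature.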
DET
3783 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2838; 75%), Definite=EMPTY (2308; 61%), PronType=EMPTY (2306; 61%).
DET tokens may have the following values of Gender:
Fem (1600; 42% of non-empty Gender): la, le, l’, una, un’, questa, altre, molte, queste, diverse
Masc (2183; 58% of non-empty Gender): il, i, un, l’, gli, lo, questo, ciò, molti, altri
EMPTY (2): Ciò, l’
Paradigm il | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing|PronType=Art | il, l', lo | la, l' |
Definite=Def|Number=Plur|PronType=Art | i, gli | le |
Number=Sing | il, l', lo | la, l' |
Number=Plur | i, gli | le, la |
PROPN
1739 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1598; 92%).
PROPN tokens may have the following values of Gender:
Fem (777; 45% of non-empty Gender): Cina, Francia, guerra, Europa, Gran, Hong, Italia, Kong, Russia, Albania
Masc (962; 55% of non-empty Gender): Mar, Stati, Uniti, Trump, Mediterraneo, Nord, Regno, Caraibi, Donald, Joseph
EMPTY (17): Doss, Target, ETA, Income, Mailis, Michael, Multi, Reach, Return, St.
Paradigm Trump | Masc | Fem |
---|---|---|
Trump | Trump |
Gender seems to be a lexical feature of PROPN. 97% of lemmas (1161) occur with only one value of Gender.
ADJ
1599 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1112; 70%).
ADJ tokens may have the following values of Gender:
Fem (689; 43% of non-empty Gender): prima, maggior, gran, grande, alta, maggiore, meridionale, americana, nuova, seconda
Masc (910; 57% of non-empty Gender): ultimi, nuovi, nuovo, stesso, primo, tutti, tutto, grande, scorso, ultimo
EMPTY (16): ex, evidente, probabile, nord, poca, possibile, post, presente, significativamente, soddisfatta
Paradigm nuovo | Masc | Fem |
---|---|---|
Number=Sing | nuovo | nuova |
Number=Plur | nuovi | nuove |
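A paradigm table like the one for nuovo can be assembled by grouping a lemma's word forms by their Number and Gender values. The sketch below assumes the tokens have already been filtered to a single lemma. The `TOKENS` list is illustrative, and `paradigm` and `render` are hypothetical helper names.

```python
from collections import defaultdict

# Illustrative (form, Gender, Number) triples for the lemma "nuovo".
TOKENS = [
    ("nuovo", "Masc", "Sing"),
    ("nuova", "Fem", "Sing"),
    ("nuovi", "Masc", "Plur"),
    ("nuove", "Fem", "Plur"),
    ("Nuovo", "Masc", "Sing"),  # sentence-initial capitalisation
]


def paradigm(tokens):
    """Group lowercased forms by (Number, Gender)."""
    cells = defaultdict(set)
    for form, gender, number in tokens:
        cells[(number, gender)].add(form.lower())
    return cells


def render(lemma, cells):
    """Format the cells as a pipe table in the layout used on this page."""
    lines = [f"Paradigm {lemma} | Masc | Fem |", "---|---|---|"]
    for number in ("Sing", "Plur"):
        masc = ", ".join(sorted(cells.get((number, "Masc"), ())))
        fem = ", ".join(sorted(cells.get((number, "Fem"), ())))
        lines.append(f"Number={number} | {masc} | {fem} |")
    return "\n".join(lines)


print(render("nuovo", paradigm(TOKENS)))
```

Lowercasing the forms is a design choice made here so that capitalised variants do not produce duplicate entries. Whether the page's generator does the same is not stated.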
PRON
609 PRON tokens (71% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (525; 86%), Number=Sing (439; 72%), Number[psor]=EMPTY (384; 63%), PronType=EMPTY (368; 60%).
PRON tokens may have the following values of Gender:
Fem (249; 41% of non-empty Gender): che, sua, loro, sue, cui, quella, propria, le, la, lei
Masc (360; 59% of non-empty Gender): che, suo, lo, loro, questo, cui, suoi, gli, lui, quali
EMPTY (252): si, ci, c’, mi, ne, chi, me, noi, se, vi
Paradigm che | Masc | Fem |
---|---|---|
Number=Sing | che | che |
Number=Plur | che | che |
VERB
449 VERB tokens (22% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Aspect=EMPTY (449; 100%), Mood=EMPTY (449; 100%), Person=EMPTY (449; 100%), Tense=Past (424; 94%), Number=Sing (287; 64%), Voice=EMPTY (282; 63%).
VERB tokens may have the following values of Gender:
Fem (179; 40% of non-empty Gender): considerate, diventata, usata, basata, cresciute, lasciata, pubblicate, riguardanti, utilizzata, considerata
Masc (270; 60% of non-empty Gender): fatto, utilizzato, inclusi, stato, accusato, detto, diretto, fondato, pubblicato, seguiti
EMPTY (1605): affermato, avere, ha, afferma, aveva, iniziò, sono, detto, far, hanno
Paradigm fare | Masc | Fem |
---|---|---|
Number=Sing | fatto | |
Number=Sing|Tense=Past | fatto | fatta |
Number=Sing|Tense=Past|Voice=Pass | fatto | |
Number=Plur|Tense=Past | | fatte |
AUX
60 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=EMPTY (60; 100%), Mood=EMPTY (60; 100%), Person=EMPTY (60; 100%), Tense=Past (60; 100%), Voice=EMPTY (60; 100%), Number=Sing (45; 75%).
AUX tokens may have the following values of Gender:
Fem (21; 35% of non-empty Gender): stata, state
Masc (39; 65% of non-empty Gender): stato, stati
EMPTY (925): è, ha, sono, era, fu, essere, hanno, venne, può, erano
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing | stato | stata |
Number=Plur | stati | state |
ADP
54 ADP tokens (1% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (14; 26% of non-empty Gender): alla, della, alle, dalla, dell’, nella
Masc (40; 74% of non-empty Gender): al, del, dell’, all’, Da, Sulla, ai, allo, dagli, dai
EMPTY (3754): di, in, a, per, da, con, su, come, tra, dopo
Paradigm di | Masc | Fem |
---|---|---|
del, dell', dello | della, dell' |
NUM
20 NUM tokens (5% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (2; 10% of non-empty Gender): una
Masc (18; 90% of non-empty Gender): uno, un
EMPTY (424): due, tre, 10, milioni, 1, quattro, 3, sei, 20, 2014
Paradigm uno | Masc | Fem |
---|---|---|
uno, un | una |
ADV
2 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (2; 100%).
ADV tokens may have the following values of Gender:
Masc (2; 100% of non-empty Gender): più
EMPTY (847): non, più, anche, Tuttavia, solo, ancora, inoltre, dove, già, prima
CCONJ
2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.
CCONJ tokens may have the following values of Gender:
Fem (1; 50% of non-empty Gender): che
Masc (1; 50% of non-empty Gender): che
EMPTY (580): e, ma, o, ed, sia, che, mentre, quindi, né, oppure
Paradigm che | Masc | Fem |
---|---|---|
che | che |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3023; 100%),
NOUN –[amod]–> ADJ (1219; 99%),
PROPN –[det]–> DET (516; 100%),
PROPN –[flat]–> PROPN (401; 99%),
NOUN –[nmod]–> PROPN (276; 61%),
NOUN –[det:poss]–> PRON (225; 99%),
NOUN –[conj]–> NOUN (141; 59%),
PROPN –[amod]–> ADJ (130; 99%),
NOUN –[acl]–> VERB (124; 66%),
VERB –[nsubj:pass]–> NOUN (107; 94%).
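Agreement counts of this kind can be gathered by comparing the Gender value of each node with that of its head. The sketch below uses a hypothetical six-token tree and a hypothetical function name, `gender_agreement`. It also assumes that the percentage is taken over parent-child pairs in which both nodes have a Gender value, which the page does not state explicitly.

```python
from collections import Counter

# Hypothetical tree as (id, upos, gender_or_None, head_id, deprel) tuples.
SENTENCE = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 5, "nsubj:pass"),
    (3, "ADJ", "Fem", 2, "amod"),
    (4, "AUX", None, 5, "aux:pass"),
    (5, "VERB", "Fem", 0, "root"),
    (6, "NOUN", "Masc", 2, "conj"),  # conjunct with a different gender
]


def gender_agreement(sentence):
    """Count parent-child pairs that agree in Gender, per relation type."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1  # both nodes carry a Gender value
        if parent[2] == gender:
            agree[key] += 1
    return agree, both


agree, both = gender_agreement(SENTENCE)
for (parent_pos, deprel, child_pos), n_both in both.items():
    n = agree[(parent_pos, deprel, child_pos)]
    print(f"{parent_pos} –[{deprel}]–> {child_pos} ({n}; {round(100 * n / n_both)}%)")
```

Relations such as conj naturally show lower agreement, since coordinated nouns need not share a gender.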