Treebank Statistics: UD_Italian-Old: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
40557 tokens (33%) have a non-empty value of Gender
.
6290 types (51%) occur at least once with a non-empty value of Gender
.
3820 lemmas (60%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (15906; 13% instances), DET (14197; 12% instances), ADJ (4863; 4% instances), PRON (3180; 3% instances), VERB (2328; 2% instances), AUX (49; 0% instances), ADV (33; 0% instances), PROPN (1; 0% instances).
NOUN
15906 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (12680; 80%).
NOUN
tokens may have the following values of Gender
:
Fem
(7357; 46% of non-emptyGender
): terra, gente, parte, mente, donna, vita, parole, luce, anima, vistaMasc
(8549; 54% of non-emptyGender
): occhi, mondo, maestro, ciel, viso, loco, duca, amor, lume, tempoEMPTY
(50): i, quando, sì, O, P, diece, due, no, più, sei
Paradigm braccio | Masc | Fem |
---|---|---|
Number=Sing | braccio | |
Number=Plur | braccia |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (2520) occur only with one value of Gender
.
DET
14197 DET tokens (95% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Poss=EMPTY (12228; 86%), Number=Sing (11100; 78%), PronType=Art (9602; 68%), Definite=Def (9209; 65%).
DET
tokens may have the following values of Gender
:
Fem
(5803; 41% of non-emptyGender
): la, l’, le, sua, mia, una, quella, questa, tua, altraMasc
(8394; 59% of non-emptyGender
): il, ‘l, l’, li, lo, un, i, mio, suo, quelEMPTY
(772): lor, tal, qual, ogne, più, altro, altra, altrui, alcun, quale
Paradigm il | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing|PronType=Art | il, 'l, l', lo, l, sul, i | la, l' |
Definite=Def|Number=Plur|PronType=Art | li, i, il, ', l', l, ne | le, l', il |
Number=Plur|PronType=Dem | le |
ADJ
4863 ADJ tokens (94% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (3765; 77%).
ADJ
tokens may have the following values of Gender
:
Fem
(2205; 45% of non-emptyGender
): prima, bella, alta, santa, sola, divina, buona, umana, viva, lungaMasc
(2658; 55% of non-emptyGender
): alto, primo, buon, dolce, etterno, gran, santo, secondo, novo, vivoEMPTY
(313): gran, dolce, grande, più, maggior, dolente, forte, gravi, men, cortese
Paradigm grande | Masc | Fem |
---|---|---|
Degree=Cmp|Number=Sing | grande | grande |
Number=Sing | gran, grande | gran, grande, Grand' |
Number=Plur | grandi, gran | gran, grandi |
PRON
3180 PRON tokens (22% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (3177; 100%), Poss=EMPTY (3143; 99%), Number=Sing (2693; 85%), Clitic=EMPTY (2450; 77%), PronType=Prs (1642; 52%), Person=3 (1619; 51%).
PRON
tokens may have the following values of Gender
:
Fem
(794; 25% of non-emptyGender
): la, lei, quella, ella, le, una, l’, essa, questa, altraMasc
(2386; 75% of non-emptyGender
): lui, quel, li, elli, lo, colui, altro, un, el, queiEMPTY
(11009): che, si, io, mi, ch’, tu, s’, ti, me, m’
Paradigm quello | Masc | Fem |
---|---|---|
Number=Sing|Person=1 | quel, quei, quelli, quello, Quell' | quella |
Number=Sing | quel, quei, quello, quelli, quell' | quella, Quell' |
Number=Plur|Person=1 | quei, quelli, que' | quelle |
Number=Plur | quei, quelli | quelle |
VERB
2328 VERB tokens (14% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (2328; 100%), VerbForm=Part (2321; 100%), Aspect=Perf (1935; 83%), Number=Sing (1889; 81%), Person=EMPTY (1816; 78%), Voice=Pass (1645; 71%), Tense=EMPTY (1435; 62%).
VERB
tokens may have the following values of Gender
:
Fem
(593; 25% of non-emptyGender
): fatta, veduta, morta, volta, stretta, aperta, aperte, rotta, partita, scioltaMasc
(1735; 75% of non-emptyGender
): fatto, tratto, detto, giunto, vòlto, messo, venuto, fatti, morti, chiusoEMPTY
(14581): disse, fa, vidi, veder, vedi, fare, ha, fece, va, fé
Paradigm fare | Masc | Fem |
---|---|---|
Aspect=Perf|Number=Sing|Person=1|Tense=Past | fatto | |
Aspect=Perf|Number=Sing|Person=1|Tense=Past|Voice=Act | fatt' | |
Aspect=Perf|Number=Sing|Person=2|Tense=Past | fatto | |
Aspect=Perf|Number=Sing|Person=2|Tense=Past|Voice=Act | fatto, fatt' | |
Aspect=Perf|Number=Sing|Person=3|Tense=Past | fatto, fatte | |
Aspect=Perf|Number=Sing|Person=3|Tense=Past|Voice=Act | fatto, fatte, fatt' | |
Aspect=Perf|Number=Sing|Person=3|Tense=Past|Voice=Pass | fatto | |
Aspect=Perf|Number=Sing|Tense=Past | fatto | |
Aspect=Perf|Number=Sing|Tense=Past|Voice=Pass | fatto | fatta |
Aspect=Perf|Number=Sing|Voice=Pass | fatto, fatta, fatt', fatte | fatta |
Aspect=Perf|Number=Plur|Tense=Past|Voice=Pass | fatti | |
Aspect=Perf|Number=Plur|Voice=Pass | fatte | |
Number=Sing|Person=1|Tense=Past|Voice=Pass | fatta | |
Number=Sing|Person=3|Tense=Past | fatto | |
Number=Sing|Person=3|Tense=Past|Voice=Act | fatto | |
Number=Sing|Person=3|Tense=Past|Voice=Pass | fatta | |
Number=Sing|Tense=Past | fatto | fatta |
Number=Sing|Tense=Past|Voice=Act | fatto | fatt' |
Number=Sing|Tense=Past|Voice=Pass | fatto | fatta |
Number=Plur|Tense=Past | fatti | |
Number=Plur|Tense=Past|Voice=Act | fatti | |
Number=Plur|Tense=Past|Voice=Pass | fatti | |
Number=Plur|Voice=Act | fatti |
AUX
49 AUX tokens (1% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (49; 100%), VerbForm=Part (48; 98%), Aspect=Perf (47; 96%), Tense=Past (46; 94%), Number=Sing (43; 88%), Voice=EMPTY (42; 86%), Person=3 (26; 53%).
AUX
tokens may have the following values of Gender
:
Fem
(6; 12% of non-emptyGender
): state, dee, stataMasc
(43; 88% of non-emptyGender
): stato, è, fosse, fossero, fossi, son, potuto, stata, stati, volutoEMPTY
(3480): è, fu, era, son, esser, fui, se’, avea, ha, fosse
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing|Person=1|Tense=Past | stato, fossi, son, sono, stati | |
Number=Sing|Person=3|Tense=Past | stato, è, fosse, fossero, stata, state, stati | stata |
Number=Sing|Voice=Pass | stato | |
Number=Plur|Person=1|Tense=Past | fossimo | |
Number=Plur|Person=2|Tense=Past | state | |
Number=Plur|Person=3|Tense=Past | state | |
Number=Plur|Tense=Past | state | |
Number=Plur|Voice=Pass | state |
ADV
33 ADV tokens (0% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: PronType=Ind (24; 73%).
ADV
tokens may have the following values of Gender
:
Masc
(33; 100% of non-emptyGender
): poco, ben, tosto, ‘ncontro, molto, quanto, secondo, sùbito, tantoEMPTY
(10369): non, sì, più, come, poi, così, là, già, qui, tanto
PROPN
1 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): TesoroEMPTY
(1872): Dio, Beatrice, Cristo, Virgilio, Maria, Pietro, Roma, Fiorenza, Carlo, Guido
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (10311; 96%),
NOUN –[amod]–> ADJ (2777; 92%),
NOUN –[det:poss]–> DET (1769; 93%),
NOUN –[acl]–> VERB (436; 72%),
NOUN –[conj]–> NOUN (384; 53%),
PRON –[det]–> DET (325; 61%),
ADJ –[conj]–> ADJ (223; 93%),
ADJ –[nsubj]–> NOUN (213; 93%),
ADJ –[det]–> DET (114; 95%),
NOUN –[nsubj]–> NOUN (71; 59%).