Treebank Statistics: UD_Italian-VIT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
116481 tokens (42%) have a non-empty value of Gender
.
12734 types (54%) occur at least once with a non-empty value of Gender
.
8506 lemmas (54%) occur at least once with a non-empty value of Gender
.
The feature is used with 14 part-of-speech tags: NOUN (54690; 20% instances), DET (37731; 13% instances), ADJ (12482; 4% instances), VERB (7838; 3% instances), PRON (2804; 1% instances), AUX (668; 0% instances), ADV (149; 0% instances), NUM (98; 0% instances), X (8; 0% instances), ADP (6; 0% instances), CCONJ (4; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances).
NOUN
54690 NOUN tokens (95% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (34527; 63%).
NOUN
tokens may have the following values of Gender
:
Fem
(24110; 44% of non-emptyGender
): società, attività, parte, legge, titolarità, provincia, città, sede, domanda, gestioneMasc
(30580; 56% of non-emptyGender
): anni, miliardi, anno, posti, presidente, punto, governo, stato, gruppo, lavoroEMPTY
(3069): n., art., insegnanti, dpr, mila, a, b, docenti, dl, via
Paradigm fine | Masc | Fem |
---|---|---|
Number=Sing | fine | fine, fin |
Number=Plur | fini |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (5508) occur only with one value of Gender
.
DET
37731 DET tokens (86% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (34990; 93%), Definite=Def (30875; 82%), Number=Sing (25954; 69%).
DET
tokens may have the following values of Gender
:
Fem
(15861; 42% of non-emptyGender
): la, le, una, un’, sua, questa, tutte, queste, sue, quellaMasc
(21870; 58% of non-emptyGender
): il, i, un, gli, lo, questo, suo, tutti, questi, unoEMPTY
(6176): l’, loro, ogni, tale, il, qualche, tali, che, quest’, cui
Paradigm il | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing|PronType=Art | il, lo, gli, i, l' | la |
Definite=Def|Number=Plur|PronType=Art | i, gli, il | le, il, i |
Number=Sing | il, lo | |
Number=Plur | i | le |
ADJ
12482 ADJ tokens (62% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (7828; 63%).
ADJ
tokens may have the following values of Gender
:
Fem
(5638; 45% of non-emptyGender
): altre, nuova, italiana, altra, nuove, politica, stessa, pubblica, economica, politicheMasc
(6844; 55% of non-emptyGender
): altri, nuovo, economico, stesso, nuovi, scorso, altro, finanziario, ultimo, italianoEMPTY
(7685): precedente, primo, grande, presente, ex, netto, generale, grandi, nazionale, sociale
Paradigm altro | Masc | Fem |
---|---|---|
Number=Sing | altro | altra |
Number=Sing|PronType=Dem | altro | |
Number=Sing|PronType=Ind | altro | altra |
Number=Plur | altri | altre |
VERB
7838 VERB tokens (37% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (7838; 100%), Person=EMPTY (7838; 100%), Tense=Past (7777; 99%), VerbForm=Part (7777; 99%), Number=Sing (5635; 72%).
VERB
tokens may have the following values of Gender
:
Fem
(2293; 29% of non-emptyGender
): prevista, indicate, presentata, comprese, effettuata, fatta, richiesta, data, previste, richiesteMasc
(5545; 71% of non-emptyGender
): fatto, detto, approvato, previsto, avuto, previsti, deciso, ottenuto, visto, datoEMPTY
(13557): è, ha, fare, fa, far, hanno, dice, sono, avere, scade
Paradigm fare | Masc | Fem |
---|---|---|
Number=Sing | fatto, salvo | fatta |
Number=Plur | fatti | fatte |
PRON
2804 PRON tokens (29% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Clitic=EMPTY (2204; 79%), Number=Sing (2034; 73%), Person=EMPTY (1976; 70%).
PRON
tokens may have the following values of Gender
:
Fem
(716; 26% of non-emptyGender
): quella, la, quelle, le, una, essa, questa, queste, altra, esseMasc
(2088; 74% of non-emptyGender
): lo, quello, quale, quelli, quanto, questo, tutti, gli, li, luiEMPTY
(6987): che, si, cui, ci, c’, ne, mi, dove, chi, quali
Paradigm quello | Masc | Fem |
---|---|---|
Number=Sing | quello, quel | quella |
Number=Plur | quelli | quelle |
AUX
668 AUX tokens (7% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (668; 100%), Person=EMPTY (668; 100%), Tense=Past (667; 100%), VerbForm=Part (667; 100%), Number=Sing (504; 75%).
AUX
tokens may have the following values of Gender
:
Fem
(203; 30% of non-emptyGender
): stata, state, dovuta, fatta, volutaMasc
(465; 70% of non-emptyGender
): stato, stati, potuto, dovuto, voluto, fatto, dovuti, essere, volutiEMPTY
(8754): è, ha, sono, essere, hanno, era, sarà, deve, può, sia
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing|PronType=Rel|Tense=Past|VerbForm=Part | stata | |
Number=Sing|Tense=Past|VerbForm=Part | stato | stata |
Number=Plur | essere | |
Number=Plur|Tense=Past|VerbForm=Part | stati | state |
ADV
149 ADV tokens (1% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: PronType=EMPTY (147; 99%).
ADV
tokens may have the following values of Gender
:
Fem
(44; 30% of non-emptyGender
): estremamente, inizialmente, costantemente, normalmente, celermente, contrariamente, lungamente, solamente, una, MolteMasc
(105; 70% of non-emptyGender
): volta, molto, poco, fa, lungo, troppo, no, seguito, casual, dietroEMPTY
(10723): non, più, anche, solo, così, già, ancora, ieri, poi, sempre
Paradigm molto | Masc | Fem |
---|---|---|
Number=Sing | molto | |
Number=Plur | Molte |
Gender
seems to be lexical feature of ADV
. 97% lemmas (67) occur only with one value of Gender
.
NUM
98 NUM tokens (2% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (98; 100%).
NUM
tokens may have the following values of Gender
:
Fem
(35; 36% of non-emptyGender
): un’, terza, una, mezzaMasc
(63; 64% of non-emptyGender
): miliardi, milioni, un, primi, terzi, bis, rientro, unoEMPTY
(6295): due, tre, cento, 15, 1, 5, 1973, 2, 20, 30
Paradigm uno | Masc | Fem |
---|---|---|
un, uno | un', una |
X
8 X tokens (2% of all X
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which X
and Gender
co-occurred: Foreign=Yes (8; 100%).
X
tokens may have the following values of Gender
:
Fem
(3; 38% of non-emptyGender
): area, deregulation, mountainMasc
(5; 63% of non-emptyGender
): local, network, personal, show, word-processingEMPTY
(384): joint, venture, station, work, baby, cd, sitter, computer, condicio, par
ADP
6 ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
ADP
tokens may have the following values of Gender
:
Masc
(6; 100% of non-emptyGender
): dietro, per, ne, niente, rispettoEMPTY
(45561): di, a, in, per, da, con, su, tra, ad, come
CCONJ
4 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Gender
.
CCONJ
tokens may have the following values of Gender
:
Fem
(1; 25% of non-emptyGender
): essaMasc
(3; 75% of non-emptyGender
): altro, caso, quantoEMPTY
(8204): e, ma, o, ed, come, sia, che, cioè, ovvero, nonché
INTJ
1 INTJ tokens (1% of all INTJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which INTJ
and Gender
co-occurred: Polarity=EMPTY (1; 100%).
INTJ
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): okEMPTY
(77): no, sì, basta, Bè, Ecco, addio, avanti, macché, oh, Beh
PUNCT
1 PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Gender
.
PUNCT
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): leEMPTY
(31584): ,, ., “, ), -, (, :, ?, ;, «/em>
SCONJ
1 SCONJ tokens (0% of all SCONJ
tokens) have a non-empty value of Gender
.
SCONJ
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): addebitatiEMPTY
(2199): che, se, perché, quando, mentre, come, qualora, poiché, affinché, ove
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (30364; 83%),
NOUN –[amod]–> ADJ (9793; 61%),
NOUN –[conj]–> NOUN (2596; 58%),
NOUN –[advcl]–> VERB (1297; 76%),
VERB –[nsubj:pass]–> NOUN (1075; 93%),
NOUN –[det:poss]–> DET (960; 77%),
VERB –[conj]–> VERB (338; 51%),
NOUN –[det:predet]–> DET (333; 95%),
NOUN –[nsubj]–> NOUN (162; 53%),
ADJ –[amod]–> ADJ (137; 58%).