Treebank Statistics: UD_Italian-ParlaMint: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
8463 tokens (41%) have a non-empty value of Gender.
1959 types (58%) occur at least once with a non-empty value of Gender.
1500 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (4065; 20%), DET (2728; 13%), ADJ (748; 4%), VERB (543; 3%), PRON (317; 2%), AUX (61; 0%), ADP (1; 0%).
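Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample rows are a hypothetical toy sentence, the helper names (`parse_feats`, `read_tokens`, `coverage`) are made up for illustration, and treating types as case-sensitive word forms is an assumption.

```python
from collections import Counter

# Hypothetical toy sentence; columns follow the CoNLL-U order
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
ROWS = [
    "1 La il DET _ Definite=Def|Gender=Fem|Number=Sing|PronType=Art 2 det _ _",
    "2 legge legge NOUN _ Gender=Fem|Number=Sing 4 nsubj _ _",
    "3 è essere AUX _ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin 4 cop _ _",
    "4 possibile possibile ADJ _ Number=Sing 0 root _ _",
]
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means the token has no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def coverage(tokens, feature="Gender"):
    """Return (with feature, total) pairs for tokens, types and lemmas."""
    tokens = list(tokens)
    with_feat = [t for t in tokens if feature in t[3]]
    return {
        "tokens": (len(with_feat), len(tokens)),
        "types": (len({t[0] for t in with_feat}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_feat}), len({t[1] for t in tokens})),
        "by_upos": Counter(t[2] for t in with_feat),
    }


stats = coverage(read_tokens(SAMPLE))
```

On the toy sentence, two of the four tokens carry Gender, one DET and one NOUN. Dividing each first number by the second gives percentages of the kind reported above.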
NOUN
4065 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2723; 67%).
NOUN tokens may have the following values of Gender:
Fem (1761; 43% of non-empty Gender): famiglia, legge, adozione, carceri, comunità, possibilità, relatrice, responsabilità, polizia, parte
Masc (2304; 57% of non-empty Gender): affidamento, emendamento, signor, ministro, detenuti, n., Governo, giorno, ordine, emendamenti
EMPTY (151): minore, presidente, minori, familiari, fronte, rappresentante, collega, Capigruppo, agenti, grazie
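A per-value breakdown like the one for NOUN can be sketched as follows. This is an illustrative sketch under stated assumptions: the token triples are hypothetical toy data, `value_distribution` is a made-up helper name, and percentages are taken over tokens with a non-empty Gender only, as the "% of non-empty Gender" wording suggests.

```python
from collections import Counter, defaultdict

# Hypothetical (form, upos, gender) triples; None marks a token without Gender.
TOKENS = [
    ("famiglia", "NOUN", "Fem"),
    ("legge", "NOUN", "Fem"),
    ("ministro", "NOUN", "Masc"),
    ("giorno", "NOUN", "Masc"),
    ("ministro", "NOUN", "Masc"),
    ("presidente", "NOUN", None),
    ("la", "DET", "Fem"),
]


def value_distribution(tokens, upos, top_n=10):
    """Map each Gender value (or EMPTY) to (count, share, example forms)."""
    counts = Counter()
    examples = defaultdict(Counter)
    for form, tag, gender in tokens:
        if tag != upos:
            continue
        value = gender or "EMPTY"
        counts[value] += 1
        examples[value][form] += 1
    non_empty = sum(n for value, n in counts.items() if value != "EMPTY")
    report = {}
    for value, n in counts.items():
        # EMPTY has no share: percentages are relative to non-empty tokens.
        share = None if value == "EMPTY" else round(100 * n / non_empty)
        top = [form for form, _ in examples[value].most_common(top_n)]
        report[value] = (n, share, top)
    return report


report = value_distribution(TOKENS, "NOUN")
```

The example forms are ordered by frequency, so `ministro` (seen twice) comes before `giorno` in the toy data.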
Paradigm senatore

| | Masc | Fem |
|---|---|---|
| Number=Sing | senatore | senatrice |
| Number=Plur | senatori | senatrici |
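A paradigm table of this shape can be assembled by grouping the attested forms of one lemma by Number and Gender. The sketch below uses the four forms from the senatore paradigm; the function name `paradigm` and the tuple layout are assumptions made for illustration.

```python
from collections import defaultdict

# Attested forms of the lemma "senatore" as (form, gender, number),
# taken from the senatore paradigm.
FORMS = [
    ("senatore", "Masc", "Sing"),
    ("senatrice", "Fem", "Sing"),
    ("senatori", "Masc", "Plur"),
    ("senatrici", "Fem", "Plur"),
]


def paradigm(forms, genders=("Masc", "Fem"), numbers=("Sing", "Plur")):
    """Return table rows: one row per Number, one cell per Gender."""
    cells = defaultdict(list)
    for form, gender, number in forms:
        if form not in cells[(number, gender)]:
            cells[(number, gender)].append(form)  # keep each form once
    rows = []
    for number in numbers:
        row = [f"Number={number}"]
        # Several forms in one cell are joined with commas; an unattested
        # combination yields an empty cell.
        row += [", ".join(cells[(number, g)]) for g in genders]
        rows.append(row)
    return rows


table = paradigm(FORMS)
```

A combination that never occurs in the data simply produces an empty string, which corresponds to a blank cell in the table.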
Gender seems to be a lexical feature of NOUN: 99% of lemmas (928) occur with only one value of Gender.
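The lexical-feature test amounts to counting how many lemmas are ever seen with more than one value. A minimal sketch, assuming hypothetical (lemma, Gender) observations and a made-up helper name `single_value_share`:

```python
from collections import defaultdict

# Hypothetical (lemma, gender) observations for NOUN tokens with Gender.
OBSERVATIONS = [
    ("senatore", "Masc"),
    ("senatore", "Fem"),
    ("famiglia", "Fem"),
    ("famiglia", "Fem"),
    ("ministro", "Masc"),
    ("giorno", "Masc"),
]


def single_value_share(observations):
    """Return (lemmas with one value, all lemmas, rounded percentage)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), round(100 * single / len(values))


result = single_value_share(OBSERVATIONS)
```

In the toy data only `senatore` occurs with both values, so three of the four lemmas (75%) have a single Gender. A share close to 100% is what suggests the feature is lexical rather than inflectional for that part of speech.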
DET
2728 DET tokens (85% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (2297; 84%), Definite=Def (1976; 72%), Number=Sing (1858; 68%).
DET tokens may have the following values of Gender:
Fem (1154; 42% of non-empty Gender): la, le, una, questa, un’, queste, sua, propria, tutta, delle
Masc (1574; 58% of non-empty Gender): il, i, un, gli, questo, lo, questi, tutti, tutto, suo
EMPTY (491): l’, loro, ogni, tale, qualche, più, che, quell’, quest’, tal
Paradigm il

| | Masc | Fem |
|---|---|---|
| Number=Sing | il, lo | la |
| Number=Plur | i, gli | le |
ADJ
748 ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (511; 68%).
ADJ tokens may have the following values of Gender:
Fem (355; 47% of non-empty Gender): penitenziaria, affidataria, altre, carceraria, prima, stessa, affettiva, odierna, sanitaria, affidatarie
Masc (393; 53% of non-empty Gender): contrario, altri, prolungato, primo, altro, stesso, vero, chiaro, penitenziari, penitenziario
EMPTY (383): possibile, familiare, sociali, presente, bis, generale, verbale, difficile, gravi, nazionale
Paradigm penitenziario

| | Masc | Fem |
|---|---|---|
| Number=Sing | penitenziario | penitenziaria |
| Number=Plur | penitenziari | penitenziarie |
VERB
543 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (543; 100%), Person=EMPTY (543; 100%), VerbForm=Part (542; 100%), Tense=Past (540; 99%), Number=Sing (397; 73%).
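Co-occurrence figures such as Mood=EMPTY can be read as "the token has Gender but no Mood". The sketch below counts other feature values on tokens that carry Gender. The feature dicts are hypothetical toy data, `cooccurring` is a made-up helper name, and both the explicit list of features checked for EMPTY and the "more than half of the tokens" cut-off are assumptions, not the documented behaviour of the script behind this page.

```python
from collections import Counter

# Hypothetical feature dicts for VERB tokens (participles carry Gender).
VERB_FEATS = [
    {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"},
    {"Gender": "Fem", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"},
    {"Gender": "Masc", "Number": "Plur", "Tense": "Past", "VerbForm": "Part"},
    {"Mood": "Ind", "Number": "Sing", "Person": "3", "VerbForm": "Fin"},
]


def cooccurring(feats_list, target="Gender", also=("Mood", "Person")):
    """List (feature=value, count, percent) seen with `target`.

    Features named in `also` are counted as EMPTY when a token lacks them.
    Only pairs found on more than half of the tokens are kept (assumed cut-off).
    """
    with_target = [feats for feats in feats_list if target in feats]
    counts = Counter()
    for feats in with_target:
        for name, value in feats.items():
            if name != target:
                counts[f"{name}={value}"] += 1
        for name in also:
            if name not in feats:
                counts[f"{name}=EMPTY"] += 1
    total = len(with_target)
    return [
        (pair, n, round(100 * n / total))
        for pair, n in counts.most_common()
        if 2 * n > total
    ]


pairs = cooccurring(VERB_FEATS)
```

The finite verb in the toy data has no Gender and is ignored. All three participles lack Mood and Person, which is why those features show up as EMPTY at 100%.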
VERB tokens may have the following values of Gender:
Fem (120; 22% of non-empty Gender): appoggiata, fatta, applicata, avanzata, stata, assunte, avvenute, convocata, costrette, effettuate
Masc (423; 78% of non-empty Gender): fatto, presentato, detto, visto, adottato, dato, affidato, previsto, proposto, accaduto
EMPTY (1497): ha, è, fare, avere, dire, tratta, garantire, chiedo, far, affrontare
Paradigm fare

| | Masc | Fem |
|---|---|---|
| Number=Sing | fatto | fatta |
| Number=Plur | fatti | |
PRON
317 PRON tokens (28% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (255; 80%), Clitic=EMPTY (244; 77%), Person=EMPTY (215; 68%).
PRON tokens may have the following values of Gender:
Fem (77; 24% of non-empty Gender): lei, la, le, quella, questa, quelle, una, queste, altra, ella
Masc (240; 76% of non-empty Gender): lo, questo, quello, quanto, ciò, altro, altri, tutto, li, tutti
EMPTY (822): che, si, ci, cui, mi, chi, c’, noi, lei, io
Paradigm questo

| | Masc | Fem |
|---|---|---|
| Number=Sing | questo | questa |
| Number=Plur | questi | queste |
AUX
61 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (61; 100%), Person=EMPTY (61; 100%), Tense=Past (61; 100%), VerbForm=Part (61; 100%), Number=Sing (36; 59%).
AUX tokens may have the following values of Gender:
Fem (19; 31% of non-empty Gender): state, stata
Masc (42; 69% of non-empty Gender): stato, stati, dovuto, potuto
EMPTY (925): è, sono, essere, ha, deve, hanno, ho, può, abbiamo, sia
Paradigm essere

| | Masc | Fem |
|---|---|---|
| Number=Sing | stato | stata |
| Number=Plur | stati | state |
ADP
1 ADP token (0% of all ADP tokens) has a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): fin
EMPTY (3064): di, in, a, per, da, con, su, ad, come, tra
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (2510; 82%),
NOUN –[amod]–> ADJ (594; 69%),
NOUN –[compound]–> NOUN (59; 70%),
VERB –[conj]–> VERB (45; 58%),
ADJ –[conj]–> ADJ (30; 65%),
NOUN –[nsubj]–> NOUN (17; 74%),
NOUN –[appos]–> NOUN (14; 67%),
NOUN –[conj]–> PRON (8; 57%),
ADJ –[det]–> DET (7; 64%),
PRON –[acl]–> NOUN (7; 88%).
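Agreement counts of this kind can be sketched by walking the dependency tree and comparing Gender on each parent-child pair. The tree below is hypothetical toy data and `agreement` is a made-up helper name. The sketch assumes the percentage is the share of agreeing pairs among pairs of that type in which both nodes carry Gender; the denominator used for this page is not stated, so treat that as an assumption.

```python
from collections import Counter

# Hypothetical tree: id -> (upos, head id, deprel, gender); head 0 is the root.
NODES = {
    1: ("DET", 2, "det", "Fem"),
    2: ("NOUN", 0, "root", "Fem"),
    3: ("ADJ", 2, "amod", "Fem"),
    4: ("NOUN", 2, "conj", "Masc"),
    5: ("DET", 4, "det", "Masc"),
    6: ("ADJ", 4, "amod", None),
}


def agreement(nodes, feature_index=3):
    """Map 'PARENT -[deprel]-> CHILD' to (agreeing pairs, percent)."""
    agree, both = Counter(), Counter()
    for upos, head, deprel, gender in nodes.values():
        if head == 0 or gender is None:
            continue  # the root has no parent; skip children without Gender
        parent = nodes[head]
        parent_gender = parent[feature_index]
        if parent_gender is None:
            continue  # only compare pairs where both nodes have Gender
        key = f"{parent[0]} -[{deprel}]-> {upos}"
        both[key] += 1
        if gender == parent_gender:
            agree[key] += 1
    return {key: (agree[key], round(100 * agree[key] / both[key])) for key in both}


relations = agreement(NODES)
```

In the toy tree both determiners agree with their nouns, while the coordinated nouns differ in Gender, which illustrates why relations such as conj show lower agreement than det.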