Treebank Statistics: UD_French-ParisStories: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
11929 tokens (28%) have a non-empty value of Gender
.
1946 types (59%) occur at least once with a non-empty value of Gender
.
1650 lemmas (67%) occur at least once with a non-empty value of Gender
.
The feature is used with 9 part-of-speech tags: NOUN (4289; 10% instances), PRON (3248; 8% instances), DET (2274; 5% instances), VERB (1155; 3% instances), ADJ (870; 2% instances), AUX (38; 0% instances), ADV (33; 0% instances), PROPN (16; 0% instances), NUM (6; 0% instances).
NOUN
4289 NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (3434; 80%).
NOUN
tokens may have the following values of Gender
:
Fem
(1667; 39% of non-emptyGender
): fois, maison, mère, heures, année, chose, vie, peur, ville, heureMasc
(2622; 61% of non-emptyGender
): coup, fait, peu, temps, ans, moment, truc, jour, monde, côtéEMPTY
(140): genre, mode, potes, gens, contre, collègues, enfants, machin, parents, autres
Paradigm truc | Masc | Fem |
---|---|---|
Number=Sing | truc | truc |
Number=Plur | trucs |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (1093) occur only with one value of Gender
.
PRON
3248 PRON tokens (50% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Person=3 (3205; 99%), Number=Sing (3082; 95%).
PRON
tokens may have the following values of Gender
:
Fem
(310; 10% of non-emptyGender
): elle, elles, la, une, personne, auxquelles, certaines, elle-même, lesquellesMasc
(2938; 90% of non-emptyGender
): on, c’, il, ça, lui, ils, ce, le, -ce, toutEMPTY
(3188): je, j’, y, qui, tu, me, moi, s’, se, nous
Paradigm lui | Masc | Fem |
---|---|---|
ExtPos=ADP|Person=3 | il | |
ExtPos=VERB|Person=3 | il | |
Person=3 | il, lui, le, elle, l' | elle, la |
le |
DET
2274 DET tokens (66% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (2262; 99%), Number[psor]=EMPTY (2158; 95%), Person[psor]=EMPTY (2158; 95%), Poss=EMPTY (2158; 95%), PronType=Art (2043; 90%), Definite=Def (1358; 60%).
DET
tokens may have the following values of Gender
:
Fem
(870; 38% of non-emptyGender
): la, une, ma, cette, sa, ta, aucune, quelle, certaines, touteMasc
(1404; 62% of non-emptyGender
): le, un, ce, du, cet, des, les, l’, aucun, quelquesEMPTY
(1194): les, l’, des, mon, mes, son, ses, nos, notre, quelque
Paradigm le | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing | le, l' | la, l' |
Definite=Def|Number=Plur | les | |
Definite=Ind|Number=Sing | le |
VERB
1155 VERB tokens (26% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: VerbForm=Part (1104; 96%), Mood=EMPTY (1101; 95%), Tense=Past (1099; 95%), Person=EMPTY (1097; 95%), Number=Sing (1084; 94%).
VERB
tokens may have the following values of Gender
:
Fem
(208; 18% of non-emptyGender
): allée, rencontrée, vue, arrivée, partie, venue, accompagnée, rentrée, mise, devenueMasc
(947; 82% of non-emptyGender
): fait, dit, eu, vu, passé, pris, allé, parlé, commencé, rencontréEMPTY
(3262): avait, a, est, sais, voilà, faire, était, dit, va, aller
Paradigm avoir | Masc | Fem |
---|---|---|
eu | eue |
ADJ
870 ADJ tokens (72% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (663; 76%).
ADJ
tokens may have the following values of Gender
:
Fem
(345; 40% of non-emptyGender
): première, petite, bonne, toute, seule, toutes, grande, petites, autre, contenteMasc
(525; 60% of non-emptyGender
): tout, petit, tous, gros, vrai, mignon, petits, beau, bizarre, sympaEMPTY
(344): tout, petit, même, tous, premier, autre, horrible, petite, contente, sympa
Paradigm tout | Masc | Fem |
---|---|---|
Number=Sing | tout | toute |
Number=Sing|PronType=Ind | tout | toute |
Number=Plur | tous | toutes |
Number=Plur|PronType=Ind | tous |
AUX
38 AUX tokens (2% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Number=Sing (38; 100%), VerbForm=Part (38; 100%), Mood=EMPTY (37; 97%), Person=EMPTY (37; 97%), Tense=Past (37; 97%).
AUX
tokens may have the following values of Gender
:
Masc
(38; 100% of non-emptyGender
): été, fait, euEMPTY
(2066): est, était, a, ai, suis, étais, avait, avais, sont, étaient
ADV
33 ADV tokens (1% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: ExtPos=EMPTY (33; 100%).
ADV
tokens may have the following values of Gender
:
Masc
(33; 100% of non-emptyGender
): mal, tout, plus, superEMPTY
(3506): pas, donc, parce, enfin, plus, vraiment, là, très, même, après
PROPN
16 PROPN tokens (4% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(8; 50% of non-emptyGender
): Flora, Caraïbes, GoPro, Latine, TerresMasc
(8; 50% of non-emptyGender
): Anglais, PSG, Chevaliers, MEMPTY
(404): Paris, CROUS, Z, Agen, Ecosse, CP, Sanga, Athis, France, Liège
NUM
6 NUM tokens (3% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: Number=Sing (4; 67%).
NUM
tokens may have the following values of Gender
:
Fem
(1; 17% of non-emptyGender
): uneMasc
(5; 83% of non-emptyGender
): neuf, unEMPTY
(234): deux, trois, six, dix, mille, cinq, quatre, huit, quatorze, sept
Paradigm un | Masc | Fem |
---|---|---|
un | une |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (1965; 66%),
NOUN –[amod]–> ADJ (440; 75%),
ADJ –[nsubj]–> PRON (149; 55%),
NOUN –[nsubj]–> PRON (128; 51%),
DET –[fixed]–> NOUN (79; 96%),
NOUN –[conj]–> NOUN (68; 61%),
PRON –[reparandum]–> PRON (64; 91%),
NOUN –[reparandum]–> NOUN (57; 77%),
DET –[reparandum]–> DET (46; 79%),
ADJ –[obl:mod]–> NOUN (32; 53%).