Treebank Statistics: UD_French-GSD: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
161000 tokens (40%) have a non-empty value of Gender.
23083 types (54%) occur at least once with a non-empty value of Gender.
15909 lemmas (48%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: NOUN (74822; 19% instances), DET (37490; 9% instances), ADJ (23687; 6% instances), VERB (11169; 3% instances), PRON (9552; 2% instances), PROPN (3216; 1% instances), AUX (904; 0% instances), X (85; 0% instances), NUM (61; 0% instances), SYM (14; 0% instances).
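Counts like the ones above could be reproduced from the treebank's CoNLL-U files roughly as follows. This is a minimal sketch, not the script that generated this page: the helper names (`row`, `parse_feats`, `read_tokens`, `gender_stats`) and the five-token sample sentence are illustrative assumptions, and a "type" is assumed here to mean a distinct word form.

```python
from collections import Counter


def row(*cols):
    """Build one tab-separated CoNLL-U token line (helper for the sample only)."""
    return "\t".join(cols)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(lines):
    """Yield (form, lemma, upos, feats) for each regular token line."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "3-4") and empty nodes (e.g. "5.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_stats(tokens):
    """Count tokens, types and lemmas that carry a non-empty Gender value."""
    tokens = list(tokens)
    with_gender = [t for t in tokens if "Gender" in t[3]]
    return {
        "tokens": len(with_gender),
        "token_share": len(with_gender) / len(tokens) if tokens else 0.0,
        "types": len({form for form, _, _, _ in with_gender}),
        "lemmas": len({lemma for _, lemma, _, _ in with_gender}),
        "per_upos": Counter(upos for _, _, upos, _ in with_gender),
    }


# Hypothetical sample sentence, not taken from UD_French-GSD.
SAMPLE = [
    "# text = La ville est grande.",
    row("1", "La", "le", "DET", "_", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", "2", "det", "_", "_"),
    row("2", "ville", "ville", "NOUN", "_", "Gender=Fem|Number=Sing", "4", "nsubj", "_", "_"),
    row("3", "est", "être", "AUX", "_", "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin", "4", "cop", "_", "_"),
    row("4", "grande", "grand", "ADJ", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"),
    row("5", ".", ".", "PUNCT", "_", "_", "4", "punct", "_", "_"),
]

stats = gender_stats(read_tokens(SAMPLE))
print(stats["tokens"], stats["types"], stats["lemmas"], dict(stats["per_upos"]))
```

In practice `read_tokens` would be fed the lines of an opened `.conllu` file instead of `SAMPLE`.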
NOUN
74822 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56333; 75%).
NOUN tokens may have the following values of Gender:
Fem (33169; 44% of non-empty Gender): ville, partie, fois, région, commune, années, famille, année, fin, place
Masc (41653; 56% of non-empty Gender): ans, pays, nom, monde, temps, groupe, siècle, état, cours, lieu
EMPTY (458): enfants, grâce, peu, suite, face, gens, contre, enfant, Place, Abbé
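The percentages in a value list are shares of the tokens with a non-empty Gender, so EMPTY is counted but left out of the denominator. A minimal sketch of that arithmetic, using the NOUN counts reported above as input (the function name `gender_distribution` is an assumption made for illustration):

```python
from collections import Counter


def gender_distribution(genders):
    """genders: iterable of 'Fem', 'Masc' or None (empty) for tokens of one tag."""
    counts = Counter("EMPTY" if g is None else g for g in genders)
    non_empty = sum(n for value, n in counts.items() if value != "EMPTY")
    shares = {}
    for value, n in counts.items():
        # Percentages are relative to tokens that have a Gender value at all.
        if value != "EMPTY" and non_empty:
            shares[value] = round(100 * n / non_empty)
    return counts, shares


# NOUN counts from this page: 33169 Fem, 41653 Masc, 458 EMPTY.
noun_genders = ["Fem"] * 33169 + ["Masc"] * 41653 + [None] * 458
counts, shares = gender_distribution(noun_genders)
print(counts["EMPTY"], shares)
```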
Paradigm partie | Masc | Fem |
---|---|---|
Number=Sing | | partie |
Number=Sing\|Typo=Yes | parti | partie |
Number=Plur | | parties |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (9339) occur with only one value of Gender.
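One way to check a "lexical feature" claim of this kind is to collect the set of Gender values seen per lemma and count how many lemmas have exactly one. The sketch below assumes that reading of the statistic; the function name `single_gender_share` and the toy lemma list are hypothetical.

```python
from collections import defaultdict


def single_gender_share(pairs):
    """pairs: iterable of (lemma, gender) for tokens with a non-empty Gender.

    Returns (lemmas with exactly one Gender value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical toy data: "partie" is seen with both values, the others with one.
pairs = [
    ("ville", "Fem"),
    ("ville", "Fem"),
    ("pays", "Masc"),
    ("partie", "Fem"),
    ("partie", "Masc"),
]
single, total = single_gender_share(pairs)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```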
DET
37490 DET tokens (61% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (37245; 99%), PronType=Art (34196; 91%), Definite=Def (26385; 70%).
DET tokens may have the following values of Gender:
Fem (16597; 44% of non-empty Gender): la, une, sa, cette, ma, aucune, certaines, toute, toutes, différentes
Masc (20893; 56% of non-empty Gender): le, un, ce, cet, du, certains, aucun, tout, différents, divers
EMPTY (23604): les, l’, des, son, ses, leur, de, ces, plusieurs, leurs
Paradigm le | Masc | Fem |
---|---|---|
ExtPos=ADV | le | la |
ExtPos=NOUN | le | |
ExtPos=PRON | le | |
| | le, L' | la |
Typo=Yes | le | la, là |
ADJ
23687 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17035; 72%).
ADJ tokens may have the following values of Gender:
Fem (11076; 47% of non-empty Gender): première, française, grande, même, nouvelle, toutes, nombreuses, nationale, autres, seule
Masc (12611; 53% of non-empty Gender): premier, français, tous, dernier, grand, autres, nouveau, même, nombreux, petit
EMPTY (133): super, vidéo, autre, autres, jeunes, audio, double, marron, possible, quitte
Paradigm premier | Masc | Fem |
---|---|---|
Number=Sing | premier | première |
Number=Sing\|Typo=Yes | premier | |
Number=Plur | premiers | premières |
VERB
11169 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (11169; 100%), Person=EMPTY (11169; 100%), VerbForm=Part (11169; 100%), Tense=Past (11168; 100%), Number=Sing (8853; 79%), Voice=Pass (7701; 69%).
VERB tokens may have the following values of Gender:
Fem (3252; 29% of non-empty Gender): située, née, créée, appelée, utilisée, connue, construite, mise, publiée, nommée
Masc (7917; 71% of non-empty Gender): né, situé, eu, fait, mort, connu, nommé, réalisé, utilisé, mis
EMPTY (20604): a, peut, fait, faire, partir, trouve, devient, doit, ont, permet
Paradigm faire | Masc | Fem |
---|---|---|
ExtPos=ADJ\|Number=Sing\|Voice=Pass | | faite |
Number=Sing\|Typo=Yes\|Voice=Act | fais | |
Number=Sing\|Typo=Yes\|Voice=Pass | fait | |
Number=Sing\|Voice=Act | fait | faite |
Number=Sing\|Voice=Pass | fait | faite |
Number=Plur\|Voice=Act | faits | |
Number=Plur\|Voice=Pass | faits | faites |
PRON
9552 PRON tokens (53% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9451; 99%), Person=3 (9213; 96%), Number=Sing (8165; 85%), Emph=No (6596; 69%), PronType=Prs (6345; 66%).
PRON tokens may have the following values of Gender:
Fem (1720; 18% of non-empty Gender): elle, elles, une, la, celle, laquelle, celles, celle-ci, lesquelles, elle-même
Masc (7832; 82% of non-empty Gender): il, c’, on, ils, lui, ce, le, un, cela, tout
EMPTY (8633): qui, se, s’, y, où, nous, dont, je, en, vous
Paradigm lui | Masc | Fem |
---|---|---|
Emph=No\|ExtPos=ADP | il | |
Emph=No | il, le, lui, -t-il, -il | elle, la, -elle, -t-elle |
Emph=No\|Typo=Yes | il, t-il, -il, -le, t'il | elle |
Emph=Yes | lui | elle |
PROPN
3216 PROPN tokens (12% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3166; 98%).
PROPN tokens may have the following values of Gender:
Fem (1217; 38% of non-empty Gender): France, Russie, Chine, Loire, Grèce, Amérique, Belgique, Europe, Mauritanie, Renaissance
Masc (1999; 62% of non-empty Gender): Maroc, Sahara, Canada, Québec, Japon, Royaume-Uni, Brésil, Mali, Mans, Mexique
EMPTY (24502): France, Paris, États-Unis, Europe, de, Jean, Espagne, York, New, Pierre
Paradigm Afrique | Masc | Fem |
---|---|---|
| | Afrique | Afrique |
Gender seems to be a lexical feature of PROPN. 99% of lemmas (1867) occur with only one value of Gender.
AUX
904 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (904; 100%), Number=Sing (904; 100%), Person=EMPTY (904; 100%), Tense=Past (904; 100%), VerbForm=Part (904; 100%).
AUX tokens may have the following values of Gender:
Fem (1; 0% of non-empty Gender): faite
Masc (903; 100% of non-empty Gender): été, fait, vu
EMPTY (12177): est, a, sont, ont, était, fut, être, avait, avoir, ai
Paradigm faire | Masc | Fem |
---|---|---|
| | fait | faite |
X
85 X tokens (3% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (85; 100%), ExtPos=PROPN (43; 51%).
X tokens may have the following values of Gender:
Fem (22; 26% of non-empty Gender): 3D, BoJ, CEDH, CSL, FW17, Lincoln’s, RN113, SFIO, Scouting, TV
Masc (63; 74% of non-empty Gender): DKK, statu, B, CWA, D.III, DA, FDLP, FPLP, G.I., G8
EMPTY (2848): the, of, de, and, etc., in, a, del, for, Company
Gender seems to be a lexical feature of X. 100% of lemmas (83) occur with only one value of Gender.
NUM
61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (61; 100%).
NUM tokens may have the following values of Gender:
Fem (61; 100% of non-empty Gender): une
EMPTY (10461): deux, trois, 2, 3, 5, quatre, 2010, 4, 20, 2009
SYM
14 SYM tokens (2% of all SYM tokens) have a non-empty value of Gender.
The most frequent other feature values with which SYM and Gender co-occurred: Number=Sing (13; 93%), ExtPos=NOUN (12; 86%).
SYM tokens may have the following values of Gender:
Fem (1; 7% of non-empty Gender): n°
Masc (13; 93% of non-empty Gender): n°, %, CsBi4Te6, M, X, k
EMPTY (703): %, /, €, °, &, +, $, =, k, A
Paradigm n° | Masc | Fem |
---|---|---|
| | n° | n° |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (32358; 61%),
NOUN –[amod]–> ADJ (19287; 99%),
NOUN –[conj]–> NOUN (3285; 63%),
NOUN –[acl]–> VERB (3000; 71%),
PROPN –[det]–> DET (2953; 96%),
VERB –[nsubj:pass]–> NOUN (1834; 81%),
ADJ –[nsubj]–> NOUN (948; 98%),
ADJ –[conj]–> ADJ (897; 96%),
NOUN –[appos]–> NOUN (879; 62%),
VERB –[nsubj:pass]–> PRON (686; 70%).
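Agreement figures of this shape could be computed by walking every dependency edge and comparing the Gender of parent and child. The sketch below is an illustration under stated assumptions, not the method used to build this page: the percentage is assumed to be the share of all edges of a given parent/relation/child pattern in which both nodes carry the same non-empty Gender, and the token dictionaries, the function name `gender_agreement`, and the two sample sentences are hypothetical.

```python
from collections import Counter


def gender_agreement(sentences):
    """Return {(parent_upos, deprel, child_upos): (agreeing edges, share)}."""
    agree = Counter()
    total = Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:  # the root has head 0, which is not a token
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            # Agreement needs a non-empty Gender on both ends of the edge.
            if parent["gender"] is not None and parent["gender"] == child["gender"]:
                agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


def tok(idx, head, upos, deprel, gender=None):
    """Hypothetical minimal token record for the sample below."""
    return {"id": idx, "head": head, "upos": upos, "deprel": deprel, "gender": gender}


# Hypothetical sample: "La ville est grande ." and "les pays".
sentences = [
    [
        tok(1, 2, "DET", "det", "Fem"),
        tok(2, 4, "NOUN", "nsubj", "Fem"),
        tok(3, 4, "AUX", "cop"),
        tok(4, 0, "ADJ", "root", "Fem"),
        tok(5, 4, "PUNCT", "punct"),
    ],
    [
        tok(1, 2, "DET", "det"),  # "les" has an empty Gender
        tok(2, 0, "NOUN", "root", "Masc"),
    ],
]

result = gender_agreement(sentences)
for (parent, rel, child), (n, share) in sorted(result.items()):
    print(f"{parent} -[{rel}]-> {child} ({n}; {share:.0%})")
```

Under this reading, a determiner with an empty Gender (such as les) counts against the agreement share of the det relation, which would be consistent with the comparatively low 61% reported for NOUN –[det]–> DET.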