Treebank Statistics: UD_Latvian-Cairo: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
64 tokens (38%) have a non-empty value of Gender.
56 types (49%) occur at least once with a non-empty value of Gender.
47 lemmas (46%) occur at least once with a non-empty value of Gender.
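Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that "types" means distinct word forms and that every regular token line (punctuation included) counts toward the totals. The sample rows, including the lemmas, are invented for illustration, and the helper names `parse_feats`, `read_tokens` and `coverage` are hypothetical.

```python
# Toy CoNLL-U rows, invented for illustration (not copied from the treebank).
SAMPLE = (
    "1\tMeitene\tmeitene\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tnsubj\t_\t_\n"
    "2\tatver\tatvērt\tVERB\t_\tNumber=Sing|Person=3\t0\troot\t_\t_\n"
    "3\tlogu\tlogs\tNOUN\t_\tCase=Acc|Gender=Masc|Number=Sing\t2\tobj\t_\t_\n"
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means the token has no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, feats) for every regular token line."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], parse_feats(cols[5])


def coverage(text, feature="Gender"):
    """Return (with feature, total) for tokens, distinct forms and lemmas."""
    tokens = list(read_tokens(text))
    marked = [t for t in tokens if feature in t[2]]

    def distinct(index):
        return len({t[index] for t in marked}), len({t[index] for t in tokens})

    return {
        "tokens": (len(marked), len(tokens)),
        "types": distinct(0),
        "lemmas": distinct(1),
    }


for unit, (hits, total) in coverage(SAMPLE).items():
    print(f"{unit}: {hits} of {total} ({round(100 * hits / total)}%)")
```

Whether forms are compared case-sensitively is a design choice; this sketch compares them exactly as written.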
The feature is used with 6 part-of-speech tags: NOUN (26; 15% of instances), PROPN (14; 8% of instances), PRON (10; 6% of instances), DET (9; 5% of instances), ADJ (4; 2% of instances), VERB (1; 1% of instances).
NOUN
26 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (23; 88%).
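A co-occurrence figure such as "Number=Sing (23; 88%)" is a count over the tokens of one tag that carry Gender, expressed as a share of those tokens (23 of 26 here). A rough sketch of that tally follows; the function name `cooccurring_values` and the toy input are hypothetical, and the sketch lists every co-occurring value rather than applying whatever frequency cut-off the page generator uses.

```python
from collections import Counter


def cooccurring_values(tokens, upos, feature="Gender"):
    """List (feature=value, count, percent) for tokens of `upos` that have `feature`."""
    counts = Counter()
    with_feature = 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        with_feature += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    # counts is empty when with_feature is 0, so no division by zero occurs.
    return [
        (pair, n, round(100 * n / with_feature))
        for pair, n in counts.most_common()
    ]


# Invented (UPOS, features) pairs, for illustration only.
toy = [
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Gender": "Masc", "Number": "Plur"}),
    ("VERB", {"Number": "Sing"}),
]
print(cooccurring_values(toy, "NOUN"))
```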
NOUN tokens may have the following values of Gender:
Fem (14; 54% of non-empty Gender): mašīnu, Meitene, bronzu, draudzenei, dzeršanu, galvaspilsētā, istabu, jausmas, krāsā, smēķēšanu
Masc (12; 46% of non-empty Gender): brālis, iemesla, kaimiņi, lietus, logu, matus, sudrabu, tētis, velosipēdu, vīram
Gender seems to be a lexical feature of NOUN. 100% of lemmas (24) occur with only one value of Gender.
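The "lexical feature" observation rests on a simple check: group the tokens of one tag by lemma and see how many lemmas ever appear with more than one Gender value. The sketch below illustrates that check under stated assumptions; `single_value_share` is a hypothetical name, and the input tuples (including the lemmas) are invented rather than drawn from the treebank.

```python
from collections import defaultdict


def single_value_share(tokens, upos, feature="Gender"):
    """Return (lemmas with exactly one value, lemmas with any value) for `upos`."""
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    stable = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return stable, len(values_by_lemma)


# Invented (lemma, UPOS, features) triples, for illustration only.
toy = [
    ("brālis", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("brālis", "NOUN", {"Gender": "Masc", "Number": "Plur"}),
    ("mašīna", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("liels", "ADJ", {"Gender": "Fem"}),
    ("liels", "ADJ", {"Gender": "Masc"}),
]
print(single_value_share(toy, "NOUN"))  # every noun lemma keeps one gender
print(single_value_share(toy, "ADJ"))   # the adjective lemma varies in gender
```

When the first number equals the second, every lemma keeps a single value, which is the pattern reported for NOUN above.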
PROPN
14 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14; 100%).
PROPN tokens may have the following values of Gender:
Fem (7; 50% of non-empty Gender): Marija, Braunu, Džeina, Francijas, Mariju, Parīzē
Masc (7; 50% of non-empty Gender): Pētera, Pēteris, Pēteri, Sem, Smitu
EMPTY (1): Igvasu
PRON
10 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (10; 100%), PronType=Prs (10; 100%), Number=Sing (9; 90%), Case=Nom (6; 60%).
PRON tokens may have the following values of Gender:
Fem (4; 40% of non-empty Gender): viņa, Viņai
Masc (6; 60% of non-empty Gender): viņš, Viņiem, viņa, viņam
EMPTY (7): tu, Es, Man, ko
DET
9 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (9; 100%), Definite=EMPTY (8; 89%), Degree=EMPTY (8; 89%), Person=EMPTY (6; 67%), Poss=EMPTY (6; 67%).
DET tokens may have the following values of Gender:
Fem (2; 22% of non-empty Gender): savai, Šī
Masc (7; 78% of non-empty Gender): to, Mans, kurš, kāda, savam, tavējais
ADJ
4 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (4; 100%), Number=Sing (4; 100%), Case=Nom (3; 75%), Degree=Pos (3; 75%).
ADJ tokens may have the following values of Gender:
Fem (3; 75% of non-empty Gender): liela, maza, sarkanā
Masc (1; 25% of non-empty Gender): foršāks
VERB
1 VERB token (3% of all VERB tokens) has a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Evident=EMPTY (1; 100%), Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Polarity=Pos (1; 100%), Reflex=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%), Voice=Pass (1; 100%).
VERB tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): piegādāta
EMPTY (30): uzrakstīja, Nevarēja, apgriezt, apskāvās, atmest, atnākt, atstāja, atver, centās, domā
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (5; 100%),
NOUN –[amod]–> ADJ (2; 100%),
NOUN –[nmod]–> PROPN (2; 100%),
PROPN –[flat:name]–> PROPN (2; 100%),
ADJ –[advcl]–> DET (1; 100%),
ADJ –[conj]–> ADJ (1; 100%),
ADJ –[nsubj]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod]–> PRON (1; 100%),
NOUN –[orphan]–> NOUN (1; 100%).
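An agreement table like the one above can be approximated by walking each dependency edge and comparing the Gender of parent and child. The sketch below assumes that the percentage is the share of agreeing edges among edges of the same parent tag, relation and child tag where both nodes carry Gender; that reading is an assumption, not something this page states. The function name `agreement_by_relation` and the three-token sentence are hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Gender"):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, comparable edges)."""
    by_id = {tok["id"]: tok for tok in sentence}
    comparable, agreeing = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # head 0 (the root) has no parent token
        if parent is None:
            continue
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue  # only edges where both nodes carry the feature are compared
        key = (parent["upos"], child["deprel"], child["upos"])
        comparable[key] += 1
        if parent_value == child_value:
            agreeing[key] += 1
    return {key: (agreeing[key], comparable[key]) for key in comparable}


# Invented three-token sentence, for illustration only.
toy_sentence = [
    {"id": 1, "upos": "DET", "head": 3, "deprel": "det", "feats": {"Gender": "Fem"}},
    {"id": 2, "upos": "ADJ", "head": 3, "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 3, "upos": "NOUN", "head": 0, "deprel": "root", "feats": {"Gender": "Fem"}},
]
for (parent, rel, child), (agree, total) in agreement_by_relation(toy_sentence).items():
    print(f"{parent} -[{rel}]-> {child} ({agree}; {round(100 * agree / total)}%)")
```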