Treebank Statistics: UD_Romanian-ArT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
132 tokens (23%) have a non-empty value of Gender.
107 types (36%) occur at least once with a non-empty value of Gender.
83 lemmas (37%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (78; 14% instances), PRON (22; 4% instances), DET (14; 2% instances), ADJ (8; 1% instances), NUM (5; 1% instances), VERB (5; 1% instances).
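Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats` and `gender_by_upos` are hypothetical, the toy sentence and its annotation are invented for illustration, and the percentage is assumed to be taken over all tokens (which is consistent with 78 NOUN tokens being reported as 14%).

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count, per UPOS tag, the tokens that carry a non-empty Gender feature.

    Returns (counter of UPOS -> tokens with Gender, total number of tokens).
    """
    with_gender = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[cols[3]] += 1
    return with_gender, total


# Invented toy sentence; the annotation is an assumption, not treebank data.
SAMPLE = "\n".join([
    "# text = toy example",
    "1\tUnă\tun\tDET\t_\tGender=Fem|Number=Sing\t2\tdet\t_\t_",
    "2\tcasă\tcasă\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "3\tmari\tmare\tADJ\t_\tDegree=Pos|Gender=Fem\t2\tamod\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

counts, total = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos} ({n}; {round(100 * n / total)}% instances)")
```

To run it on a real treebank, read the `.conllu` file into a string and pass it to `gender_by_upos`.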
NOUN
78 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (69; 88%), Case=Acc,Nom (57; 73%), Definite=Def (40; 51%).
NOUN tokens may have the following values of Gender:
Fem (41; 53% of non-empty Gender): casă, feata, lamńa, oară, amiroańa, apă, barba, broască, bucăţle, calea
Masc (37; 47% of non-empty Gender): fiĉorlu, ańi, gardu, lucru, ќiro, Araplu, Merlu, Tată, Uvreulu, aluatlu
EMPTY (1): Muşata-Locului
Paradigm mer | Masc | Fem |
---|---|---|
Case=Acc,Nom|Definite=Def|Number=Sing | Merlu | |
Definite=Ind|Number=Plur | meari |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (58) occur with only one value of Gender.
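The "lexical feature" heuristic can be illustrated as follows. This is a minimal sketch under the assumption that a feature counts as lexical for a part of speech when nearly every lemma occurs with a single value; the function name `single_value_share` and the placeholder lemmas are hypothetical, and the threshold used by the page generator is not stated here.

```python
from collections import defaultdict


def single_value_share(observations):
    """Count lemmas seen with exactly one Gender value.

    observations: iterable of (lemma, gender) pairs, one per token with a
    non-empty Gender. Returns (lemmas with one value, all lemmas).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Placeholder observations; these are not treebank data.
OBS = [
    ("lemma_a", "Fem"),
    ("lemma_a", "Fem"),
    ("lemma_b", "Masc"),
    ("lemma_c", "Masc"),
    ("lemma_c", "Fem"),  # the only lemma seen with both values
]

single, total = single_value_share(OBS)
print(f"{round(100 * single / total)}% of lemmas ({single}) occur with only one value")
```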
PRON
22 PRON tokens (31% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (22; 100%), Reflex=EMPTY (22; 100%), Number=Sing (19; 86%), PronType=Prs (18; 82%), Variant=EMPTY (15; 68%).
PRON tokens may have the following values of Gender:
Fem (10; 45% of non-empty Gender): u, nâsă, -o, O, aestă, li
Masc (12; 55% of non-empty Gender): lu, vârnu, -l, Nâs, Nâşi, aestu, lo-, nâsă, năs, ĺ-
EMPTY (49): s-, mi, si, ti, tine, ţi, -ĺi, cari, io, se-
Paradigm el | Masc | Fem |
---|---|---|
Case=Acc|Number=Sing|Strength=Strong | u | |
Case=Acc|Number=Sing|Strength=Weak | u, O | |
Case=Acc|Number=Sing|Strength=Weak|Variant=Short | lu, -l | -o |
Case=Acc|Number=Plur|Strength=Weak|Variant=Short | lo- | li |
Case=Dat|Number=Sing|Strength=Weak|Variant=Short | ĺ- |
DET
14 DET tokens (93% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (13; 93%), Case=Acc,Nom (12; 86%), Position=EMPTY (12; 86%), Number=Sing (11; 79%), PronType=Ind (11; 79%), Person=EMPTY (8; 57%).
DET tokens may have the following values of Gender:
Fem (7; 50% of non-empty Gender): nă, Ună, alti, câte, ndoauă
Masc (7; 50% of non-empty Gender): un, -su, Aestu, multu
EMPTY (1): a
Paradigm un | Masc | Fem |
---|---|---|
un | nă, Ună |
ADJ
8 ADJ tokens (89% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (8; 100%), Definite=Ind (7; 88%), Number=Sing (7; 88%), Case=Acc,Nom (6; 75%).
ADJ tokens may have the following values of Gender:
Fem (4; 50% of non-empty Gender): albă, greauă, mari, uscat
Masc (4; 50% of non-empty Gender): aleptu, bun, mărli, sănătos
EMPTY (1): Ahtare
Paradigm mare | Masc | Fem |
---|---|---|
Definite=Def|Number=Plur | mărli | |
Definite=Ind|Number=Sing | mari |
NUM
5 NUM tokens (83% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc,Nom (5; 100%), NumForm=Word (5; 100%), NumType=Card (5; 100%), Definite=Def (4; 80%), Number=Sing (3; 60%).
NUM tokens may have the following values of Gender:
Fem (3; 60% of non-empty Gender): ună, nă
Masc (2; 40% of non-empty Gender): doľi, treiľi
EMPTY (1): unsprăyinģiţĺi
VERB
5 VERB tokens (4% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (5; 100%), Number=Sing (5; 100%), Person=EMPTY (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Part (5; 100%).
VERB tokens may have the following values of Gender:
Fem (1; 20% of non-empty Gender): ascăpată
Masc (4; 80% of non-empty Gender): faptă, minduit, niapirită, nidată
EMPTY (110): adară, ascapă, ascăpă, aştipta, facă, feaţi, fudzirâ, imna, lucra, mutreaşte
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (13; 93%),
NOUN –[amod]–> ADJ (4; 67%),
ADJ –[nsubj]–> NOUN (1; 100%),
DET –[fixed]–> NOUN (1; 100%),
NOUN –[amod]–> NUM (1; 100%).
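Agreement figures like those listed above can be derived by comparing each node with its head. The sketch below is illustrative only: `gender_agreement` is a hypothetical helper, the toy tokens are invented, and it assumes the percentage is taken over parent-child pairs in which both nodes have a non-empty Gender, which this page does not state explicitly.

```python
from collections import Counter


def gender_agreement(tokens):
    """Count parent-child pairs that agree in Gender, per relation type.

    tokens: list of dicts with keys id, head, upos, deprel, gender
    (gender is None when the feature is empty).
    Returns (agree, both): Counters keyed by (parent UPOS, deprel, child UPOS).
    'both' counts pairs where parent and child each have a Gender value.
    """
    by_id = {t["id"]: t for t in tokens}
    agree, both = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Invented toy sentence; not treebank data.
TOKENS = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
]

agree, both = gender_agreement(TOKENS)
for (parent, rel, child), n in both.items():
    share = round(100 * agree[(parent, rel, child)] / n)
    print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {share}%)")
```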