Treebank Statistics: UD_Romanian-Nonstandard: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
180626 tokens (32%) have a non-empty value of Gender.
23602 types (74%) occur at least once with a non-empty value of Gender.
10137 lemmas (82%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: NOUN (96782; 17% instances), PRON (27320; 5% instances), PROPN (19968; 3% instances), DET (19701; 3% instances), ADJ (10019; 2% instances), VERB (4616; 1% instances), NUM (2215; 0% instances), AUX (4; 0% instances), ADV (1; 0% instances).
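Counts of this kind can in principle be reproduced with a single pass over the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page. The function names (`parse_feats`, `read_tokens`, `feature_coverage`) and the three-token sample are hypothetical, and the sketch assumes that a "type" is a distinct, case-sensitive word form.

```python
from collections import Counter

# Hypothetical three-token CoNLL-U fragment, written by hand for illustration.
SAMPLE = (
    "1\tdomnul\tdomn\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Gender=Masc|Number=Sing\t2\tnsubj\t_\t_\n"
    "2\tzise\tzice\tVERB\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_\n"
    "3\tțara\tțară\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Gender=Fem|Number=Sing\t2\tobj\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(text, feature="Gender"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    tokens = tokens_with = 0
    types, types_with = set(), set()
    lemmas, lemmas_with = set(), set()
    per_upos = Counter()
    for form, lemma, upos, feats in read_tokens(text):
        tokens += 1
        types.add(form)
        lemmas.add(lemma)
        if feature in feats:
            tokens_with += 1
            types_with.add(form)
            lemmas_with.add(lemma)
            per_upos[upos] += 1
    return {
        "tokens": tokens,
        "tokens_with": tokens_with,
        "types": len(types),
        "types_with": len(types_with),
        "lemmas": len(lemmas),
        "lemmas_with": len(lemmas_with),
        "per_upos": per_upos,
    }


stats = feature_coverage(SAMPLE)
print(stats["tokens_with"], "of", stats["tokens"], "tokens have Gender")
```

Percentages such as "32%" would then be `tokens_with / tokens`. Whether the original statistics normalize case or handle multiword tokens differently is not stated on this page.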
NOUN
96782 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Acc,Nom (87450; 90%), Number=Sing (71135; 74%), Definite=Ind (52239; 54%).
NOUN tokens may have the following values of Gender:
Fem (49597; 51% of non-empty Gender): țara, țară, oaste, lume, pace, parte, casa, credință, vreme, casă
Masc (47185; 49% of non-empty Gender): vodă, domnul, doamne, omul, om, domnului, cuvîntul, oameni, împăratul, turcii
EMPTY (1): neamure
Paradigm domn | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Definite=Def\|Number=Sing | domnul, domnu, domnu-, Domnulu | |
Case=Acc,Nom\|Definite=Def\|Number=Plur | domnii, domnu | |
Case=Acc,Nom\|Definite=Ind\|Number=Sing | domnu, domn | |
Case=Acc,Nom\|Definite=Ind\|Number=Plur | domni, domnu | |
Case=Dat,Gen\|Definite=Def\|Number=Sing | domnului, Domn, Domnul, Domnunlui, Domului | domnii |
Case=Dat,Gen\|Definite=Def\|Number=Plur | domnilor | |
Case=Dat,Gen\|Definite=Ind\|Number=Sing | Domnului | |
Case=Voc\|Definite=Def\|Number=Sing | Doamne | |
Case=Voc\|Definite=Def\|Number=Plur | domnilor | |
Case=Voc\|Definite=Ind\|Number=Sing | doamne | |
Gender seems to be a lexical feature of NOUN. 90% of lemmas (5851) occur with only one value of Gender.
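The "lexical feature" heuristic can be illustrated as follows: group the observed Gender values by lemma and measure how many lemmas show exactly one value. This is a sketch under stated assumptions, not the page's own procedure. The function name `single_value_share` and the toy observations are hypothetical, and the sketch assumes that only lemmas with at least one non-empty Gender value enter the denominator.

```python
from collections import defaultdict


def single_value_share(observations, upos="NOUN"):
    """Return (count, share) of lemmas of `upos` seen with exactly one Gender value.

    `observations` is an iterable of (lemma, upos, gender) tuples, where
    gender is None when the token has no Gender feature.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, gender in observations:
        if tag == upos and gender is not None:
            values_by_lemma[lemma].add(gender)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, single / len(values_by_lemma)


# Hypothetical toy data: one lemma with two values, two lemmas with one value.
observations = [
    ("domn", "NOUN", "Masc"),
    ("domn", "NOUN", "Fem"),
    ("țară", "NOUN", "Fem"),
    ("țară", "NOUN", "Fem"),
    ("om", "NOUN", "Masc"),
    ("el", "PRON", "Masc"),  # ignored: different part of speech
]
count, share = single_value_share(observations)
print(count, round(share, 2))
```

A high share suggests that Gender is fixed per lemma (lexical) rather than varying with inflection.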
PRON
27320 PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (27119; 99%), Number=Sing (18307; 67%), Strength=EMPTY (17542; 64%), PronType=Prs (15734; 58%).
PRON tokens may have the following values of Gender:
Fem (7046; 26% of non-empty Gender): o, aceaia, le, aceasta, carea, toate, aceastea, -o, ei, ea
Masc (20274; 74% of non-empty Gender): lui, el, -l, -i, carele, ei, l-, carii, i-, toți
EMPTY (37308): să, ce, s-, lor, -i, mă, voi, eu, se, cine
Paradigm el | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Number=Sing\|PronType=Prs | el, elu, iel, l, îl, ei, Еl, Lui, Părinte | ea, ia, -o, O, ei |
Case=Acc,Nom\|Number=Plur\|PronType=Prs | ei | |
Case=Acc\|Number=Sing\|PronType=Prs\|Strength=Strong | el, elu, еl, -l, ei, l- | ia, ea, -o, o, ei |
Case=Acc\|Number=Sing\|PronType=Prs\|Strength=Weak | -l, l-, l, îl, i-, lu, el, -i, îlu, îi, li-, Il, o | o, -o, o-, ia, -l, l, li- |
Case=Acc\|Number=Plur\|PronType=Prs\|Strength=Strong | ei, -i, lor, îi | iale, ele, le, eale, ia |
Case=Acc\|Number=Plur\|PronType=Prs\|Strength=Weak | -i, i-, îi, -l, i, îl, ei, le, l, l-, le-, li | le, le-, -le, li, li-, o, -i, -li |
Case=Dat,Gen\|Number=Sing\|PronType=Dem | lui | |
Case=Dat,Gen\|Number=Plur\|PronType=Dem | lui | |
Case=Dat\|Number=Sing\|PronType=Prs\|Strength=Strong | lui, ei, lor | ei |
Case=Gen\|Number=Sing\|PronType=Prs | lui, ei, lor, -i | ei, o, iei |
Case=Gen\|Number=Plur\|PronType=Prs | lor, lui | |
Case=Nom\|Number=Plur\|PronType=Prs | ei, iei, îi, i, -i, I-, ii | ele, le, iale, eale, le- |
PROPN
19968 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (19121; 96%), Case=Acc,Nom (18754; 94%), Definite=Ind (15278; 77%).
PROPN tokens may have the following values of Gender:
Fem (3151; 16% of non-empty Gender): Poartă, Moldova, Muntenească, Evangheliia, Tighine, Cameniță, Leșască, Ungurească, Leşască, Maria
Masc (16817; 84% of non-empty Gender): dumnezău, Hristos, Iisus, Pavel, David, Pătru, Ioan, Mihai-, Duca, Dumitraşco-
EMPTY (167): tîrgu, târgu, greșală, Dunărea, războiu, boer, iarăș, Catargiul, Chipru, Filimon
Paradigm Iisus | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Definite=Def | Iisus | |
Case=Acc,Nom\|Definite=Ind | Iisus, IISUS, Isus | |
Case=Voc\|Definite=Ind | Iisuse | |
Gender seems to be a lexical feature of PROPN. 95% of lemmas (2274) occur with only one value of Gender.
DET
19701 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (19559; 99%), Poss=EMPTY (16545; 84%), Case=Acc,Nom (16233; 82%), Number[psor]=EMPTY (14948; 76%), Number=Sing (14886; 76%).
DET tokens may have the following values of Gender:
Fem (12041; 61% of non-empty Gender): a, o, toată, ta, toate, tot, cea, mea, multe, sa
Masc (7660; 39% of non-empty Gender): un, al, cel, mieu, tău, cei, său, acel, toți, nostru
EMPTY (4157): lui, ce, care, celor, nește, tuturor, niște, nişte, lu, vo
Paradigm -ul | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Definite=Def\|Number=Sing\|PronType=Art | -lea, -le | -a |
Case=Acc,Nom\|Number=Plur\|PronType=Dem | lui | |
Case=Dat,Gen\|Definite=Def\|Number=Sing\|PronType=Art | -lui | |
Case=Dat,Gen\|Number=Sing\|PronType=Ind | lui | |
ADJ
10019 ADJ tokens (86% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10018; 100%), Case=Acc,Nom (9396; 94%), Definite=Ind (9207; 92%), Number=Sing (7302; 73%).
ADJ tokens may have the following values of Gender:
Fem (4400; 44% of non-empty Gender): bună, svînta, svîntă, bune, frumoasă, mare, sfîntă, grea, curată, plină
Masc (5619; 56% of non-empty Gender): bun, svinte, sfînt, datoriu, mic, rău, omenesc, verde, viu, nou
EMPTY (1644): mare, mari, vel, vel-, tare, verde, dulce, rece, dulci, iute
Paradigm mare | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Definite=Def\|Number=Sing | marele, mareli, marili | marea |
Case=Acc,Nom\|Definite=Def\|Number=Plur | marii | |
Case=Acc,Nom\|Definite=Ind\|Number=Sing | mare | mare |
Case=Acc,Nom\|Definite=Ind\|Number=Plur | mari, mare | |
Case=Dat,Gen\|Definite=Def\|Number=Sing | marelui | marei |
Case=Dat,Gen\|Definite=Ind\|Number=Sing | mari | |
Definite=Ind\|Number=Plur | mari | |
VERB
4616 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (4616; 100%), Person=EMPTY (4616; 100%), Tense=EMPTY (4616; 100%), VerbForm=Part (4615; 100%), Polarity=Pos (4331; 94%), Number=Sing (3325; 72%).
VERB tokens may have the following values of Gender:
Fem (1737; 38% of non-empty Gender): scrisă, dată, scrise, făcută, făcute, adevărată, pusă, adevărate, aleasă, ascunsă
Masc (2879; 62% of non-empty Gender): scris, făcut, dat, pus, născut, dus, zis, ales, iubit, legat
EMPTY (70507): zise, făcut, face, da, dat, era, zice, veni, luat, avea
Paradigm zice | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Number=Sing | zisă, zîsă | |
Number=Sing | zis, dzis | |
Number=Plur | zise | |
NUM
2215 NUM tokens (43% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (2084; 94%), Definite=EMPTY (1461; 66%), Case=EMPTY (1432; 65%), NumType=Card (1271; 57%), Number=Sing (1127; 51%).
NUM tokens may have the following values of Gender:
Fem (1172; 53% of non-empty Gender): doao, treia, mii, două, doa, mie, sute, doo, sută, patra
Masc (1043; 47% of non-empty Gender): doi, întîiu, amîndoi, doisprăzeace, întîi, întăiu, dintîiu, un, doilea, dentîiu
EMPTY (2958): trei, 2, 3, cinci, patru, 4, 7, 12, 5, 1
Paradigm doi | Masc | Fem |
---|---|---|
Case=Acc,Nom\|Definite=Def\|Number=Sing\|NumType=Ord | doilea, doile, doiele, doili | doa, doao |
Case=Acc,Nom\|Definite=Ind\|Number=Sing\|NumType=Card | doao, doo, doă, doaă | |
Case=Acc,Nom\|Definite=Ind\|Number=Sing\|NumType=Ord | doo, doao, DOA | |
Case=Acc,Nom\|Definite=Ind\|Number=Plur\|NumType=Card | doao, doo, doauă, doă, da, dao, doua, douo | |
Case=Acc,Nom\|Definite=Ind\|Number=Plur\|NumType=Ord | doo | |
Definite=Ind\|Number=Sing\|NumType=Ord | doile | |
Number=Sing\|NumType=Ord | doilea, doile, doili | doaoa, doa, doua, doaua |
Number=Plur\|NumType=Card | doi | două, doao, doa, doo |
AUX
4 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (4; 100%), Number=Sing (4; 100%), Person=EMPTY (4; 100%), Tense=EMPTY (4; 100%).
AUX tokens may have the following values of Gender:
Masc (4; 100% of non-empty Gender): fost, vrut
EMPTY (31922): au, va, -i, -au, era, am, a, iaste, vor, fi
ADV
1 ADV token (0% of all ADV tokens) has a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (1; 100%), PronType=Int,Rel (1; 100%).
ADV tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): cît
EMPTY (34595): nu, mai, și, cum, n-, cînd, numai, şi, tot, unde
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (13804; 85%),
NOUN –[nmod]–> NOUN (5882; 51%),
NOUN –[amod]–> ADJ (5845; 76%),
NOUN –[conj]–> NOUN (4692; 69%),
PROPN –[nmod]–> NOUN (2857; 95%),
NOUN –[nmod]–> PROPN (2818; 58%),
NOUN –[amod]–> VERB (1171; 95%),
PROPN –[appos]–> NOUN (946; 92%),
PROPN –[nmod]–> PROPN (940; 88%),
PROPN –[conj]–> PROPN (860; 86%).
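Agreement figures like those listed above can be approximated by walking each dependency edge and comparing the Gender of the parent and child. The sketch below is illustrative only. The function name `gender_agreement`, the dictionary-based token layout, and the toy sentences are hypothetical. It also assumes that the percentage is the share of agreeing edges among edges where both nodes carry Gender, which this page does not state explicitly.

```python
from collections import Counter


def gender_agreement(sentences):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, agreement rate).

    Only edges where both parent and child have a non-empty Gender are counted.
    """
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {token["id"]: token for token in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # the root has head 0, which is not a token
            if parent["gender"] is None or child["gender"] is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if parent["gender"] == child["gender"]:
                agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


# Hypothetical toy sentences; the second one contains a deliberate mismatch.
sentences = [
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
        {"id": 3, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    ],
    [
        {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Masc"},
    ],
]
result = gender_agreement(sentences)
for (parent, deprel, child), (count, rate) in result.items():
    print(f"{parent} -[{deprel}]-> {child} ({count}; {rate:.0%})")
```

Sorting the result by the agreeing count and keeping the top ten entries would give a list in the same shape as the one above.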