Treebank Statistics: UD_Romanian-TueCL: Features: Case
This feature is universal.
It occurs with 5 different values: Acc
, Dat
, Gen
, Nom
, Voc
.
Some words have combined values of the feature; 2 combinations have been observed: Acc|Nom
, Dat|Gen
.
1669 tokens (38%) have a non-empty value of Case
.
629 types (40%) occur at least once with a non-empty value of Case
.
444 lemmas (39%) occur at least once with a non-empty value of Case
.
The feature is used with 8 part-of-speech tags: NOUN (517; 12% instances), ADP (474; 11% instances), PRON (386; 9% instances), DET (173; 4% instances), ADJ (108; 2% instances), PROPN (6; 0% instances), ADV (3; 0% instances), NUM (2; 0% instances).
NOUN
517 NOUN tokens (61% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Typo=EMPTY (453; 88%), Number=Sing (418; 81%), Gender=Fem (381; 74%), Definite=Def (300; 58%).
NOUN
tokens may have the following values of Case
:
Acc
(4; 1% of non-emptyCase
): buzele, cauciuc, decolteu, pantaloniAcc,Nom
(477; 92% of non-emptyCase
): femeie, femeia, femeile, fetele, bărbatul, bărbații, fată, iubire, mamă, minteDat,Gen
(28; 5% of non-emptyCase
): bărbaților, femeii, femeilor, Apărării, Ipocritilor, amenintatilor, anului, criminalului, educației, feminismuluiNom
(5; 1% of non-emptyCase
): PUPICI, Felul, sarmaleVoc
(3; 1% of non-emptyCase
): doamne, mamiEMPTY
(327): femei, bărbat, PUPICI, barbat, bărbați, fund, bani, fete, sutien, cazuri
Paradigm femeie | Acc,Nom | Dat,Gen |
---|---|---|
Definite=Def|Number=Sing | femeia | femeii |
Definite=Def|Number=Sing|Typo=Yes | femeii | |
Definite=Def|Number=Plur | femeile | femeilor |
Definite=Ind|Number=Sing | femeie | |
Definite=Ind|Number=Sing|Typo=Yes | femei | |
Number=Sing | femeie |
Case
seems to be lexical feature of NOUN
. 95% lemmas (282) occur only with one value of Case
.
ADP
474 ADP tokens (99% of all ADP
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADP
and Case
co-occurred: AdpType=Prep (474; 100%).
ADP
tokens may have the following values of Case
:
Acc
(472; 100% of non-emptyCase
): de, la, cu, în, pe, in, din, pentru, ca, dupăGen
(2; 0% of non-emptyCase
): asupra, impotrivaEMPTY
(3): ex, in
Case
seems to be lexical feature of ADP
. 100% lemmas (21) occur only with one value of Case
.
PRON
386 PRON tokens (99% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Reflex=EMPTY (345; 89%), Variant=EMPTY (338; 88%), Gender=EMPTY (289; 75%), PronType=Prs (251; 65%), Person=3 (239; 62%), Number=Sing (220; 57%).
PRON
tokens may have the following values of Case
:
Acc
(147; 38% of non-emptyCase
): se, te, tine, mine, mă, o, -o, le, m-, neAcc,Nom
(164; 42% of non-emptyCase
): ce, care, el, asta, ea, nimic, noi, una, unu, voiDat
(57; 15% of non-emptyCase
): îți, îmi, -mi, i-, mi-, isi, îi, își, -ți, iDat,Gen
(1; 0% of non-emptyCase
): luiNom
(17; 4% of non-emptyCase
): eu, tu, iEMPTY
(5): tale, ei, lui, multe
Paradigm eu | Acc,Nom | Nom | Acc | Dat |
---|---|---|---|---|
Abbr=Yes|Number=Sing|Person=1|Strength=Strong | mn | |||
Gender=Masc|Number=Plur|Person=3|Strength=Strong | ei | |||
Number=Sing|Person=1|Strength=Strong | eu | mine | ||
Number=Sing|Person=1|Strength=Weak | mă | îmi, -mi | ||
Number=Sing|Person=1|Strength=Weak|Typo=Yes | ma | imi | ||
Number=Sing|Person=1|Strength=Weak|Typo=Yes|Variant=Short | m | mi | ||
Number=Sing|Person=1|Strength=Weak|Variant=Short | m-, -mă | mi-, -mi | ||
Number=Plur|Person=1|Strength=Strong | noi | |||
Number=Plur|Person=1|Strength=Weak | ne | |||
Number=Plur|Person=1|Strength=Weak|Variant=Short | Ne- | |||
Person=3|Reflex=Yes|Strength=Weak | vă |
DET
173 DET tokens (82% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Number[psor]=EMPTY (161; 93%), Poss=EMPTY (161; 93%), Number=Sing (140; 81%), Position=EMPTY (139; 80%), PronType=Ind (121; 70%), Person=EMPTY (107; 62%), Gender=Fem (106; 61%).
DET
tokens may have the following values of Case
:
Acc
(1; 1% of non-emptyCase
): OAcc,Nom
(161; 93% of non-emptyCase
): o, un, asta, mea, toate, ta, ce, alea, toți, aceastăDat,Gen
(11; 6% of non-emptyCase
): unui, acestui, unei, lui, unorEMPTY
(39): multe, mulți, lor, a, al, ei, lui, niste, tău, Puține
Paradigm un | Acc,Nom | Dat,Gen |
---|---|---|
Gender=Masc|Number=Sing | un | unui |
Gender=Fem|Number=Sing | o | unei |
Gender=Fem|Number=Sing|Typo=Yes | unui | |
Number=Plur | unor |
ADJ
108 ADJ tokens (46% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Definite=Ind (101; 94%), Number=Sing (101; 94%), Gender=Fem (100; 93%), Degree=Pos (98; 91%), Typo=EMPTY (84; 78%).
ADJ
tokens may have the following values of Case
:
Acc
(2; 2% of non-emptyCase
): imens, scurțiAcc,Nom
(97; 90% of non-emptyCase
): frumoasă, frumoasa, bună, urâtă, drăguță, dulce, ieftină, rece, superbă, SuperbaDat,Gen
(5; 5% of non-emptyCase
): ascultătoare, neajutorate, propriilor, sexuale, slabeNom
(3; 3% of non-emptyCase
): DULCIVoc
(1; 1% of non-emptyCase
): frumușicoEMPTY
(127): DULCI, mare, misogini, sexy, FRUMOȘI, atent, așa, existente, feministe, frumoase
Paradigm dulce | Acc,Nom | Nom |
---|---|---|
Degree=Pos|Gender=Fem|Number=Sing | dulce | |
Gender=Masc|Number=Plur | DULCI | |
Gender=Fem|Number=Sing | dulce |
Case
seems to be lexical feature of ADJ
. 97% lemmas (73) occur only with one value of Case
.
PROPN
6 PROPN tokens (8% of all PROPN
tokens) have a non-empty value of Case
.
PROPN
tokens may have the following values of Case
:
Acc,Nom
(4; 67% of non-emptyCase
): Ezada, Maica, Marea, SinnDat,Gen
(1; 17% of non-emptyCase
): EleneiVoc
(1; 17% of non-emptyCase
): DoamneEMPTY
(66): România, Mirela, Vaida, Irinel, Maria, @KlausIohannis, @Utilizator_x3, ALEXANDRA, Africa, Alex
ADV
3 ADV tokens (1% of all ADV
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADV
and Case
co-occurred: Degree=EMPTY (3; 100%), Typo=EMPTY (3; 100%), PronType=Int,Rel (2; 67%).
ADV
tokens may have the following values of Case
:
Acc,Nom
(3; 100% of non-emptyCase
): ce, nimicEMPTY
(260): mai, cum, doar, nici, bine, când, iar, tot, asa, așa
NUM
2 NUM tokens (6% of all NUM
tokens) have a non-empty value of Case
.
NUM
tokens may have the following values of Case
:
Acc,Nom
(2; 100% of non-emptyCase
): amândoi, primaEMPTY
(29): 10, 2, 3, 9, doi, 1, 112, 12, 12000, 2,5
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[det]–> DET (106; 57%),
NOUN –[amod]–> ADJ (45; 56%),
ADP –[fixed]–> ADP (13; 100%),
ADJ –[conj]–> ADJ (8; 80%),
NOUN –[nsubj]–> NOUN (8; 53%),
ADJ –[obj]–> PRON (5; 100%),
ADJ –[obl]–> NOUN (5; 63%),
NOUN –[nsubj]–> PRON (3; 60%),
PRON –[appos]–> NOUN (3; 60%),
PRON –[nsubj]–> NOUN (3; 75%).