Treebank Statistics: UD_Romanian-RRT: Features: Definite
This feature is universal.
It occurs with 2 different values: Def
, Ind
.
69060 tokens (32%) have a non-empty value of Definite
.
22191 types (70%) occur at least once with a non-empty value of Definite
.
10814 lemmas (63%) occur at least once with a non-empty value of Definite
.
The feature is used with 5 part-of-speech tags: NOUN (52851; 24% instances), ADJ (15010; 7% instances), NUM (470; 0% instances), DET (407; 0% instances), PROPN (322; 0% instances).
NOUN
52851 NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NOUN
and Definite
co-occurred: Number=Sing (38593; 73%), Gender=Fem (32519; 62%), Case=Acc,Nom (28805; 55%).
NOUN
tokens may have the following values of Definite
:
Def
(27198; 51% of non-emptyDefinite
): cazul, timpul, statele, Comisia, cadrul, partea, fața, comisiei, anul, articolulInd
(25653; 49% of non-emptyDefinite
): ani, timp, conformitate, loc, membre, mod, acord, parte, b, lucruEMPTY
(1407): art., a., nr., CE, b., mg, lit., alin., ml, CEE
Paradigm an | Ind | Def |
---|---|---|
Case=Acc,Nom|Number=Sing | anul | |
Case=Acc,Nom|Number=Plur | anii | |
Case=Dat,Gen|Number=Sing | anului | |
Case=Dat,Gen|Number=Plur | anilor | |
Number=Sing | an | |
Number=Plur | ani |
ADJ
15010 ADJ tokens (98% of all ADJ
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADJ
and Definite
co-occurred: Degree=Pos (14970; 100%), Number=Sing (9820; 65%), Case=EMPTY (9398; 63%), Gender=Fem (9207; 61%).
ADJ
tokens may have the following values of Definite
:
Def
(894; 6% of non-emptyDefinite
): prezentul, prezenta, prezentului, prezentei, întreaga, următoarele, noul, noua, fosta, principaleleInd
(14116; 94% of non-emptyDefinite
): mare, europene, nou, necesare, europeană, mari, european, mică, naționale, generalEMPTY
(288): asemenea, standard, anume, așa, n., aparte, atare, eficace, roz, e-
Paradigm mare | Ind | Def |
---|---|---|
Case=Acc,Nom|Gender=Masc|Number=Sing | marele | |
Case=Acc,Nom|Gender=Masc|Number=Plur | marii | |
Case=Acc,Nom|Gender=Fem|Number=Sing | marea | |
Case=Acc,Nom|Gender=Fem|Number=Plur | marile | |
Case=Dat,Gen|Gender=Masc|Number=Sing | marelui | |
Case=Dat,Gen|Gender=Fem|Number=Sing | mari | Marii |
Case=Dat,Gen|Number=Plur | marilor | |
Number=Sing | mare | |
Number=Plur | mari |
Definite
seems to be lexical feature of ADJ
. 96% lemmas (3277) occur only with one value of Definite
.
NUM
470 NUM tokens (8% of all NUM
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NUM
and Definite
co-occurred: NumForm=Word (470; 100%), NumType=Ord (336; 71%), Gender=Fem (293; 62%), Number=Sing (277; 59%).
NUM
tokens may have the following values of Definite
:
Def
(311; 66% of non-emptyDefinite
): primul, prima, primele, ultimii, ultimul, primului, ultimele, ultima, primii, întâiaInd
(159; 34% of non-emptyDefinite
): milioane, mii, miliarde, o, sute, prim-, primă, zeci, sută, milionEMPTY
(5079): 1, 2, 3, două, 4, trei, 5, 6, doi, 7
Paradigm prim | Ind | Def |
---|---|---|
Case=Acc,Nom|Gender=Masc|Number=Sing | primul | |
Case=Acc,Nom|Gender=Masc|Number=Plur | primii | |
Case=Acc,Nom|Gender=Fem|Number=Sing | primă | prima |
Case=Acc,Nom|Gender=Fem|Number=Plur | primele | |
Case=Dat,Gen|Gender=Masc|Number=Sing | primului | |
Case=Dat,Gen|Gender=Masc|Number=Plur | primilor | |
Case=Dat,Gen|Gender=Fem|Number=Sing | prime | primei |
Case=Dat,Gen|Gender=Fem|Number=Plur | primelor | |
Gender=Masc|Number=Sing | prim-, prim |
DET
407 DET tokens (3% of all DET
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which DET
and Definite
co-occurred: Person=EMPTY (407; 100%), Position=EMPTY (407; 100%), Poss=EMPTY (407; 100%), PronType=Art (407; 100%), Number=Sing (403; 99%), Case=Dat,Gen (318; 78%), Gender=EMPTY (297; 73%).
DET
tokens may have the following values of Definite
:
Def
(407; 100% of non-emptyDefinite
): lui, -lea, -ul, -a, -ului, -urilor, -ilor, -urileEMPTY
(11618): o, un, a, al, ale, unei, unui, acest, lui, cel
PROPN
322 PROPN tokens (5% of all PROPN
tokens) have a non-empty value of Definite
.
PROPN
tokens may have the following values of Definite
:
Def
(314; 98% of non-emptyDefinite
): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, GermanieiInd
(8; 2% of non-emptyDefinite
): Britanii, Americi, Eladă, Făt-frumos, Iugoslavie, Mediterane, NapoleonEMPTY
(5563): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
Paradigm Iugoslavia | Ind | Def |
---|---|---|
Case=Acc,Nom | Iugoslavie | |
Case=Dat,Gen | Iugoslaviei |
Definite
seems to be lexical feature of PROPN
. 98% lemmas (102) occur only with one value of Definite
.
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite
:
NOUN –[nmod]–> NOUN (9461; 56%),
NOUN –[conj]–> NOUN (3042; 89%),
ADJ –[conj]–> ADJ (713; 99%),
NOUN –[appos]–> NOUN (278; 51%),
ADJ –[conj]–> NOUN (86; 80%),
NOUN –[conj]–> ADJ (32; 73%),
ADJ –[amod]–> ADJ (28; 67%),
ADJ –[advmod]–> ADJ (26; 96%),
NOUN –[acl]–> NOUN (26; 54%),
ADJ –[xcomp]–> NOUN (24; 73%).