This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: Definite

This feature is universal. It occurs with 2 different values: Def, Ind.

69060 tokens (32%) have a non-empty value of Definite. 22191 types (70%) occur at least once with a non-empty value of Definite. 10814 lemmas (63%) occur at least once with a non-empty value of Definite. The feature is used with 5 part-of-speech tags: NOUN (52851; 24% of all tokens), ADJ (15010; 7%), NUM (470; less than 1%), DET (407; less than 1%), PROPN (322; less than 1%).
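Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes only the standard 10-column CoNLL-U layout; the function names and the three-token sample are made up for illustration.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Acc,Nom|Definite=Def' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def definite_by_upos(conllu_text):
    """Return (tokens with Definite per UPOS, all tokens per UPOS)."""
    with_definite, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Definite" in feats:
            with_definite[upos] += 1
    return with_definite, total

# Hypothetical sample, not taken from UD_Romanian-RRT.
SAMPLE = "\n".join([
    "# text = anul mare .",
    "1\tanul\tan\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Number=Sing\t0\troot\t_\t_",
    "2\tmare\tmare\tADJ\t_\tDefinite=Ind|Degree=Pos|Number=Sing\t1\tamod\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

with_definite, total = definite_by_upos(SAMPLE)
for upos, n in with_definite.most_common():
    print(f"{upos}: {n} of {total[upos]} tokens have Definite")
```

Dividing each per-tag count by the total number of tokens, rather than by the tag's own total, gives the share of the whole treebank.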

NOUN

52851 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (38593; 73%), Gender=Fem (32519; 62%), Case=Acc,Nom (28805; 55%).
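A co-occurrence list like this one can be approximated by counting every other Feature=Value pair on tokens that carry Definite. This is a sketch under assumptions: the helper name `cooccurring_values` and the sample rows are hypothetical, and the `EMPTY` pseudo-value that the page reports for absent features is not reproduced.

```python
from collections import Counter

def cooccurring_values(rows, upos, feature="Definite"):
    """Count other Feature=Value pairs on `upos` tokens that carry `feature`.

    `rows` is an iterable of (upos, feats) pairs, where feats is the raw
    CoNLL-U FEATS string. Returns (counter, number of matching tokens).
    """
    prefix = feature + "="
    counts, n_tokens = Counter(), 0
    for tag, feats in rows:
        if tag != upos or feats == "_":
            continue
        pairs = feats.split("|")
        if not any(p.startswith(prefix) for p in pairs):
            continue  # token has no value of the feature of interest
        n_tokens += 1
        counts.update(p for p in pairs if not p.startswith(prefix))
    return counts, n_tokens

# Hypothetical rows, not taken from the treebank.
ROWS = [
    ("NOUN", "Case=Acc,Nom|Definite=Def|Number=Sing"),
    ("NOUN", "Definite=Ind|Number=Sing"),
    ("NOUN", "Definite=Ind|Number=Plur"),
    ("ADJ", "Definite=Ind|Degree=Pos|Number=Sing"),
    ("VERB", "Mood=Ind|Number=Sing"),
]

counts, n = cooccurring_values(ROWS, "NOUN")
for pair, c in counts.most_common():
    print(f"{pair} ({c}; {round(100 * c / n)}%)")
```

Note that a value such as `Acc,Nom` is a single FEATS value containing a comma, so the string is split on `|` only.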

NOUN tokens may have the following values of Definite:

Paradigm an               Ind   Def
Case=Acc,Nom|Number=Sing        anul
Case=Acc,Nom|Number=Plur        anii
Case=Dat,Gen|Number=Sing        anului
Case=Dat,Gen|Number=Plur        anilor
Number=Sing               an
Number=Plur               ani
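A paradigm table of this shape can be built by grouping the forms of one lemma by their remaining features and by their value of Definite. The sketch below assumes tokens are available as (form, lemma, upos, feats) tuples. The function name `paradigm` is hypothetical, and the sample feature strings are simplified to the features shown in the paradigm above.

```python
from collections import defaultdict

def paradigm(tokens, lemma, upos, feature="Definite"):
    """Map each remaining-feature string to {feature value: set of forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, tok_upos, feats in tokens:
        if tok_lemma != lemma or tok_upos != upos or feats == "_":
            continue
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        if feature not in pairs:
            continue
        value = pairs.pop(feature)  # column: Ind or Def
        row = "|".join(f"{k}={v}" for k, v in sorted(pairs.items()))
        table[row][value].add(form)
    return table

# Simplified sample using forms of the lemma "an".
TOKENS = [
    ("anul", "an", "NOUN", "Case=Acc,Nom|Definite=Def|Number=Sing"),
    ("an", "an", "NOUN", "Definite=Ind|Number=Sing"),
    ("anilor", "an", "NOUN", "Case=Dat,Gen|Definite=Def|Number=Plur"),
    ("ani", "an", "NOUN", "Definite=Ind|Number=Plur"),
]

for row, cells in sorted(paradigm(TOKENS, "an", "NOUN").items()):
    ind = ", ".join(sorted(cells.get("Ind", ())))
    dfn = ", ".join(sorted(cells.get("Def", ())))
    print(f"{row}\tInd: {ind}\tDef: {dfn}")
```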

ADJ

15010 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (14970; 100%), Number=Sing (9820; 65%), Case=EMPTY (9398; 63%), Gender=Fem (9207; 61%).

ADJ tokens may have the following values of Definite:

Paradigm mare                         Ind     Def
Case=Acc,Nom|Gender=Masc|Number=Sing          marele
Case=Acc,Nom|Gender=Masc|Number=Plur          marii
Case=Acc,Nom|Gender=Fem|Number=Sing           marea
Case=Acc,Nom|Gender=Fem|Number=Plur           marile
Case=Dat,Gen|Gender=Masc|Number=Sing          marelui
Case=Dat,Gen|Gender=Fem|Number=Sing   mari    Marii
Case=Dat,Gen|Number=Plur                      marilor
Number=Sing                           mare
Number=Plur                           mari

Definite seems to be a lexical feature of ADJ: 96% of lemmas (3277) occur with only one value of Definite.
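The "lexical feature" heuristic can be sketched as the share of lemmas that are only ever seen with a single value of Definite. This is an illustration under assumptions, not the page's actual computation: the denominator is taken to be the lemmas that have a non-empty value at least once, and the function name and sample tokens are hypothetical.

```python
from collections import defaultdict

def single_value_share(tokens, upos, feature="Definite"):
    """Return (lemmas with exactly one value, lemmas with any value) for `upos`.

    `tokens` is an iterable of (lemma, upos, feats) tuples, where feats is
    the raw CoNLL-U FEATS string.
    """
    values = defaultdict(set)
    for lemma, tok_upos, feats in tokens:
        if tok_upos != upos or feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == feature:
                values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical tokens, not taken from the treebank.
TOKENS = [
    ("mare", "ADJ", "Definite=Ind|Degree=Pos|Number=Sing"),
    ("mare", "ADJ", "Case=Acc,Nom|Definite=Def|Degree=Pos|Number=Sing"),
    ("bun", "ADJ", "Definite=Ind|Degree=Pos|Number=Sing"),
    ("nou", "ADJ", "Definite=Ind|Degree=Pos|Number=Plur"),
    ("an", "NOUN", "Definite=Ind|Number=Sing"),
]

single, total = single_value_share(TOKENS, "ADJ")
print(f"{round(100 * single / total)}% of lemmas ({single}) occur with only one value")
```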

NUM

470 NUM tokens (8% of all NUM tokens) have a non-empty value of Definite.

The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (470; 100%), NumType=Ord (336; 71%), Gender=Fem (293; 62%), Number=Sing (277; 59%).

NUM tokens may have the following values of Definite:

Paradigm prim                         Ind          Def
Case=Acc,Nom|Gender=Masc|Number=Sing               primul
Case=Acc,Nom|Gender=Masc|Number=Plur               primii
Case=Acc,Nom|Gender=Fem|Number=Sing   primă        prima
Case=Acc,Nom|Gender=Fem|Number=Plur                primele
Case=Dat,Gen|Gender=Masc|Number=Sing               primului
Case=Dat,Gen|Gender=Masc|Number=Plur               primilor
Case=Dat,Gen|Gender=Fem|Number=Sing   prime        primei
Case=Dat,Gen|Gender=Fem|Number=Plur                primelor
Gender=Masc|Number=Sing               prim-, prim

DET

407 DET tokens (3% of all DET tokens) have a non-empty value of Definite.

The most frequent other feature values with which DET and Definite co-occurred: Person=EMPTY (407; 100%), Position=EMPTY (407; 100%), Poss=EMPTY (407; 100%), PronType=Art (407; 100%), Number=Sing (403; 99%), Case=Dat,Gen (318; 78%), Gender=EMPTY (297; 73%).

DET tokens may have the following values of Definite:

PROPN

322 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Definite.

PROPN tokens may have the following values of Definite:

Paradigm Iugoslavia   Ind          Def
Case=Acc,Nom          Iugoslavie
Case=Dat,Gen                       Iugoslaviei

Definite seems to be a lexical feature of PROPN: 98% of lemmas (102) occur with only one value of Definite.

Relations with Agreement in Definite

The 10 most frequent relations where parent and child nodes agree in Definite: NOUN –[nmod]–> NOUN (9461; 56%), NOUN –[conj]–> NOUN (3042; 89%), ADJ –[conj]–> ADJ (713; 99%), NOUN –[appos]–> NOUN (278; 51%), ADJ –[conj]–> NOUN (86; 80%), NOUN –[conj]–> ADJ (32; 73%), ADJ –[amod]–> ADJ (28; 67%), ADJ –[advmod]–> ADJ (26; 96%), NOUN –[acl]–> NOUN (26; 54%), ADJ –[xcomp]–> NOUN (24; 73%).
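Agreement counts of this kind can be sketched by comparing the Definite value of each node with that of its head. The code below is an illustration under assumptions: the percentage is taken to be the share of agreeing pairs among pairs where both nodes have a non-empty Definite, which the page does not state, and the function name and the sample sentence are hypothetical.

```python
from collections import Counter

def agreement_stats(sentences, feature="Definite"):
    """Count parent/child pairs that both carry `feature`, and how many agree.

    `sentences` is a list of sentences; each sentence is a list of
    (id, upos, feats, head, deprel) tuples with integer id and head.
    Keys of both counters are (parent upos, deprel, child upos).
    """
    prefix = feature + "="
    agree, both = Counter(), Counter()
    for sent in sentences:
        info = {}
        for tok_id, upos, feats, _, _ in sent:
            value = None
            if feats != "_":
                for pair in feats.split("|"):
                    if pair.startswith(prefix):
                        value = pair[len(prefix):]
            info[tok_id] = (upos, value)
        for tok_id, upos, _, head, deprel in sent:
            if head == 0:
                continue  # the root has no parent
            parent_upos, parent_value = info[head]
            child_value = info[tok_id][1]
            if parent_value is None or child_value is None:
                continue  # only pairs where both nodes have the feature
            key = (parent_upos, deprel, upos)
            both[key] += 1
            if parent_value == child_value:
                agree[key] += 1
    return agree, both

# Hypothetical sentence, not taken from the treebank.
SENTENCES = [[
    (1, "NOUN", "Definite=Def|Number=Sing", 0, "root"),
    (2, "NOUN", "Definite=Def|Number=Sing", 1, "nmod"),
    (3, "NOUN", "Definite=Ind|Number=Plur", 1, "nmod"),
    (4, "ADJ", "Definite=Ind|Degree=Pos", 3, "amod"),
    (5, "PUNCT", "_", 1, "punct"),
]]

agree, both = agreement_stats(SENTENCES)
for key, n in agree.most_common(10):
    parent, rel, child = key
    print(f"{parent} --[{rel}]--> {child} ({n}; {round(100 * n / both[key])}%)")
```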