This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-TueCL: POS Tags: NUM

There are 21 NUM lemmas (2%), 22 NUM types (1%) and 31 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 15 in number of tokens.

The 10 most frequent NUM lemmas: 10, 2, doi, 3, 9, 1, 112, 12, 12000, 2.5

The 10 most frequent NUM types: 10, 2, 3, 9, doi, 1, 112, 12, 12000, 2,5

The 10 most frequent ambiguous lemmas: întâi (ADV 1, NUM 1)

The 10 most frequent ambiguous types: întâi (ADV 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.047619 (the average of all parts of speech is 1.367279).
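The NUM ratio can be reproduced from the counts given at the top of this page. The sketch below assumes that the ratio is the number of distinct NUM forms (types) divided by the number of distinct NUM lemmas. The page does not state this definition, but it matches the reported figure. The variable names are illustrative.

```python
# Counts reported on this page for NUM in UD_Romanian-TueCL.
num_types = 22   # distinct word forms tagged NUM
num_lemmas = 21  # distinct lemmas tagged NUM

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
form_lemma_ratio = num_types / num_lemmas

print(f"{form_lemma_ratio:.6f}")  # prints 1.047619
```

This is consistent with the per-lemma figures below: one lemma ("doi") has two forms and the remaining 20 lemmas have one form each, giving 22 forms in total.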

The 1st highest number of forms (2) was observed with the lemma “doi”: doi, doua.

The 2nd highest number of forms (1) was observed with the lemma “1”: 1.

The 3rd highest number of forms (1) was observed with the lemma “10”: 10.

NUM occurs with 8 features: NumType (31; 100% instances), Number (31; 100% instances), NumForm (30; 97% instances), Gender (6; 19% instances), Case (2; 6% instances), Typo (2; 6% instances), Definite (1; 3% instances), PronType (1; 3% instances)

NUM occurs with 12 feature-value pairs: Case=Acc,Nom, Definite=Def, Gender=Fem, Gender=Masc, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Ord, Number=Plur, Number=Sing, PronType=Tot, Typo=Yes

NUM occurs with 7 feature combinations. The most frequent feature combination is Number=Sing|NumForm=Digit|NumType=Card (23 tokens). Examples: 10, 2, 3, 9, 1, 112, 12, 12000, 20, 200

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (27; 87% instances), conj (2; 6% instances), obj (1; 3% instances), obl (1; 3% instances)
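Relation counts of this kind could be tallied from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page. The function name `num_relation_counts` is hypothetical, and the sample sentence is invented for illustration and is not taken from UD_Romanian-TueCL.

```python
from collections import Counter

# Invented CoNLL-U fragment (10 tab-separated columns per token):
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = (
    "# text = Am cumpărat doi câini.\n"
    "1\tAm\tavea\tAUX\t_\t_\t2\taux\t_\t_\n"
    "2\tcumpărat\tcumpăra\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tdoi\tdoi\tNUM\t_\t_\t4\tnummod\t_\t_\n"
    "4\tcâini\tcâine\tNOUN\t_\t_\t2\tobj\t_\t_\n"
    "5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def num_relation_counts(conllu_text: str) -> Counter:
    """Count the DEPREL values of all tokens whose UPOS is NUM."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed lines, multiword ranges, empty nodes
        if cols[3] == "NUM":
            counts[cols[7]] += 1
    return counts


print(num_relation_counts(SAMPLE))
```

Run over the whole treebank, the resulting counter would be expected to reproduce the four relation counts listed above. Percentages follow by dividing each count by the 31 NUM tokens.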

Parents of NUM nodes belong to 6 different parts of speech: NOUN (20; 65% instances), VERB (6; 19% instances), NUM (2; 6% instances), ADJ (1; 3% instances), ADV (1; 3% instances), SYM (1; 3% instances)

17 (55%) NUM nodes are leaves.

12 (39%) NUM nodes have one child.

2 (6%) NUM nodes have two children.

The highest child degree of a NUM node is 2.
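The child-degree figures above can be cross-checked with a few lines of arithmetic. The sketch below uses only the counts reported on this page. The variable names are illustrative.

```python
# Number of NUM nodes by child count, as reported above.
degree_counts = {0: 17, 1: 12, 2: 2}

# Total NUM nodes and total children attached to NUM nodes.
total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())

# Share of NUM nodes at each degree, rounded to whole percent.
shares = {d: round(100 * n / total_nodes) for d, n in degree_counts.items()}

print(total_nodes, total_children, shares)
```

The total of 31 nodes matches the NUM token count, and the 16 children match the sum of the child relation counts given below.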

Children of NUM nodes are attached using 5 different relations: case (8; 50% instances), advmod (2; 13% instances), conj (2; 13% instances), det (2; 13% instances), punct (2; 13% instances)

Children of NUM nodes belong to 5 different parts of speech: ADP (8; 50% instances), DET (3; 19% instances), NUM (2; 13% instances), PUNCT (2; 13% instances), ADV (1; 6% instances)