Treebank Statistics: UD_Romanian-TueCL: POS Tags: NUM
There are 21 NUM lemmas (2%), 22 NUM types (1%) and 31 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 15 in number of tokens.
The 10 most frequent NUM lemmas: 10, 2, doi, 3, 9, 1, 112, 12, 12000, 2.5
The 10 most frequent NUM types: 10, 2, 3, 9, doi, 1, 112, 12, 12000, 2,5
The 10 most frequent ambiguous lemmas: întâi (ADV 1, NUM 1)
The 10 most frequent ambiguous types: întâi (ADV 1, NUM 1)
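Lemma, type and token counts like the ones above can be reproduced from a CoNLL-U file with a few lines of Python. The sketch below is a minimal illustration, not the script that generated this page: the `SAMPLE` fragment, the helper names `read_tokens` and `pos_counts`, and the choice to treat types as case-sensitive word forms are all assumptions.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment (10 tab-separated columns per token).
# It is illustrative only and is not taken from UD_Romanian-TueCL.
SAMPLE = """\
1\tAm\tavea\tAUX\t_\t_\t2\taux\t_\t_
2\tcumpărat\tcumpăra\tVERB\t_\t_\t0\troot\t_\t_
3\t10\t10\tNUM\t_\t_\t4\tnummod\t_\t_
4\tmere\tmăr\tNOUN\t_\t_\t2\tobj\t_\t_
5\tși\tși\tCCONJ\t_\t_\t7\tcc\t_\t_
6\tdouă\tdoi\tNUM\t_\t_\t7\tnummod\t_\t_
7\tpere\tpară\tNOUN\t_\t_\t4\tconj\t_\t_

1\tAu\tavea\tAUX\t_\t_\t2\taux\t_\t_
2\tvenit\tveni\tVERB\t_\t_\t0\troot\t_\t_
3\tdoi\tdoi\tNUM\t_\t_\t2\tnsubj\t_\t_
4\tdin\tdin\tADP\t_\t_\t5\tcase\t_\t_
5\t10\t10\tNUM\t_\t_\t3\tnmod\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for every basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(text, upos):
    """Count lemma and form frequencies for one UPOS tag."""
    lemmas, types = Counter(), Counter()
    for form, lemma, tag in read_tokens(text):
        if tag == upos:
            lemmas[lemma] += 1
            types[form] += 1  # assumption: types are case-sensitive forms
    return lemmas, types


lemmas, types = pos_counts(SAMPLE, "NUM")
n_tokens = sum(types.values())
print(len(lemmas), "lemmas,", len(types), "types,", n_tokens, "tokens")
print("most frequent lemmas:", [lemma for lemma, _ in lemmas.most_common(10)])
```

Ranking NUM against the other tags would amount to running `pos_counts` for each observed tag and sorting the tags by the resulting counts.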
Morphology
The form / lemma ratio of NUM is 1.047619 (the average of all parts of speech is 1.367279).
The 1st highest number of forms (2) was observed with the lemma “doi”: doi, doua.
The 2nd highest number of forms (1) was observed with the lemma “1”: 1.
The 3rd highest number of forms (1) was observed with the lemma “10”: 10.
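The reported ratio is consistent with dividing the 22 NUM types by the 21 NUM lemmas. The following sketch shows one way such a ratio and the per-lemma ranking could be computed. The `pairs` list is a small hand-picked subset built from forms named on this page, so its own ratio differs from the treebank figure, and the variable names are illustrative assumptions.

```python
from collections import defaultdict

# (form, lemma) pairs: a small subset for illustration, not the full treebank.
pairs = [("doi", "doi"), ("doua", "doi"), ("1", "1"), ("10", "10"), ("10", "10")]

# Collect the distinct forms observed for each lemma.
forms_by_lemma = defaultdict(set)
for form, lemma in pairs:
    forms_by_lemma[lemma].add(form)

# Form / lemma ratio: total distinct forms divided by number of lemmas.
n_forms = sum(len(forms) for forms in forms_by_lemma.values())
ratio = n_forms / len(forms_by_lemma)
print(f"subset ratio: {ratio:.6f}")

# Lemmas ranked by number of distinct forms, ties broken by lemma string.
ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))
for lemma, forms in ranked:
    print(lemma, len(forms), sorted(forms))

# Sanity check of the figure reported above: 22 types over 21 lemmas.
reported = round(22 / 21, 6)
print("treebank ratio:", reported)
```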
NUM occurs with 8 features: NumType (31; 100% instances), Number (31; 100% instances), NumForm (30; 97% instances), Gender (6; 19% instances), Case (2; 6% instances), Typo (2; 6% instances), Definite (1; 3% instances), PronType (1; 3% instances)
NUM occurs with 12 feature-value pairs: Case=Acc,Nom, Definite=Def, Gender=Fem, Gender=Masc, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Ord, Number=Plur, Number=Sing, PronType=Tot, Typo=Yes
NUM occurs with 7 feature combinations.
The most frequent feature combination is Number=Sing|NumForm=Digit|NumType=Card (23 tokens).
Examples: 10, 2, 3, 9, 1, 112, 12, 12000, 20, 200
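Feature, feature-value and feature-combination counts of this kind come from the FEATS column, where pairs are separated by `|` and a multi-valued feature such as `Case=Acc,Nom` counts as a single pair. The sketch below is a minimal illustration under stated assumptions: the `feats_column` values are made up for the example, and `parse_feats` is a hypothetical helper, not part of any UD tool.

```python
from collections import Counter

# Hypothetical FEATS strings for a few NUM tokens (illustrative, not real data).
feats_column = [
    "Number=Sing|NumForm=Digit|NumType=Card",
    "Number=Sing|NumForm=Digit|NumType=Card",
    "Case=Acc,Nom|Gender=Fem|Number=Plur|NumForm=Word|NumType=Card",
    "_",  # token without any features
]


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


features, pairs, combos = Counter(), Counter(), Counter()
for raw in feats_column:
    parsed = parse_feats(raw)
    features.update(parsed.keys())
    pairs.update(f"{name}={value}" for name, value in parsed.items())
    # Normalize the combination by sorting feature names case-insensitively,
    # which places Number before NumForm as in the combination shown above.
    ordered = sorted(parsed.items(), key=lambda kv: kv[0].lower())
    combos["|".join(f"{name}={value}" for name, value in ordered) or "_"] += 1

total = len(feats_column)
for name, count in features.most_common():
    print(f"{name} ({count}; {round(100 * count / total)}% instances)")
print("most frequent combination:", combos.most_common(1)[0])
```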
Relations
NUM nodes are attached to their parents using 4 different relations: nummod (27; 87% instances), conj (2; 6% instances), obj (1; 3% instances), obl (1; 3% instances)
Parents of NUM nodes belong to 6 different parts of speech: NOUN (20; 65% instances), VERB (6; 19% instances), NUM (2; 6% instances), ADJ (1; 3% instances), ADV (1; 3% instances), SYM (1; 3% instances)
17 (55%) NUM nodes are leaves.
12 (39%) NUM nodes have one child.
2 (6%) NUM nodes have two children.
The highest child degree of a NUM node is 2.
Children of NUM nodes are attached using 5 different relations: case (8; 50% instances), advmod (2; 13% instances), conj (2; 13% instances), det (2; 13% instances), punct (2; 13% instances)
Children of NUM nodes belong to 5 different parts of speech: ADP (8; 50% instances), DET (3; 19% instances), NUM (2; 13% instances), PUNCT (2; 13% instances), ADV (1; 6% instances)