Treebank Statistics: UD_Tatar-NMCTT: POS Tags: NUM
There are 38 NUM lemmas (4%), 39 NUM types (3%) and 59 NUM tokens (3%).
Out of 14 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 8 in number of tokens.
The 10 most frequent NUM lemmas: бер, миллион, 15, 20, миллиард, 1, 10, 11, 27, 5
The 10 most frequent NUM types: бер, миллион, 15, 20, миллиард, 1, 10, 11, 27, 5
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 1.026316 (the average of all parts of speech is 1.414579).
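The reported ratio is consistent with dividing the 39 NUM types by the 38 NUM lemmas given above. The following is a minimal sketch of that calculation, assuming the ratio is the number of distinct forms per lemma; the function name `form_lemma_ratio` and the toy data are illustrative and are not taken from the tool that generated this page.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct word forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples, e.g. the FORM and
    LEMMA columns of all NUM tokens in a CoNLL-U file.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Toy data modelled on the lemmas listed below: "7" has two forms.
toy = [("7", "7"), ("7нче", "7"), ("1", "1"), ("10", "10")]
ratio = form_lemma_ratio(toy)  # 4 distinct forms over 3 lemmas

# Cross-check of the figure reported on this page: 39 types / 38 lemmas.
reported = round(39 / 38, 6)
```

Under this assumption, a ratio close to 1 means that almost every numeral lemma was seen in only one form.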
The 1st highest number of forms (2) was observed with the lemma “7”: 7, 7нче.
The 2nd highest number of forms (1) was observed with the lemma “1”: 1.
The 3rd highest number of forms (1) was observed with the lemma “10”: 10.
NUM occurs with 3 features: NumType (45; 76% instances), Case (2; 3% instances), Number (1; 2% instances)
NUM occurs with 5 feature-value pairs: Case=Abl, Case=Loc, NumType=Card, NumType=Ord, Number=Sing
NUM occurs with 5 feature combinations.
The most frequent feature combination is NumType=Card (37 tokens).
Examples: бер, миллион, миллиард, 1, 10, 100, 11, 11318, 120, 15
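Feature counts like the ones above can be derived from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` pairs joined by `|` and `_` marks an empty set. The sketch below is illustrative only: the helper names `parse_feats` and `feature_stats` and the toy input are assumptions, not part of the tool that produced these statistics.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def feature_stats(feats_column):
    """Count feature names, feature-value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        parsed = parse_feats(feats)
        names.update(parsed.keys())
        pairs.update(f"{name}={value}" for name, value in parsed.items())
        combos[feats] += 1
    return names, pairs, combos


# Toy FEATS values for four hypothetical NUM tokens.
toy_feats = ["NumType=Card", "NumType=Card", "Case=Loc|NumType=Ord", "_"]
names, pairs, combos = feature_stats(toy_feats)
top_combo = combos.most_common(1)[0]
```

Here `names` corresponds to the per-feature counts, `pairs` to the feature-value pairs, and `combos` to the feature combinations.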
Relations
NUM nodes are attached to their parents using 5 different relations: nummod (30; 51% instances), compound (12; 20% instances), amod (7; 12% instances), flat (5; 8% instances), obl (5; 8% instances)
Parents of NUM nodes belong to 5 different parts of speech: NOUN (30; 51% instances), NUM (16; 27% instances), ADJ (6; 10% instances), VERB (5; 8% instances), SYM (2; 3% instances)
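The percentages in these lists are each count's share of the 59 NUM tokens, rounded to a whole number. As a small sketch, the relation figures can be reproduced as follows; the helper name `share_table` is made up for illustration.

```python
def share_table(counts):
    """Map each label to (count, percentage of the total, rounded)."""
    total = sum(counts.values())
    return {label: (n, round(100 * n / total)) for label, n in counts.items()}


# Relation counts for NUM nodes as listed above (they sum to 59 tokens).
deprel_counts = {"nummod": 30, "compound": 12, "amod": 7, "flat": 5, "obl": 5}
shares = share_table(deprel_counts)
```

Because each share is rounded separately, the percentages in a list do not have to sum to exactly 100; here they add up to 99.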
39 (66%) NUM nodes are leaves.
12 (20%) NUM nodes have one child.
7 (12%) NUM nodes have two children.
1 (2%) NUM nodes have three or more children.
The highest child degree of a NUM node is 3.
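A child-degree distribution of this kind can be computed by counting how many tokens name each node as their head. The sketch below assumes each sentence is available as a list of `(id, head, upos)` tuples taken from the ID, HEAD and UPOS columns of a CoNLL-U file; the function name `child_degree_histogram` and the toy sentence are illustrative.

```python
from collections import Counter


def child_degree_histogram(tokens, upos="NUM"):
    """Return a Counter mapping child degree -> number of `upos` nodes.

    `tokens` is one sentence as a list of (id, head, upos) tuples.
    """
    n_children = Counter(head for _, head, _ in tokens)
    return Counter(n_children[tid] for tid, _, tag in tokens if tag == upos)


# Toy sentence: token 1 (NUM) depends on token 2 (NUM), which depends on
# token 3 (NOUN), the root.
toy_sentence = [(1, 2, "NUM"), (2, 3, "NUM"), (3, 0, "NOUN")]
toy_hist = child_degree_histogram(toy_sentence)

# Consistency check on the figures reported above; "three or more" is
# taken as exactly 3 because the highest child degree is 3.
reported = {0: 39, 1: 12, 2: 7, 3: 1}
total_nodes = sum(reported.values())
total_children = sum(degree * n for degree, n in reported.items())
```

With the reported figures, `total_nodes` comes to 59, the number of NUM tokens, and `total_children` comes to 29, which matches the sum of the child relation counts listed below.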
Children of NUM nodes are attached using 7 different relations: compound (12; 41% instances), flat (6; 21% instances), punct (4; 14% instances), advmod (3; 10% instances), amod (2; 7% instances), nmod (1; 3% instances), parataxis (1; 3% instances)
Children of NUM nodes belong to 6 different parts of speech: NUM (16; 55% instances), NOUN (4; 14% instances), PUNCT (4; 14% instances), ADV (3; 10% instances), ADJ (1; 3% instances), SYM (1; 3% instances)