Treebank Statistics: UD_Komi_Zyrian-IKDP: POS Tags: NUM
There are 30 NUM lemmas (4%), 32 NUM types (3%) and 65 NUM tokens (3%).
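As a rough illustration of how lemma, type and token counts like these can be derived, the sketch below tallies them for one tag from CoNLL-U text (ten tab-separated columns per word line, with FORM, LEMMA and UPOS in columns 2 to 4). The function name `pos_counts` and the toy input are hypothetical and not taken from the treebank; treating a type as a case-sensitive word form is also an assumption.

```python
from collections import Counter


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (IDs like 1-2) and empty nodes (IDs like 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas[lemma] += 1
            types[form] += 1  # assumption: types are case-sensitive forms
    return len(lemmas), len(types), sum(types.values())


# Toy input, invented for illustration only (not real treebank sentences).
rows = [
    ["1", "куим", "куим", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["2", "Куимсэ", "куим", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["3", "кык", "кык", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["4", "кык", "кык", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["5", "word", "word", "NOUN", "_", "_", "0", "root", "_", "_"],
]
sample = "\n".join("\t".join(row) for row in rows)

print(pos_counts(sample, "NUM"))  # (2, 3, 4)
```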
Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 8 in number of tokens.
The 10 most frequent NUM lemmas: куим, нёль, кык, сизим, дас, вит, десятой, кызь, сорок, ӧти
The 10 most frequent NUM types: куим, нёль, кык, сизим, дас, вит, десятой, кызь, сорок, Девять
The 10 most frequent ambiguous lemmas: кык (NUM 7, NOUN 1), мӧд (PRON 4, DET 1, NUM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 1.066667 (the average of all parts of speech is 1.332474).
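The reported ratio is consistent with dividing the 32 NUM types by the 30 NUM lemmas given above. The sketch below reproduces it under that assumption; the helper name `form_lemma_ratio` is hypothetical, and the exact definition used to generate the statistics may differ.

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# Counts taken from the NUM overview: 32 types, 30 lemmas.
ratio = form_lemma_ratio(32, 30)
print(f"{ratio:.6f}")  # 1.066667
```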
The 1st highest number of forms (2) was observed with the lemma “куим”: Куимсэ, куим.
The 2nd highest number of forms (2) was observed with the lemma “ӧти”: Ӧтікес, ӧтік.
The 3rd highest number of forms (1) was observed with the lemma “Девять”: Девять.
NUM occurs with 6 features: NumType (62; 95% instances), Case (53; 82% instances), Number (51; 78% instances), Number[psor] (2; 3% instances), Person[psor] (2; 3% instances), Clitic (1; 2% instances)
NUM occurs with 9 feature-value pairs: Case=Acc, Case=Nom, Clitic=So, NumType=Card, NumType=Card,Ord, NumType=Ord, Number=Sing, Number[psor]=Sing, Person[psor]=1
NUM occurs with 10 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|NumType=Card (44 tokens).
Examples: нёль, куим, кык, сизим, дас, кызь, вит, кызь-вит, три, тысяча
Relations
NUM nodes are attached to their parents using 8 different relations: nummod (53; 82% instances), compound (2; 3% instances), fixed (2; 3% instances), nmod (2; 3% instances), obj (2; 3% instances), root (2; 3% instances), amod (1; 2% instances), conj (1; 2% instances)
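The percentages in this list appear to be each count divided by the 65 NUM tokens, rounded to a whole number. A minimal sketch that checks this reading against the figures above (the variable names are illustrative only):

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 53, "compound": 2, "fixed": 2, "nmod": 2,
    "obj": 2, "root": 2, "amod": 1, "conj": 1,
}

total = sum(relation_counts.values())  # every NUM token has exactly one relation
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                  # 65
print(percentages["nummod"])  # 82
print(percentages["amod"])    # 2
```

Because each share is rounded separately, the rounded percentages need not add up to exactly 100.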
Parents of NUM nodes belong to 5 different parts of speech: NOUN (47; 72% instances), NUM (10; 15% instances), ADJ (4; 6% instances), [no part of speech; the artificial root] (2; 3% instances), VERB (2; 3% instances)
46 (71%) NUM nodes are leaves.
16 (25%) NUM nodes have one child.
2 (3%) NUM nodes have two children.
1 (2%) NUM node has three or more children.
The highest child degree of a NUM node is 3.
Children of NUM nodes are attached using 12 different relations: nummod (6; 26% instances), punct (4; 17% instances), fixed (3; 13% instances), advmod (2; 9% instances), advcl (1; 4% instances), advmod:tmod (1; 4% instances), case (1; 4% instances), cc (1; 4% instances), compound (1; 4% instances), conj (1; 4% instances), orphan (1; 4% instances), reparandum (1; 4% instances)
Children of NUM nodes belong to 6 different parts of speech: NUM (10; 43% instances), ADV (4; 17% instances), PUNCT (4; 17% instances), ADJ (3; 13% instances), ADP (1; 4% instances), CCONJ (1; 4% instances)
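The child-degree figures and the two child lists can be cross-checked against each other. The sketch below assumes that the single node in the "three or more children" bucket has exactly three children, which follows from the highest child degree being 3; all variable names are illustrative.

```python
# Number of NUM nodes by child count (0 = leaf), from the figures above.
degree_histogram = {0: 46, 1: 16, 2: 2, 3: 1}

# Counts from the child relation list (12 relations) and child POS list (6 tags).
child_relation_counts = [6, 4, 3, 2, 1, 1, 1, 1, 1, 1, 1, 1]
child_pos_counts = [10, 4, 4, 3, 1, 1]

total_nodes = sum(degree_histogram.values())
total_children = sum(degree * n for degree, n in degree_histogram.items())

print(total_nodes)     # 65
print(total_children)  # 23
print(total_children == sum(child_relation_counts) == sum(child_pos_counts))  # True
```

All three totals agree at 23 children, which is also the base of the percentages in the two child lists (for example, 10 of 23 is about 43%).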