
This page pertains to UD version 2.

Treebank Statistics: UD_Moksha-JR: POS Tags: NUM

There are 12 NUM lemmas (1%), 15 NUM types (1%) and 37 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 12 in number of tokens.
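Counts of this kind can be derived from the LEMMA, FORM and UPOS columns of a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample data is a toy fragment rather than text from UD_Moksha-JR (`w1`, `l1` and similar are placeholders), the helper names `read_words`, `pos_counts` and `rank` are hypothetical, and it assumes that types are compared case-sensitively, as the capitalized entries in the type list suggest.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Moksha-JR).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tкафта\tкафта\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_",
    "2\tw1\tl1\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\tКафттне\tкафта\tNUM\t_\tNumType=Card\t2\tnsubj\t_\t_",
    "2\tw2\tl2\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tфкя\tфкя\tNUM\t_\tNumType=Card\t4\tnummod\t_\t_",
    "4\tw1\tl1\tNOUN\t_\t_\t2\tobj\t_\t_",
    "",
])


def read_words(text):
    """Yield (form, lemma, upos) for every syntactic word.

    Comment lines, multiword-token ranges (IDs like 1-2) and empty
    nodes (IDs like 1.1) are skipped.
    """
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(text):
    """Map each UPOS tag to (distinct lemmas, distinct forms, tokens)."""
    lemmas, types, tokens = {}, {}, Counter()
    for form, lemma, upos in read_words(text):
        lemmas.setdefault(upos, set()).add(lemma)
        types.setdefault(upos, set()).add(form)  # case-sensitive (assumption)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, index):
    """1-based rank of `upos` by lemmas (0), types (1) or tokens (2)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1


counts = pos_counts(SAMPLE)
print(counts["NUM"], rank(counts, "NUM", 2))
```

Ties between tags are broken by first appearance in this sketch; the page does not say how its ranks resolve ties.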

The 10 most frequent NUM lemmas: кафта, фкя, колма, вете, кемонь, 225, 7, кафонц, кеветие, комсь

The 10 most frequent NUM types: кафта, фкя, колма, вете, 225, 7, Кафттне, Кемоньшка, кафонц, кеветиешка

The 10 most frequent ambiguous lemmas: вете (NUM 2, ADJ 1)

The 10 most frequent ambiguous types: none observed.

Morphology

The form / lemma ratio of NUM is 1.250000 (the average of all parts of speech is 1.550071).

The 1st highest number of forms (2) was observed with the lemma “кафта”: Кафттне, кафта.

The 2nd highest number of forms (2) was observed with the lemma “кемонь”: Кемоньшка, кемотть.

The 3rd highest number of forms (2) was observed with the lemma “колма”: колма, колмоцьке.
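The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas: 15 / 12 = 1.25. The sketch below groups the form and lemma pairs listed above and computes the same kind of ratio on that subset. The function names are hypothetical, and the alphabetical tie-break is an assumption, since the page does not state how lemmas with equal form counts are ordered.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct word forms under their lemma."""
    table = defaultdict(set)
    for form, lemma in pairs:
        table[lemma].add(form)
    return table


def form_lemma_ratio(table):
    """Average number of distinct forms per lemma."""
    return sum(len(forms) for forms in table.values()) / len(table)


# (form, lemma) pairs for the three lemmas listed on this page.
pairs = [
    ("Кафттне", "кафта"), ("кафта", "кафта"),
    ("Кемоньшка", "кемонь"), ("кемотть", "кемонь"),
    ("колма", "колма"), ("колмоцьке", "колма"),
]
table = forms_per_lemma(pairs)

# Lemmas ordered by number of forms, ties broken alphabetically (assumption).
ranked = sorted(table, key=lambda lemma: (-len(table[lemma]), lemma))

subset_ratio = form_lemma_ratio(table)  # 6 forms / 3 lemmas on this subset
overall_ratio = 15 / 12                 # types / lemmas reported for NUM
print(ranked, subset_ratio, overall_ratio)
```

If the remaining 9 lemmas each have a single form, the totals agree with the page: 3 × 2 + 9 × 1 = 15 forms over 12 lemmas.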

NUM occurs with 5 features: NumType (34; 92% instances), Case (33; 89% instances), Number (33; 89% instances), Definite (29; 78% instances), NumForm (2; 5% instances)

NUM occurs with 11 feature-value pairs: Case=Cmp, Case=Nom, Definite=Def, Definite=Ind, NumForm=Digit, NumType=Card, NumType=Coll, NumType=Sets, Number=Plur, Number=Plur,Sing, Number=Sing

NUM occurs with 9 feature combinations. The most frequent feature combination is Case=Nom|Definite=Ind|Number=Sing|NumType=Card (25 tokens). Examples: кафта, фкя, колма, вете, комсь, ниле
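Feature statistics like these come from the FEATS column, where pairs are joined with `|` and a name is separated from its value by `=`. A minimal sketch follows, assuming a list of FEATS strings for NUM tokens has already been extracted. The input list is a toy example rather than treebank data, and `parse_feats` and `feature_stats` are hypothetical names. A multi-valued feature such as `Number=Plur,Sing` is kept as a single pair, matching how it is listed above. Whether the page counts the empty value `_` as a feature combination is not stated, so the sketch simply tallies it like any other string.

```python
from collections import Counter


def parse_feats(feats):
    """Turn 'Case=Nom|Number=Sing' into {'Case': 'Nom', 'Number': 'Sing'}."""
    if feats == "_":  # CoNLL-U marks "no features" with an underscore
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1  # comma-joined values stay together
    return features, pairs, combos


# Toy FEATS values (hypothetical distribution, not the treebank's).
toy = (
    ["Case=Nom|Definite=Ind|Number=Sing|NumType=Card"] * 3
    + ["NumForm=Digit|NumType=Card",
       "Case=Nom|Definite=Def|Number=Plur,Sing|NumType=Coll",
       "_"]
)
features, pairs, combos = feature_stats(toy)
print(features.most_common(), combos.most_common(1))
```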

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (34; 92% instances), conj (1; 3% instances), nmod (1; 3% instances), nsubj:cop (1; 3% instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (35; 95% instances), ADJ (1; 3% instances), VERB (1; 3% instances)
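The relation and parent statistics can be reproduced by looking at each NUM word's DEPREL and at the UPOS of the word its HEAD points to. The sketch below assumes sentences have already been parsed into `(id, upos, head, deprel)` tuples. The toy sentences and the names `attachment_stats` and `share` are hypothetical, and the percentages are rounded to whole numbers, which appears to match the figures above (34 of 37 gives 92%).

```python
from collections import Counter

# Toy sentences (hypothetical): each word is (id, upos, head, deprel).
TOY = [
    [(1, "NUM", 2, "nummod"), (2, "NOUN", 0, "root")],
    [(1, "NUM", 2, "nummod"), (2, "NOUN", 3, "nsubj"),
     (3, "VERB", 0, "root"), (4, "NUM", 3, "obj")],
]


def attachment_stats(sentences, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parents = Counter(), Counter()
    for sent in sentences:
        by_id = {word[0]: word for word in sent}
        for _wid, pos, head, deprel in sent:
            if pos != upos:
                continue
            relations[deprel] += 1
            if head != 0:  # head 0 is the artificial root, which has no UPOS
                parents[by_id[head][1]] += 1
    return relations, parents


def share(counter):
    """Map each key to (count, percentage rounded to a whole number)."""
    total = sum(counter.values())
    return {key: (n, round(100 * n / total)) for key, n in counter.most_common()}


relations, parents = attachment_stats(TOY, "NUM")
print(share(relations), share(parents))
```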

36 (97%) NUM nodes are leaves.

0 (0%) NUM nodes have one child.

1 (3%) NUM nodes have two children.

The highest child degree of a NUM node is 2.

Children of NUM nodes are attached using 2 different relations: advmod (1; 50% instances), cc (1; 50% instances)

Children of NUM nodes belong to 2 different parts of speech: ADV (1; 50% instances), CCONJ (1; 50% instances)
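The child statistics above can be sketched the same way, by inverting the HEAD column so that each node knows its dependents. The example assumes sentences given as `(id, upos, head, deprel)` tuples. The toy data and the name `child_stats` are hypothetical; the second toy sentence simply mirrors the pattern reported above, a NUM node with an `advmod` and a `cc` child.

```python
from collections import Counter, defaultdict

# Toy sentences (hypothetical): each word is (id, upos, head, deprel).
TOY = [
    [(1, "NUM", 2, "nummod"), (2, "NOUN", 0, "root")],
    [(1, "NOUN", 0, "root"), (2, "ADV", 4, "advmod"),
     (3, "CCONJ", 4, "cc"), (4, "NUM", 1, "conj")],
]


def child_stats(sentences, upos):
    """Return (degree histogram, child relations, child UPOS) for `upos` nodes."""
    degrees, child_rels, child_pos = Counter(), Counter(), Counter()
    for sent in sentences:
        children = defaultdict(list)
        for word in sent:
            children[word[2]].append(word)  # index every word under its head
        for wid, pos, _head, _deprel in sent:
            if pos != upos:
                continue
            kids = children.get(wid, [])
            degrees[len(kids)] += 1         # 0 children means the node is a leaf
            for _kid_id, kid_pos, _kid_head, kid_rel in kids:
                child_rels[kid_rel] += 1
                child_pos[kid_pos] += 1
    return degrees, child_rels, child_pos


degrees, child_rels, child_pos = child_stats(TOY, "NUM")
print(dict(degrees), max(degrees), dict(child_rels), dict(child_pos))
```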