home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kangri-KDTB: POS Tags: NUM

There are 27 NUM lemmas (3%), 27 NUM types (2%) and 41 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: इक्क, दो, दोयो, 8, इक, चौंह, 15, 19, 25, 250

The 10 most frequent NUM types: इक्क, दो, दोयो, 8, इक, चौंह, 15, 19, 25, 250

The 10 most frequent ambiguous lemmas: दोयो (NUM 3, DET 1)

The 10 most frequent ambiguous types: दोयो (NUM 3, DET 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.115727).

The 1st highest number of forms (1) was observed with the lemma “15”: 15.

The 2nd highest number of forms (1) was observed with the lemma “19”: 19.

The 3rd highest number of forms (1) was observed with the lemma “25”: 25.

NUM occurs with 5 features: NumType (22; 54% instances), Case (18; 44% instances), Number (17; 41% instances), Person (17; 41% instances), Gender (15; 37% instances)

NUM occurs with 6 feature-value pairs: Case=Nom, Gender=Fem, Gender=Masc, NumType=Card, Number=Sing, Person=3

NUM occurs with 6 feature combinations. The most frequent feature combination is NumType=Card (22 tokens). Examples: दो, चौंह, दोयो, 19, 25, 250, 300, 35000, 40, 5

Relations

NUM nodes are attached to their parents using 6 different relations: nummod (31; 76% instances), nmod (4; 10% instances), compound (3; 7% instances), conj (1; 2% instances), iobj (1; 2% instances), nsubj (1; 2% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (35; 85% instances), NUM (2; 5% instances), VERB (2; 5% instances), ADJ (1; 2% instances), PROPN (1; 2% instances)

33 (80%) NUM nodes are leaves.

8 (20%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 6 different relations: dep (3; 38% instances), case (1; 13% instances), cc (1; 13% instances), compound (1; 13% instances), conj (1; 13% instances), det (1; 13% instances)

Children of NUM nodes belong to 5 different parts of speech: PART (3; 38% instances), NUM (2; 25% instances), ADP (1; 13% instances), CCONJ (1; 13% instances), DET (1; 13% instances)