Treebank Statistics: UD_Sinhala-STB: POS Tags: NUM
There are 4 NUM
lemmas (1%), 4 NUM
types (1%) and 4 NUM
tokens (0%).
Out of 13 observed tags, the rank of NUM
is: 11 in number of lemmas, 11 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: 1990, දෙවන, පළමු, හතර
The 10 most frequent NUM
types: 1990, දෙවැන්න, පළමු, හතර
The 10 most frequent ambiguous lemmas: පළමු (ADJ 1, NUM 1)
The 10 most frequent ambiguous types: දෙවැන්න (NOUN 1, NUM 1)
- දෙවැන්න
Morphology
The form / lemma ratio of NUM
is 1.000000 (the average of all parts of speech is 1.145336).
The 1st highest number of forms (1) was observed with the lemma “1990”: 1990.
The 2nd highest number of forms (1) was observed with the lemma “දෙවන”: දෙවැන්න.
The 3rd highest number of forms (1) was observed with the lemma “පළමු”: පළමු.
NUM
occurs with 1 features: NumType (4; 100% instances)
NUM
occurs with 2 feature-value pairs: NumType=Card
, NumType=Ord
NUM
occurs with 2 feature combinations.
The most frequent feature combination is NumType=Ord
(2 tokens).
Examples: දෙවැන්න, පළමු
Relations
NUM
nodes are attached to their parents using 4 different relations: amod (1; 25% instances), nsubj (1; 25% instances), nummod (1; 25% instances), obl (1; 25% instances)
Parents of NUM
nodes belong to 2 different parts of speech: NOUN (3; 75% instances), VERB (1; 25% instances)
3 (75%) NUM
nodes are leaves.
1 (25%) NUM
nodes have one child.
The highest child degree of a NUM
node is 1.
Children of NUM
nodes are attached using 1 different relations: dep (1; 100% instances)
Children of NUM
nodes belong to 1 different parts of speech: PART (1; 100% instances)