Treebank Statistics: UD_Malayalam-UFAL: POS Tags: NUM
There are 38 NUM
lemmas (3%), 39 NUM
types (2%) and 42 NUM
tokens (2%).
Out of 16 observed tags, the rank of NUM
is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.
The 10 most frequent NUM
lemmas: മൂന്ന്, 1, രണ്ട്, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2
The 10 most frequent NUM
types: 1, മൂന്ന്, രണ്ട്, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 1.026316 (the average of all parts of speech is 1.111893).
The 1st highest number of forms (2) was observed with the lemma “മൂന്ന്”: മൂന്നിന്, മൂന്ന്.
The 2nd highest number of forms (1) was observed with the lemma “0,7-0,8”: 0,7-0,8.
The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
NUM
occurs with 3 features: NumForm (42; 100% instances), NumType (42; 100% instances), Case (18; 43% instances)
NUM
occurs with 6 feature-value pairs: Case=Dat
, Case=Nom
, NumForm=Digit
, NumForm=Word
, NumType=Card
, NumType=Frac
NUM
occurs with 4 feature combinations.
The most frequent feature combination is NumForm=Digit|NumType=Card
(24 tokens).
Examples: 1, 0,7-0,8, 12, 15.56, 156, 16, 18, 19.2, 1950, 2004
Relations
NUM
nodes are attached to their parents using 5 different relations: nummod (31; 74% instances), flat (4; 10% instances), obl (4; 10% instances), root (2; 5% instances), acl:relcl (1; 2% instances)
Parents of NUM
nodes belong to 7 different parts of speech: NOUN (31; 74% instances), PROPN (3; 7% instances), NUM (2; 5% instances), (2; 5% instances), VERB (2; 5% instances), ADJ (1; 2% instances), ADV (1; 2% instances)
32 (76%) NUM
nodes are leaves.
6 (14%) NUM
nodes have one child.
3 (7%) NUM
nodes have two children.
1 (2%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 4.
Children of NUM
nodes are attached using 8 different relations: case (5; 31% instances), amod (2; 13% instances), flat (2; 13% instances), nsubj (2; 13% instances), punct (2; 13% instances), cop (1; 6% instances), mark (1; 6% instances), nummod (1; 6% instances)
Children of NUM
nodes belong to 7 different parts of speech: ADP (5; 31% instances), NOUN (3; 19% instances), ADJ (2; 13% instances), NUM (2; 13% instances), PUNCT (2; 13% instances), AUX (1; 6% instances), PART (1; 6% instances)