Treebank Statistics: UD_Guajajara-TuDeT: POS Tags: NUM
There are 5 NUM
lemmas (1%), 5 NUM
types (0%) and 9 NUM
tokens (0%).
Out of 15 observed tags, the rank of NUM
is: 12 in number of lemmas, 13 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: ru, mokoz, mukuz, pitai, pɨta
The 10 most frequent NUM
types: naʔiruz, mokoz, Mukuz, napɨtaʔikwaw, pitei
The 10 most frequent ambiguous lemmas: ru (NUM 4, ADP 2), mokoz (NUM 2, NOUN 1), pɨta (VERB 9, NUM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 1.000000 (the average of all parts of speech is 1.933709).
The 1st highest number of forms (1) was observed with the lemma “mokoz”: mokoz.
The 2nd highest number of forms (1) was observed with the lemma “mukuz”: Mukuz.
The 3rd highest number of forms (1) was observed with the lemma “pitai”: pitei.
NUM
occurs with 3 features: Polarity (5; 56% instances), Rel (4; 44% instances), Degree (1; 11% instances)
NUM
occurs with 3 feature-value pairs: Degree=Dim
, Polarity=Neg
, Rel=NCont
NUM
occurs with 3 feature combinations.
The most frequent feature combination is _
(4 tokens).
Examples: mokoz, Mukuz, pitei
Relations
NUM
nodes are attached to their parents using 3 different relations: nummod (6; 67% instances), nmod (2; 22% instances), root (1; 11% instances)
Parents of NUM
nodes belong to 2 different parts of speech: NOUN (8; 89% instances), (1; 11% instances)
8 (89%) NUM
nodes are leaves.
0 (0%) NUM
nodes have one child.
0 (0%) NUM
nodes have two children.
1 (11%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 4 different relations: discourse (2; 40% instances), obl (1; 20% instances), obl:subj (1; 20% instances), punct (1; 20% instances)
Children of NUM
nodes belong to 4 different parts of speech: NOUN (2; 40% instances), PART (1; 20% instances), PRON (1; 20% instances), PUNCT (1; 20% instances)