Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: NUM
There are 24 NUM lemmas (2%), 24 NUM types (1%) and 94 NUM tokens (1%).
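The three counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes a "type" is a distinct word form (no case folding), and the function name `pos_counts`, the helper `row`, and the toy sentence are hypothetical illustrations rather than treebank data.

```python
def pos_counts(conllu_text, upos="NUM"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Keep only ordinary word lines; skip multiword-token ranges
        # (IDs like 1-2) and empty nodes (IDs like 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos, head, deprel):
    """Build one tab-separated CoNLL-U word line (toy helper)."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Toy sentence, invented for illustration only.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "noun1", "noun1", "NOUN", 0, "root"),
    row(2, "biyu", "biyu", "NUM", 1, "nummod"),
    row(3, "goːmà", "goːmà", "NUM", 1, "nummod"),
    row(4, "biyu", "biyu", "NUM", 3, "flat"),
])

print(pos_counts(SAMPLE))  # (2, 2, 3)
```

The percentages in parentheses would then be each count divided by the corresponding total over all tags in the treebank.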
Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 12 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, shâː, àshìr̃in
The 10 most frequent NUM types: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, shâː, àshìr̃in
The 10 most frequent ambiguous lemmas: dubuː (NUM 5, X 1), shâː (NUM 4, VERB 1), huɗu (NUM 2, X 1)
The 10 most frequent ambiguous types: dubuː (NUM 5, X 1), shâː (NUM 4, VERB 4), huɗu (NUM 2, X 1)
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.303635).
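As a rough check, the ratio can be computed as the number of distinct forms divided by the number of distinct lemmas. This definition is an assumption, although it is consistent with the figures reported for NUM (24 types / 24 lemmas). The function name `form_lemma_ratio` and the toy pairs are hypothetical.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for one part of speech."""
    pairs = list(pairs)  # allow generators; we iterate twice
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Toy input in which every form equals its lemma.
pairs = [("biyu", "biyu"), ("goːmà", "goːmà"), ("ɗaya", "ɗaya"), ("biyu", "biyu")]
print(f"{form_lemma_ratio(pairs):.6f}")  # 1.000000

# The reported NUM figures: 24 types over 24 lemmas.
print(f"{24 / 24:.6f}")  # 1.000000
```

A ratio of exactly 1 means that no numeral lemma was observed in more than one surface form.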
The 1st highest number of forms (1) was observed with the lemma “bakwài”: bakwài.
The 2nd highest number of forms (1) was observed with the lemma “biyu”: biyu.
The 3rd highest number of forms (1) was observed with the lemma “bìyar̃”: bìyar̃.
NUM occurs with 3 features: Definite (20; 21% instances), ExtPos (1; 1% instances), PronType (1; 1% instances)
NUM occurs with 4 feature-value pairs: Definite=Cons, Definite=Def, ExtPos=NOUN, PronType=Int
NUM occurs with 5 feature combinations. The most frequent feature combination is _ (73 tokens).
Examples: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, shâː, huɗu, takwàs
Relations
NUM nodes are attached to their parents using 12 different relations: nummod (44; 47% instances), flat (14; 15% instances), nmod (8; 9% instances), root (7; 7% instances), obl (4; 4% instances), xcomp (4; 4% instances), conj (3; 3% instances), nsubj (3; 3% instances), obj (3; 3% instances), fixed (2; 2% instances), advcl:cleft (1; 1% instances), obl:arg (1; 1% instances)
Parents of NUM nodes belong to 6 different parts of speech: NOUN (53; 56% instances), NUM (19; 20% instances), VERB (9; 10% instances), [no parent: root] (7; 7% instances), PART (5; 5% instances), AUX (1; 1% instances)
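The relation and parent-tag tallies can be sketched as follows. This is an illustration under assumptions, not the generator's code: sentences are given as lists of hypothetical `(id, upos, head, deprel)` tuples, and a token whose head is 0 has no parent word, which the sketch labels with the placeholder `"ROOT"`.

```python
from collections import Counter


def attachment_stats(sentences, upos="NUM"):
    """Count incoming relations and parent tags for tokens with the given UPOS.

    sentences: list of sentences, each a list of (id, upos, head, deprel).
    """
    relations, parent_tags = Counter(), Counter()
    for sent in sentences:
        tag_of = {tok_id: tag for tok_id, tag, _, _ in sent}
        for _, tag, head, deprel in sent:
            if tag == upos:
                relations[deprel] += 1
                # head 0 is the artificial root, which has no tag of its own
                parent_tags[tag_of.get(head, "ROOT")] += 1
    return relations, parent_tags


# Toy data, invented for illustration only.
toy = [
    [(1, "NOUN", 0, "root"), (2, "NUM", 1, "nummod"), (3, "NUM", 2, "flat")],
    [(1, "NUM", 0, "root")],
]
relations, parent_tags = attachment_stats(toy)
print(dict(relations))    # {'nummod': 1, 'flat': 1, 'root': 1}
print(dict(parent_tags))  # {'NOUN': 1, 'NUM': 1, 'ROOT': 1}
```

Each percentage in the lists above is the count divided by the 94 NUM tokens, for example 44 / 94 for nummod.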
69 (73%) NUM nodes are leaves.
9 (10%) NUM nodes have one child.
10 (11%) NUM nodes have two children.
6 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 4.
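The child-degree breakdown can be sketched by counting, for each NUM token, how many tokens in the same sentence name it as their head. The input layout of `(id, upos, head)` tuples and the function name `degree_buckets` are hypothetical. The last lines recompute the reported percentages from the reported counts.

```python
from collections import Counter


def degree_buckets(sentences, upos="NUM"):
    """Bucket tokens of one UPOS by number of children: "0", "1", "2", "3+".

    sentences: list of sentences, each a list of (id, upos, head) tuples.
    """
    buckets = Counter()
    for sent in sentences:
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                k = n_children[tok_id]  # 0 if no token points at tok_id
                buckets["3+" if k >= 3 else str(k)] += 1
    return buckets


# Toy sentence, invented for illustration only.
toy = [[(1, "NOUN", 0), (2, "NUM", 1), (3, "NUM", 2), (4, "PUNCT", 1)]]
print(dict(degree_buckets(toy)))  # {'1': 1, '0': 1}

# Percentages recomputed from the counts reported above.
reported = {"0": 69, "1": 9, "2": 10, "3+": 6}
total = sum(reported.values())
percent = {k: round(100 * v / total) for k, v in reported.items()}
print(total, percent)  # 94 {'0': 73, '1': 10, '2': 11, '3+': 6}
```

The four buckets sum to 94, matching the NUM token count.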
Children of NUM nodes are attached using 12 different relations: flat (14; 28% instances), punct (9; 18% instances), advmod (6; 12% instances), discourse (5; 10% instances), nmod (5; 10% instances), conj (4; 8% instances), case (2; 4% instances), advcl (1; 2% instances), cc (1; 2% instances), cop (1; 2% instances), dislocated (1; 2% instances), nsubj (1; 2% instances)
Children of NUM nodes belong to 10 different parts of speech: NUM (19; 38% instances), PUNCT (9; 18% instances), NOUN (6; 12% instances), ADV (5; 10% instances), PART (5; 10% instances), ADP (2; 4% instances), AUX (1; 2% instances), CCONJ (1; 2% instances), INTJ (1; 2% instances), X (1; 2% instances)