Treebank Statistics: UD_Beja-Autogramm: POS Tags: NUM
NUM has 1 lemma (6%), 15 types (1%) and 59 tokens (0%).
Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 14 in number of types and 16 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: mhaj, gaːl, mhall, alif, asarama, gali, gaːt, mhali, awwal, faɖig
The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)
The 10 most frequent ambiguous types: ʔawwal (ADV 1, NUM 1)
Morphology
The form / lemma ratio of NUM is 15.000000 (the average of all parts of speech is 126.812500).
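The ratio for NUM can be reproduced from the counts at the top of this page (15 types, 1 lemma). The snippet below is a minimal sketch, assuming that the ratio is simply the number of distinct forms divided by the number of distinct lemmas; the function name `form_lemma_ratio` is hypothetical and not part of any UD tool.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Return distinct forms per distinct lemma (assumed definition)."""
    if n_lemmas == 0:
        raise ValueError("a tag needs at least one lemma")
    return n_types / n_lemmas


# Counts reported for NUM on this page: 15 types, 1 lemma.
num_ratio = form_lemma_ratio(15, 1)
print(f"{num_ratio:.6f}")
```

Because every token in this treebank carries the placeholder lemma "_", the ratio here equals the number of types.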
The highest number of forms (15) was observed with the lemma “_”: alif, asarama, awwal, faɖig, gali, gaːl, gaːlnaːj, gaːt, mhaj, mhali, mhall, mhallaː, mhaloː, ʃeː, ʔawwal.
NUM occurs with 2 features: Gender (1; 2% instances), Number (1; 2% instances)
NUM occurs with 2 feature-value pairs: Gender=Fem, Number=Plur
NUM occurs with 3 feature combinations. The most frequent feature combination is _ (57 tokens).
Examples: mhaj, gaːl, mhall, alif, asarama, mhali, gali, gaːt, awwal, faɖig
Relations
NUM nodes are attached to their parents using 11 different relations: nummod (29; 49% instances), nmod (12; 20% instances), dep:comp (6; 10% instances), dislocated:mod (3; 5% instances), nsubj (2; 3% instances), obj (2; 3% instances), dislocated:obj (1; 2% instances), dislocated:subj (1; 2% instances), obl:arg (1; 2% instances), obl:mod (1; 2% instances), root (1; 2% instances)
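The percentages in the relation list follow from the raw counts and the 59 NUM tokens. As a rough check, the sketch below recomputes them, assuming each share is rounded to the nearest whole percent; the helper name `percent` is hypothetical.

```python
# Counts copied from the relation list above.
relation_counts = {
    "nummod": 29, "nmod": 12, "dep:comp": 6, "dislocated:mod": 3,
    "nsubj": 2, "obj": 2, "dislocated:obj": 1, "dislocated:subj": 1,
    "obl:arg": 1, "obl:mod": 1, "root": 1,
}

# The counts should add up to the number of NUM tokens.
total = sum(relation_counts.values())


def percent(count: int, total: int) -> int:
    """Share of `count` in `total`, rounded to a whole percent (assumption)."""
    return round(100 * count / total)


percentages = {rel: percent(n, total) for rel, n in relation_counts.items()}
for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```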
Parents of NUM nodes belong to 6 different parts of speech: NOUN (42; 71% instances), VERB (8; 14% instances), ADP (6; 10% instances), ADJ (1; 2% instances), AUX (1; 2% instances), (1; 2% instances)
21 (36%) NUM nodes are leaves.
23 (39%) NUM nodes have one child.
11 (19%) NUM nodes have two children.
4 (7%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
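The child-degree figures above can be derived from the head column of a dependency tree. The following is only an illustrative sketch: it assumes each sentence is available as a list of `(id, head, upos)` triples, and both the function `child_degree_histogram` and the toy sentence are hypothetical rather than taken from the treebank or its tooling.

```python
from collections import Counter


def child_degree_histogram(tokens, upos="NUM"):
    """Count nodes tagged `upos` by number of children (0, 1, 2, 3 = three or more).

    `tokens` is a list of (id, head, upos) triples for one sentence.
    """
    # How many tokens name each id as their head.
    n_children = Counter(head for _, head, _ in tokens)
    hist = Counter()
    for tok_id, _, tag in tokens:
        if tag == upos:
            # Bin 3 collects every node with three or more children.
            hist[min(n_children[tok_id], 3)] += 1
    return hist


# Hypothetical toy sentence, not taken from the treebank.
toy = [
    (1, 2, "DET"),
    (2, 3, "NUM"),   # one child: token 1
    (3, 0, "NOUN"),
    (4, 3, "NUM"),   # leaf
]
hist = child_degree_histogram(toy)
print(dict(hist))
```

Summing such histograms over all sentences would give the four counts listed above (21, 23, 11 and 4), which together cover the 59 NUM tokens.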
Children of NUM nodes are attached using 10 different relations: det (27; 44% instances), punct (14; 23% instances), dep (7; 11% instances), nmod (4; 7% instances), cop (2; 3% instances), nmod:poss (2; 3% instances), nsubj (2; 3% instances), acl:relcl (1; 2% instances), cc (1; 2% instances), dep:flat (1; 2% instances)
Children of NUM nodes belong to 8 different parts of speech: DET (27; 44% instances), PUNCT (14; 23% instances), PRON (7; 11% instances), ADJ (5; 8% instances), SCONJ (3; 5% instances), AUX (2; 3% instances), NOUN (2; 3% instances), CCONJ (1; 2% instances)