home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-Autogramm: POS Tags: NUM

There are 1 NUM lemmas (6%), 15 NUM types (1%) and 59 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 14 in number of types and 16 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: mhaj, gaːl, mhall, alif, asarama, gali, gaːt, mhali, awwal, faɖig

The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)

The 10 most frequent ambiguous types: ʔawwal (ADV 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 15.000000 (the average of all parts of speech is 126.812500).

The 1st highest number of forms (15) was observed with the lemma “_”: alif, asarama, awwal, faɖig, gali, gaːl, gaːlnaːj, gaːt, mhaj, mhali, mhall, mhallaː, mhaloː, ʃeː, ʔawwal.

NUM occurs with 2 features: Gender (1; 2% instances), Number (1; 2% instances)

NUM occurs with 2 feature-value pairs: Gender=Fem, Number=Plur

NUM occurs with 3 feature combinations. The most frequent feature combination is _ (57 tokens). Examples: mhaj, gaːl, mhall, alif, asarama, mhali, gali, gaːt, awwal, faɖig

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (29; 49% instances), nmod (12; 20% instances), dep:comp (6; 10% instances), dislocated:mod (3; 5% instances), nsubj (2; 3% instances), obj (2; 3% instances), dislocated:obj (1; 2% instances), dislocated:subj (1; 2% instances), obl:arg (1; 2% instances), obl:mod (1; 2% instances), root (1; 2% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (42; 71% instances), VERB (8; 14% instances), ADP (6; 10% instances), ADJ (1; 2% instances), AUX (1; 2% instances), (1; 2% instances)

21 (36%) NUM nodes are leaves.

23 (39%) NUM nodes have one child.

11 (19%) NUM nodes have two children.

4 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 10 different relations: det (27; 44% instances), punct (14; 23% instances), dep (7; 11% instances), nmod (4; 7% instances), cop (2; 3% instances), nmod:poss (2; 3% instances), nsubj (2; 3% instances), acl:relcl (1; 2% instances), cc (1; 2% instances), dep:flat (1; 2% instances)

Children of NUM nodes belong to 8 different parts of speech: DET (27; 44% instances), PUNCT (14; 23% instances), PRON (7; 11% instances), ADJ (5; 8% instances), SCONJ (3; 5% instances), AUX (2; 3% instances), NOUN (2; 3% instances), CCONJ (1; 2% instances)