Treebank Statistics: UD_Beja-Autogramm: POS Tags: X
There are 1 X
lemmas (6%), 56 X
types (3%) and 73 X
tokens (1%).
Out of 16 observed tags, the rank of X
is: 16 in number of lemmas, 5 in number of types and 13 in number of tokens.
The 10 most frequent X
lemmas: _
The 10 most frequent X
types: {laughter}, e#, /, XXX, a#, alla, aː#, issalaːt, siddik, ssalaːm
The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)
The 10 most frequent ambiguous types: / (PUNCT 1631, X 2), alla (X 2, NOUN 1), ajwa (PART 1, X 1), dh# (ADP 1, X 1), əəə (INTJ 7, X 1)
- /
- alla
- ajwa
- dh#
- əəə
Morphology
The form / lemma ratio of X
is 56.000000 (the average of all parts of speech is 126.812500).
The 1st highest number of forms (56) was observed with the lemma “_”: /, XXX, a#, aa#, aaa#, aaaj#, aga#, ago#, ajwa, alamdulilla, alla, am#, astoːfor, aw#, aː#, aːn#, beːtalmaːl, dh#, e#, eː#, fira#, ha#, hahadn#, har#, ifi#, igam#, issalaːt, istaːf#, istaːfer, kaː#, malaːʔik, rif#, siddik, sima#, simas#, ssalaːm, tʔanoː#, uʔeː#, w#, wa, wauːːː#, weːːː#, {laughter}, {noise}, əddew#, əgəg#, əl#, ət#, əəə, ʃibi#, ʃɛːka#, ʃʔibaː#, ʔa#, ʔo#, ʔuweːːːh#, ʕalehu.
X
occurs with 2 features: Foreign (20; 27% instances), Number (1; 1% instances)
X
occurs with 2 feature-value pairs: Foreign=Yes
, Number=Plur
X
occurs with 3 feature combinations.
The most frequent feature combination is _
(53 tokens).
Examples: {laughter}, e#, /, a#, aː#, XXX, aa#, aaa#, aaaj#, aga#
Relations
X
nodes are attached to their parents using 17 different relations: dep:comp (11; 15% instances), discourse (10; 14% instances), nsubj (9; 12% instances), dep:conj (7; 10% instances), reparandum (7; 10% instances), obj (5; 7% instances), obl:arg (4; 5% instances), dep (3; 4% instances), nmod (3; 4% instances), root (3; 4% instances), cc (2; 3% instances), fixed (2; 3% instances), obl:mod (2; 3% instances), parataxis (2; 3% instances), advcl (1; 1% instances), dislocated:obj (1; 1% instances), xcomp (1; 1% instances)
Parents of X
nodes belong to 9 different parts of speech: VERB (22; 30% instances), X (19; 26% instances), NOUN (13; 18% instances), SCONJ (8; 11% instances), ADP (5; 7% instances), (3; 4% instances), ADJ (1; 1% instances), AUX (1; 1% instances), PRON (1; 1% instances)
24 (33%) X
nodes are leaves.
17 (23%) X
nodes have one child.
20 (27%) X
nodes have two children.
12 (16%) X
nodes have three or more children.
The highest child degree of a X
node is 5.
Children of X
nodes are attached using 14 different relations: reparandum (45; 45% instances), punct (36; 36% instances), dep:conj (3; 3% instances), cc (2; 2% instances), discourse (2; 2% instances), nmod (2; 2% instances), nsubj (2; 2% instances), obl:arg (2; 2% instances), aux (1; 1% instances), ccomp (1; 1% instances), dep (1; 1% instances), det (1; 1% instances), dislocated:subj (1; 1% instances), parataxis (1; 1% instances)
Children of X
nodes belong to 11 different parts of speech: PUNCT (36; 36% instances), NOUN (20; 20% instances), X (19; 19% instances), VERB (10; 10% instances), DET (5; 5% instances), PRON (3; 3% instances), ADP (2; 2% instances), ADV (2; 2% instances), AUX (1; 1% instances), INTJ (1; 1% instances), SCONJ (1; 1% instances)