home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Beja-Autogramm: POS Tags: X

There are 1 X lemmas (6%), 56 X types (3%) and 73 X tokens (1%). Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 5 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: {laughter}, e#, /, XXX, a#, alla, aː#, issalaːt, siddik, ssalaːm

The 10 most frequent ambiguous lemmas: _ (VERB 2410, PUNCT 2363, DET 1737, NOUN 1719, PRON 820, ADP 766, SCONJ 592, CCONJ 338, PART 321, AUX 284, ADV 191, ADJ 149, X 73, INTJ 66, PROPN 63, NUM 59)

The 10 most frequent ambiguous types: / (PUNCT 1631, X 2), alla (X 2, NOUN 1), ajwa (PART 1, X 1), dh# (ADP 1, X 1), əəə (INTJ 7, X 1)

Morphology

The form / lemma ratio of X is 56.000000 (the average of all parts of speech is 126.812500).

The 1st highest number of forms (56) was observed with the lemma “_”: /, XXX, a#, aa#, aaa#, aaaj#, aga#, ago#, ajwa, alamdulilla, alla, am#, astoːfor, aw#, aː#, aːn#, beːtalmaːl, dh#, e#, eː#, fira#, ha#, hahadn#, har#, ifi#, igam#, issalaːt, istaːf#, istaːfer, kaː#, malaːʔik, rif#, siddik, sima#, simas#, ssalaːm, tʔanoː#, uʔeː#, w#, wa, wauːːː#, weːːː#, {laughter}, {noise}, əddew#, əgəg#, əl#, ət#, əəə, ʃibi#, ʃɛːka#, ʃʔibaː#, ʔa#, ʔo#, ʔuweːːːh#, ʕalehu.

X occurs with 2 features: Foreign (20; 27% instances), Number (1; 1% instances)

X occurs with 2 feature-value pairs: Foreign=Yes, Number=Plur

X occurs with 3 feature combinations. The most frequent feature combination is _ (53 tokens). Examples: {laughter}, e#, /, a#, aː#, XXX, aa#, aaa#, aaaj#, aga#

Relations

X nodes are attached to their parents using 17 different relations: dep:comp (11; 15% instances), discourse (10; 14% instances), nsubj (9; 12% instances), dep:conj (7; 10% instances), reparandum (7; 10% instances), obj (5; 7% instances), obl:arg (4; 5% instances), dep (3; 4% instances), nmod (3; 4% instances), root (3; 4% instances), cc (2; 3% instances), fixed (2; 3% instances), obl:mod (2; 3% instances), parataxis (2; 3% instances), advcl (1; 1% instances), dislocated:obj (1; 1% instances), xcomp (1; 1% instances)

Parents of X nodes belong to 9 different parts of speech: VERB (22; 30% instances), X (19; 26% instances), NOUN (13; 18% instances), SCONJ (8; 11% instances), ADP (5; 7% instances), (3; 4% instances), ADJ (1; 1% instances), AUX (1; 1% instances), PRON (1; 1% instances)

24 (33%) X nodes are leaves.

17 (23%) X nodes have one child.

20 (27%) X nodes have two children.

12 (16%) X nodes have three or more children.

The highest child degree of a X node is 5.

Children of X nodes are attached using 14 different relations: reparandum (45; 45% instances), punct (36; 36% instances), dep:conj (3; 3% instances), cc (2; 2% instances), discourse (2; 2% instances), nmod (2; 2% instances), nsubj (2; 2% instances), obl:arg (2; 2% instances), aux (1; 1% instances), ccomp (1; 1% instances), dep (1; 1% instances), det (1; 1% instances), dislocated:subj (1; 1% instances), parataxis (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: PUNCT (36; 36% instances), NOUN (20; 20% instances), X (19; 19% instances), VERB (10; 10% instances), DET (5; 5% instances), PRON (3; 3% instances), ADP (2; 2% instances), ADV (2; 2% instances), AUX (1; 1% instances), INTJ (1; 1% instances), SCONJ (1; 1% instances)