Treebank Statistics: UD_Uyghur-UDT: POS Tags: X
There are 2 X lemmas (0%), 10 X types (0%) and 28 X tokens (0%).
Out of 16 observed tags, the rank of X is: 15 in number of lemmas, 15 in number of types and 16 in number of tokens.
The 10 most frequent X lemmas: _، advcl
The 10 most frequent X types: >، <، غا، دىن، −، advcl، ئوقۇيالمايدۇ، تېپىۋالدۇق، كۈلۈپ، گە
The 10 most frequent ambiguous lemmas: _ (VERB 4560, NOUN 4224, PRON 479, PUNCT 434, ADJ 326, AUX 185, ADV 157, PART 119, NUM 75, CCONJ 72, ADP 51, INTJ 47, DET 28, X 27)
The 10 most frequent ambiguous types: > (PUNCT 20, X 7), < (PUNCT 17, X 7), − (NOUN 4, X 2), تېپىۋالدۇق (VERB 1, X 1), كۈلۈپ (VERB 16, X 1)
Morphology
The form / lemma ratio of X is 5.000000 (the average of all parts of speech is 4.088599).
The 1st highest number of forms (9) was observed with the lemma “_”: >, <, ئوقۇيالمايدۇ, تېپىۋالدۇق, دىن, غا, كۈلۈپ, گە, −.
The 2nd highest number of forms (1) was observed with the lemma “advcl”: advcl.
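The ratio above can be reproduced from the two lemmas and their forms. The following is a minimal sketch that assumes the ratio is the number of distinct lemma-form pairs divided by the number of lemmas; that reading matches the figures reported here (10 types, 2 lemmas), but the actual statistics script may define it differently. The name `forms_by_lemma` is illustrative only.

```python
# Forms observed for each X lemma, copied from the two lines above.
forms_by_lemma = {
    "_": [">", "<", "ئوقۇيالمايدۇ", "تېپىۋالدۇق", "دىن", "غا", "كۈلۈپ", "گە", "−"],
    "advcl": ["advcl"],
}

# Assumption: ratio = distinct (lemma, form) pairs / number of lemmas.
n_forms = sum(len(set(forms)) for forms in forms_by_lemma.values())
n_lemmas = len(forms_by_lemma)
ratio = n_forms / n_lemmas

print(f"{ratio:.6f}")  # 5.000000
```

Because nearly all X tokens carry the placeholder lemma “_”, the ratio mostly reflects missing lemmatization rather than rich morphology.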
X does not occur with any features.
Relations
X nodes are attached to their parents using 6 different relations: dep (17; 61% instances), dislocated (5; 18% instances), root (3; 11% instances), advcl (1; 4% instances), mark (1; 4% instances), nmod:comp (1; 4% instances)
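The percentages in this section follow from the raw counts and the 28 X tokens. Here is a short sketch, assuming half-up rounding to whole percent; the helper name `share` is hypothetical and not part of any UD tool.

```python
def share(counts):
    """Return each count as a whole-number percentage of the total (half-up rounding)."""
    total = sum(counts.values())
    return {label: int(n / total * 100 + 0.5) for label, n in counts.items()}

# Relation counts for X nodes, taken from the sentence above.
relations = {"dep": 17, "dislocated": 5, "root": 3,
             "advcl": 1, "mark": 1, "nmod:comp": 1}

percent = share(relations)
print(percent["dep"], percent["dislocated"], percent["root"], percent["advcl"])  # 61 18 11 4
```

Since every share is rounded independently, the rounded values need not add up to exactly 100 (here they sum to 102).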
Parents of X nodes belong to 5 different parts of speech: VERB (13; 46% instances), NOUN (7; 25% instances), PRON (4; 14% instances), no POS, i.e. the artificial root (3; 11% instances), ADJ (1; 4% instances)
23 (82%) X nodes are leaves.
4 (14%) X nodes have one child.
0 (0%) X nodes have two children.
1 (4%) X nodes have three or more children.
The highest child degree of an X node is 4.
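The child-degree figures above are mutually consistent, which a small sketch can check. The list `degrees` is reconstructed from the reported counts (23 leaves, 4 nodes with one child, and a single node with 4 children); it is not read from the treebank itself, and the bucket names are illustrative.

```python
from collections import Counter

# One entry per X node: its number of children, reconstructed from the counts above.
degrees = [0] * 23 + [1] * 4 + [4]

def bucket(degree):
    """Map a child count to the categories used in this section."""
    if degree == 0:
        return "leaf"
    if degree == 1:
        return "one child"
    if degree == 2:
        return "two children"
    return "three or more"

buckets = Counter(bucket(d) for d in degrees)
total = len(degrees)

for name in ("leaf", "one child", "two children", "three or more"):
    count = buckets[name]
    print(f"{count} ({int(count / total * 100 + 0.5)}%) {name}")

print("highest child degree:", max(degrees))
print("children in total:", sum(degrees))
```

The total of 8 children agrees with the per-relation and per-POS counts of children reported for X nodes.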
Children of X nodes are attached using 6 different relations: obj (2; 25% instances), punct (2; 25% instances), aux (1; 13% instances), compound (1; 13% instances), discourse (1; 13% instances), nsubj (1; 13% instances)
Children of X nodes belong to 4 different parts of speech: NOUN (4; 50% instances), PUNCT (2; 25% instances), AUX (1; 13% instances), NUM (1; 13% instances)