Treebank Statistics: UD_Korean-Kaist: POS Tags: X
There are 506 X
lemmas (0%), 506 X
types (1%) and 699 X
tokens (0%).
Out of 17 observed tags, the rank of X
is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.
The 10 most frequent X
lemmas: EC, 프롤레타리아, TV, S, the, km, ATM, PCS, 에퀴티, ABC
The 10 most frequent X
types: EC, 프롤레타리아, TV, S, the, km, ATM, PCS, 에퀴티, ABC
The 10 most frequent ambiguous lemmas: 프롤레타리아 (X 17, NOUN 16), km (X 7, SYM 4), 부르조아 (NOUN 7, X 4), C (X 2, SYM 1), KGB (PROPN 3, X 2), 꼬뻬라찌프 (PROPN 2, X 1), 더 (ADV 401, X 1), 레코드 (NOUN 1, X 1), 리스크 (NOUN 1, X 1), 삐로제닉 (PROPN 1, X 1)
The 10 most frequent ambiguous types: 프롤레타리아 (X 17, NOUN 16), km (X 7, SYM 4), 부르조아 (NOUN 7, X 4), C (X 2, SYM 1), KGB (PROPN 3, X 2), 꼬뻬라찌프 (PROPN 2, X 1), 더 (ADV 401, X 1), 레코드 (NOUN 1, X 1), 리스크 (NOUN 1, X 1), 삐로제닉 (PROPN 1, X 1)
- 프롤레타리아
- km
- 부르조아
- C
- KGB
- 꼬뻬라찌프
- 더
- 레코드
- 리스크
- 삐로제닉
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 0.998034).
The 1st highest number of forms (1) was observed with the lemma “230+b”: 230b.
The 2nd highest number of forms (1) was observed with the lemma “3+D”: 3D.
The 3rd highest number of forms (1) was observed with the lemma “A”: A.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 13 different relations: appos (253; 36% instances), flat (182; 26% instances), compound (170; 24% instances), conj (32; 5% instances), dislocated (12; 2% instances), dep (11; 2% instances), root (11; 2% instances), nmod (10; 1% instances), obj (7; 1% instances), advcl (5; 1% instances), nsubj (4; 1% instances), csubj (1; 0% instances), obl (1; 0% instances)
Parents of X
nodes belong to 12 different parts of speech: NOUN (311; 44% instances), X (197; 28% instances), PROPN (79; 11% instances), ADV (40; 6% instances), VERB (31; 4% instances), CCONJ (16; 2% instances), (11; 2% instances), NUM (8; 1% instances), PART (2; 0% instances), SCONJ (2; 0% instances), ADJ (1; 0% instances), PRON (1; 0% instances)
334 (48%) X
nodes are leaves.
48 (7%) X
nodes have one child.
169 (24%) X
nodes have two children.
148 (21%) X
nodes have three or more children.
The highest child degree of a X
node is 10.
Children of X
nodes are attached using 19 different relations: punct (564; 62% instances), flat (197; 22% instances), case (28; 3% instances), appos (26; 3% instances), dep (26; 3% instances), conj (22; 2% instances), nummod (13; 1% instances), acl (7; 1% instances), nmod (7; 1% instances), cop (5; 1% instances), dislocated (5; 1% instances), cc (3; 0% instances), ccomp (2; 0% instances), nsubj (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), compound (1; 0% instances), obl (1; 0% instances)
Children of X
nodes belong to 13 different parts of speech: PUNCT (564; 62% instances), X (197; 22% instances), NOUN (67; 7% instances), ADP (27; 3% instances), VERB (16; 2% instances), NUM (14; 2% instances), PROPN (7; 1% instances), CCONJ (6; 1% instances), AUX (5; 1% instances), ADV (4; 0% instances), ADJ (2; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)