Treebank Statistics: UD_Korean-PUD: POS Tags: AUX
There are 6 AUX lemmas (0%), 144 AUX types (2%) and 663 AUX tokens (4%).
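As a rough illustration of how lemma, type and token counts like these can be derived, here is a minimal sketch that tallies them for one UPOS tag from CoNLL-U text. The function name `upos_stats` and the two-sentence sample are hypothetical and not taken from the treebank; the sketch also assumes that a "type" is simply a distinct word form, which may differ from how the statistics above were actually computed.

```python
# Hypothetical CoNLL-U fragment (tab-separated, 10 columns).
# Illustrative only; these sentences are NOT from UD_Korean-PUD.
SAMPLE = "\n".join([
    "# text = illustrative only",
    "1\t학생\t학생\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t이다\t이\tAUX\t_\t_\t1\tcop\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
    "1\t의사\t의사\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t인\t이\tAUX\t_\t_\t1\tcop\t_\t_",
    "",
])


def upos_stats(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = upos_stats(SAMPLE, "AUX")
print(n_lemmas, n_types, n_tokens)
```

Run on a real treebank file, the same function would be called with the file's contents in place of `SAMPLE`.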
Out of 13 observed tags, the rank of AUX is: 9 in number of lemmas, 7 in number of types and 5 in number of tokens.
The 10 most frequent AUX lemmas: 이, _, 있, 않, 하, 싶
The 10 most frequent AUX types: 인, 이다, 이었다, 이라, 였다, 있다, 있는, 이며, 일, 라
The 10 most frequent ambiguous lemmas: 이 (AUX 436, PRON 23, PART 16), _ (NOUN 4295, VERB 1439, PROPN 1030, ADJ 596, ADV 516, DET 462, CCONJ 125, AUX 104, X 47, NUM 27, PRON 24, PUNCT 1)
The 10 most frequent ambiguous types: 인 (AUX 149, PROPN 1), 있다 (ADJ 45, VERB 24, AUX 20), 있는 (ADJ 50, AUX 16, VERB 7), 일 (NOUN 32, AUX 12, PROPN 3), 라 (AUX 9, NOUN 1), 않았다 (AUX 8, VERB 5), 가 (PART 20, AUX 7), 한다 (VERB 20, AUX 7), 못했다 (AUX 6, VERB 4), 않은 (AUX 6, VERB 3, ADJ 1)
Morphology
The form / lemma ratio of AUX is 24.000000 (the average of all parts of speech is 3.181543).
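The reported ratio is consistent with dividing the number of AUX types by the number of AUX lemmas given at the top of this page. The short check below assumes that this is how the ratio is defined; the variable names are illustrative.

```python
# Counts reported on this page for AUX.
aux_types = 144   # distinct word forms
aux_lemmas = 6    # distinct lemmas

# Assumed definition: forms per lemma = types / lemmas.
ratio = aux_types / aux_lemmas
print(f"{ratio:.6f}")  # 24.000000
```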
The 1st highest number of forms (53) was observed with the lemma “_”: 가, 가는, 고, 나서, 내기, 내는, 낼, 냈다, 놓고, 놓는, 놓았다, 놓은, 니, 달라고, 두고, 둘, 라, 라는, 라면, 말고, 말라고, 못하게, 못하고, 못한다는, 못할, 못했다, 못했을, 버렸고, 버렸다, 버렸으며, 버린, 본, 봤으며, 여서, 오는, 왔지만, 주는, 준, 준다, 치우고, 한, 한다, 한다고, 한다는, 한다던, 할, 합니다, 해, 해야, 했는데, 했다, 했다고, 했을.
The 2nd highest number of forms (43) was observed with the lemma “이”: 였고, 였기, 였는데, 였는지, 였다, 였던, 였어, 였으며, 였음, 였지만, 이거나, 이고, 이기, 이다, 이다., 이던, 이든, 이라, 이라는, 이란, 이며, 이세요, 이어서, 이어야, 이었고, 이었기, 이었는데, 이었다, 이었던, 이었으며, 이었을, 이었음, 이자, 이지, 이지만, 이진, 인, 인가, 인데, 인지, 일, 일까, 일지.
The 3rd highest number of forms (23) was observed with the lemma “않”: 않게, 않고, 않기, 않는, 않는다, 않는다고, 않는다면, 않다, 않다고, 않다는, 않도록, 않아, 않았고, 않았는데, 않았다, 않았어요, 않았으면, 않았지만, 않으며, 않으면, 않으므로, 않은, 않을.
AUX occurs with 7 features: Form (362; 55% instances), VerbForm (301; 45% instances), Mood (265; 40% instances), Tense (150; 23% instances), PronType (12; 2% instances), Polite (5; 1% instances), Case (1; 0% instances)
AUX occurs with 12 feature-value pairs: Case=Acc, Form=Adn, Form=Aux, Form=Compl, Mood=Imp, Mood=Ind, Polite=Form, PronType=Int, Tense=Fut, Tense=Past, VerbForm=Fin, VerbForm=Ger
AUX occurs with 17 feature combinations.
The most frequent feature combination is Form=Adn (218 tokens).
Examples: 인, 있는, 일, 이라는, 이란, 않은, 가는, 있던, 내는, 낼
Relations
AUX nodes are attached to their parents using 2 different relations: cop (458; 69% instances), aux (205; 31% instances)
Parents of AUX nodes belong to 8 different parts of speech: NOUN (432; 65% instances), VERB (185; 28% instances), PROPN (13; 2% instances), ADJ (12; 2% instances), PART (9; 1% instances), PRON (8; 1% instances), ADV (2; 0% instances), NUM (2; 0% instances)
604 (91%) AUX nodes are leaves.
55 (8%) AUX nodes have one child.
4 (1%) AUX nodes have two children.
The highest child degree of an AUX node is 2.
Children of AUX nodes are attached using 2 different relations: punct (57; 90% instances), conj (6; 10% instances)
Children of AUX nodes belong to 3 different parts of speech: PUNCT (57; 90% instances), VERB (4; 6% instances), NOUN (2; 3% instances)
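The figures in this section can be cross-checked against each other: the node counts by child degree should sum to the 663 AUX tokens, and the number of children they imply should match the totals of both child breakdowns. The sketch below uses only numbers reported on this page; the helper `pct` is illustrative and assumes percentages are rounded to the nearest integer.

```python
# Node counts by child degree, as reported above.
leaves, one_child, two_children = 604, 55, 4

total_nodes = leaves + one_child + two_children
total_children = one_child * 1 + two_children * 2

# Child breakdowns, as reported above.
child_relations = {"punct": 57, "conj": 6}
child_upos = {"PUNCT": 57, "VERB": 4, "NOUN": 2}


def pct(count, total):
    """Percentage rounded to the nearest integer (assumed rounding rule)."""
    return round(100 * count / total)


print(total_nodes)                     # 663
print(total_children)                  # 63
print(sum(child_relations.values()))   # 63
print(sum(child_upos.values()))        # 63
print({rel: pct(n, total_children) for rel, n in child_relations.items()})
```

The rounded part-of-speech percentages (90, 6, 3) add up to 99 rather than 100, which is a rounding effect and not a discrepancy in the counts.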