Treebank Statistics: UD_Korean-PUD: POS Tags: PART
There are 91 PART
lemmas (3%), 94 PART
types (1%) and 478 PART
tokens (3%).
Out of 13 observed tags, the rank of PART
is: 4 in number of lemmas, 8 in number of types and 9 in number of tokens.
The 10 most frequent PART
lemmas: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이
The 10 most frequent PART
types: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이
The 10 most frequent ambiguous lemmas: 도 (PART 26, NOUN 2), 가 (PART 20, NOUN 1), 이 (AUX 436, PRON 23, PART 16), 과 (PART 10, NOUN 1), 만 (PART 5, NUM 1), 열기 (NOUN 1, PART 1)
The 10 most frequent ambiguous types: 고 (PART 38, AUX 1), 도 (PART 26, NOUN 1), 가 (PART 20, AUX 7), 와 (PART 19, VERB 1), 이 (DET 52, PART 16, NOUN 3, PRON 1, PROPN 1), 만 (DET 12, PART 5, NUM 1), 보다 (ADV 2, PART 2), 있음을 (PART 2, AUX 1), 들 (VERB 3, PART 1), 뿐 (NOUN 2, PART 1)
- 고
- 도
- 가
- 와
- 이
- DET 52: 이 부분에서 게임과 우리 일상 생활 사이의 유사점을 찾을 수 있습니다 .
- PART 16: 경찰 대변인은 “ 몇 마디 주고 받은 “ 후 “ 언쟁 “ 이 벌어졌지만 부상자는 없었다고 연합 통신에 밝혔다 .
- NOUN 3: 카리브 해는 1492 년까지 유라시아 사람들에게 는 알려지지 않은 곳 이었는데 이 때 크리스토퍼 콜럼버스 가 아시아 항로 탐색을 위한 목적으로 처음으로 카리브 해를 항해했다 .
- PRON 1: 원래 기단은 1950 년대에 일기 예보를 위해 사용되었지만 기상학자들은 1973 년 이 아이디어를 기반으로 종관 기후학 이란 분야를 만들기 시작했다 .
- PROPN 1: 이 박사는 “ 아고라 ( Agora ) 는 초대를 받아야만 입장할 수 있었지만 이 시장의 대부분은 검색 방법만 알면 쉽게 접근할 수 있다 ” 라고 덧붙였다 .
- 만
- 보다
- 있음을
- 들
- 뿐
Morphology
The form / lemma ratio of PART
is 1.032967 (the average of all parts of speech is 3.181543).
The 1st highest number of forms (2) was observed with the lemma “되기”: 되기를, 되기에.
The 2nd highest number of forms (2) was observed with the lemma “됨”: 됨으로써, 됨은.
The 3rd highest number of forms (2) was observed with the lemma “쓰기”: 쓰기도, 쓰기로.
PART
occurs with 8 features: Polite (278; 58% instances), Case (166; 35% instances), VerbForm (54; 11% instances), Mood (26; 5% instances), Tense (19; 4% instances), Form (18; 4% instances), Voice (3; 1% instances), PronType (2; 0% instances)
PART
occurs with 15 feature-value pairs: Case=Acc
, Case=Gen
, Case=Nom
, Form=Aux
, Form=Compl
, Mood=Imp
, Mood=Ind
, Polite=Form
, PronType=Int
, Tense=Fut
, Tense=Past
, VerbForm=Fin
, VerbForm=Ger
, Voice=Cau
, Voice=Pass
PART
occurs with 25 feature combinations.
The most frequent feature combination is _
(155 tokens).
Examples: 는, 고, 도, 라고, 만, 까지, 이라고, 밖에, 들, 마다
Relations
PART
nodes are attached to their parents using 10 different relations: case (404; 85% instances), advcl (49; 10% instances), ccomp (8; 2% instances), root (7; 1% instances), csubj (3; 1% instances), advmod (2; 0% instances), conj (2; 0% instances), acl:relcl (1; 0% instances), dep (1; 0% instances), nsubj (1; 0% instances)
Parents of PART
nodes belong to 8 different parts of speech: NOUN (246; 51% instances), PROPN (172; 36% instances), VERB (30; 6% instances), ADJ (11; 2% instances), (7; 1% instances), PRON (6; 1% instances), PART (5; 1% instances), ADV (1; 0% instances)
402 (84%) PART
nodes are leaves.
20 (4%) PART
nodes have one child.
24 (5%) PART
nodes have two children.
32 (7%) PART
nodes have three or more children.
The highest child degree of a PART
node is 6.
Children of PART
nodes are attached using 14 different relations: advcl (37; 19% instances), obj (35; 18% instances), obl (35; 18% instances), nsubj (30; 16% instances), advmod (16; 8% instances), punct (14; 7% instances), aux (9; 5% instances), case (3; 2% instances), ccomp (3; 2% instances), compound:lvc (3; 2% instances), iobj (3; 2% instances), conj (1; 1% instances), csubj (1; 1% instances), nsubj:pass (1; 1% instances)
Children of PART
nodes belong to 11 different parts of speech: NOUN (105; 55% instances), VERB (22; 12% instances), ADV (14; 7% instances), PUNCT (14; 7% instances), AUX (9; 5% instances), ADJ (7; 4% instances), PRON (7; 4% instances), PROPN (6; 3% instances), PART (5; 3% instances), CCONJ (1; 1% instances), NUM (1; 1% instances)