home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: VERB

There are 6457 VERB lemmas (36%), 6394 VERB types (36%) and 17634 VERB tokens (26%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: 것+이+ㅂ니다, 있+는, 하+ㄹ, 생각+하+ㄴ다, 가+ㄹ, 것+이+다, 하+고, 가+고, 가+았+습니다, 있+으면

The 10 most frequent VERB types: 겁니다, 있는, 할, 생각한다, 갈, 하고, 것이다, 가고, 갔습니다, 있으면

The 10 most frequent ambiguous lemmas: 있+는 (VERB 245, ADJ 1), 되+었+다 (VERB 84, ADJ 1), 좋+은 (ADJ 88, VERB 56, ADV 1), 오+ㄴ (VERB 53, DET 2), 나+는 (PRON 231, VERB 48, NOUN 3), 따르+아서 (VERB 33, CCONJ 1), 유명+하+ㄴ (VERB 31, ADJ 14), 보+ㄴ (VERB 29, NOUN 1), 사+았+습니다 (VERB 28, ADJ 1), 이+다 (VERB 28, ADJ 1)

The 10 most frequent ambiguous types: 있는 (VERB 244, AUX 58, ADJ 1), 할 (VERB 199, AUX 12, X 7), 하고 (VERB 142, AUX 6), 있으면 (VERB 117, AUX 4), 하는 (VERB 98, AUX 22), 했다 (VERB 90, AUX 12), 있어서 (VERB 75, AUX 12), 있고 (VERB 74, AUX 6), 한다 (AUX 113, VERB 62), 하면 (VERB 61, AUX 6)

Morphology

The form / lemma ratio of VERB is 0.990243 (the average of all parts of speech is 1.004265).

The 1st highest number of forms (3) was observed with the lemma “것+이+에요”: 거에요, 거예요, 것이에요.

The 2nd highest number of forms (3) was observed with the lemma “답+하+았+다”: 답하였다, 답햇다, 답했다.

The 3rd highest number of forms (2) was observed with the lemma “40+명+이+었+다”: 40명이었다, 40명이였다.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 23 different relations: root (5654; 32% instances), advcl (5314; 30% instances), acl (4328; 25% instances), conj (684; 4% instances), ccomp (605; 3% instances), obl (256; 1% instances), amod (234; 1% instances), nsubj (180; 1% instances), case (101; 1% instances), obj (75; 0% instances), dislocated (38; 0% instances), flat (38; 0% instances), parataxis (31; 0% instances), nmod (29; 0% instances), list (22; 0% instances), compound (14; 0% instances), csubj (11; 0% instances), fixed (7; 0% instances), discourse (4; 0% instances), nmod:poss (4; 0% instances), dep (3; 0% instances), appos (1; 0% instances), vocative (1; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: VERB (7099; 40% instances), (5654; 32% instances), NOUN (3460; 20% instances), ADJ (929; 5% instances), ADV (467; 3% instances), PRON (13; 0% instances), AUX (5; 0% instances), PROPN (4; 0% instances), NUM (3; 0% instances)

1684 (10%) VERB nodes are leaves.

4984 (28%) VERB nodes have one child.

4324 (25%) VERB nodes have two children.

6642 (38%) VERB nodes have three or more children.

The highest child degree of a VERB node is 9.

Children of VERB nodes are attached using 30 different relations: nsubj (6454; 17% instances), obl (6399; 16% instances), punct (5609; 14% instances), obj (5211; 13% instances), advcl (4477; 12% instances), advmod (4039; 10% instances), aux (1316; 3% instances), acl (1011; 3% instances), conj (960; 2% instances), cc (917; 2% instances), ccomp (610; 2% instances), mark (489; 1% instances), dislocated (427; 1% instances), nmod (279; 1% instances), amod (189; 0% instances), nmod:poss (121; 0% instances), case (83; 0% instances), list (70; 0% instances), goeswith (56; 0% instances), vocative (38; 0% instances), parataxis (31; 0% instances), compound (23; 0% instances), det (19; 0% instances), flat (15; 0% instances), csubj (13; 0% instances), appos (6; 0% instances), discourse (6; 0% instances), nummod (5; 0% instances), fixed (3; 0% instances), dep (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (12462; 32% instances), ADV (10592; 27% instances), VERB (7099; 18% instances), PUNCT (5609; 14% instances), AUX (1319; 3% instances), PRON (1222; 3% instances), ADJ (356; 1% instances), ADP (95; 0% instances), X (56; 0% instances), NUM (37; 0% instances), DET (23; 0% instances), PROPN (4; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances), SYM (1; 0% instances)