Treebank Statistics: UD_Korean-GSD: POS Tags: NOUN
There are 19268 NOUN
lemmas (53%), 19213 NOUN
types (52%) and 32347 NOUN
tokens (40%).
Out of 16 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: 수, 등, 것+이, 것, 때, 길, 맛+이, 것+을, 맛+도, 등+을
The 10 most frequent NOUN
types: 수, 등, 것, 때, 게, 길, 것이, 맛이, 것을, 맛도
The 10 most frequent ambiguous lemmas: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)
The 10 most frequent ambiguous types: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)
- 수
- 때
- 이번
- 전
- 서울
- 원
- 대전
- 뒤
- 미국
- 근처
Morphology
The form / lemma ratio of NOUN
is 0.997146 (the average of all parts of speech is 1.001499).
The 1st highest number of forms (2) was observed with the lemma “http:-FWS-”: http://www.khmais.net, http://www.modetour.com.
The 2nd highest number of forms (2) was observed with the lemma “것+이”: 것이, 게.
The 3rd highest number of forms (1) was observed with the lemma “0+시+를”: 0시를.
NOUN
does not occur with any features.
Relations
NOUN
nodes are attached to their parents using 22 different relations: flat (8260; 26% instances), nsubj (7222; 22% instances), obj (5473; 17% instances), obl (2537; 8% instances), nmod:poss (2275; 7% instances), conj (1887; 6% instances), nmod (1639; 5% instances), appos (790; 2% instances), root (646; 2% instances), dep (574; 2% instances), nsubj:pass (499; 2% instances), acl:relcl (174; 1% instances), advcl (114; 0% instances), iobj (102; 0% instances), ccomp (80; 0% instances), nummod (31; 0% instances), case (18; 0% instances), csubj (15; 0% instances), amod (8; 0% instances), compound (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)
Parents of NOUN
nodes belong to 12 different parts of speech: VERB (15312; 47% instances), NOUN (13789; 43% instances), ADJ (1122; 3% instances), ADV (858; 3% instances), (646; 2% instances), PROPN (373; 1% instances), NUM (186; 1% instances), PRON (36; 0% instances), DET (13; 0% instances), AUX (6; 0% instances), SYM (4; 0% instances), INTJ (2; 0% instances)
16153 (50%) NOUN
nodes are leaves.
9496 (29%) NOUN
nodes have one child.
4039 (12%) NOUN
nodes have two children.
2659 (8%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 17.
Children of NOUN
nodes are attached using 29 different relations: flat (10209; 37% instances), punct (3093; 11% instances), acl:relcl (2413; 9% instances), conj (2042; 7% instances), nmod:poss (1728; 6% instances), dep (1614; 6% instances), case (1187; 4% instances), amod (1118; 4% instances), appos (1059; 4% instances), nsubj (580; 2% instances), advmod (417; 1% instances), obj (404; 1% instances), det (400; 1% instances), nummod (350; 1% instances), obl (308; 1% instances), nmod (306; 1% instances), advcl (240; 1% instances), cc (168; 1% instances), det:poss (135; 0% instances), cop (49; 0% instances), ccomp (42; 0% instances), nsubj:pass (17; 0% instances), fixed (12; 0% instances), iobj (9; 0% instances), acl (6; 0% instances), csubj (3; 0% instances), mark (2; 0% instances), aux (1; 0% instances), parataxis (1; 0% instances)
Children of NOUN
nodes belong to 15 different parts of speech: NOUN (13789; 49% instances), VERB (4487; 16% instances), PUNCT (3093; 11% instances), ADV (2505; 9% instances), ADP (1277; 5% instances), ADJ (1091; 4% instances), NUM (466; 2% instances), DET (401; 1% instances), PROPN (216; 1% instances), PRON (187; 1% instances), CCONJ (168; 1% instances), SYM (154; 1% instances), AUX (50; 0% instances), PART (24; 0% instances), X (5; 0% instances)