Statistics of NOUN in UD


This page pertains to UD version 2.


Treebank Statistics: UD_Korean-Kaist: POS Tags: `NOUN`

There are 36315 NOUN lemmas (36%), 36002 NOUN types (36%) and 105020 NOUN tokens (30%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
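The three counts above can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout (FORM in column 2, LEMMA in column 3, UPOS in column 4). The `SAMPLE` fragment and the `upos_stats` helper are hypothetical illustrations, not data or code from UD_Korean-Kaist, and the page's own tooling may count differently (for example in how it treats case or multiword tokens).

```python
# Hypothetical CoNLL-U fragment; columns are tab-separated:
# ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "1\t건\t것+은\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\t있다\t있+다\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# sent_id = demo-2",
    "1\t것은\t것+은\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "2\t수\t수\tNOUN\t_\t_\t3\tobj\t_\t_",
    "3\t있다\t있+다\tVERB\t_\t_\t0\troot\t_\t_",
])


def upos_stats(conllu_text, upos):
    """Return (distinct lemmas, distinct types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank separators and comment lines
        cols = line.split("\t")
        # Keep only regular word lines: ten columns and a plain integer ID
        # (this skips multiword ranges like "1-2" and empty nodes like "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_stats(SAMPLE, "NOUN"))
```

In the sample, the forms 건 and 것은 share the lemma 것+은, so the NOUN tag has fewer lemmas than types.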

The 10 most frequent NOUN lemmas: 수, 것+은, 것+이, 것+을, 때, 년, 가지, 것, 데, 등

The 10 most frequent NOUN types: 수, 것은, 것이, 것을, 때, 년, 가지, 것, 데, 경우

The 10 most frequent ambiguous lemmas: 수 (NOUN 2457, ADJ 1, NUM 1, PROPN 1), 것+이 (NOUN 896, PRON 1), 자신+의 (NOUN 187, PRON 30), 당시 (NOUN 173, ADV 2), 이후 (NOUN 153, ADV 3), 중 (NOUN 135, PROPN 2), 문제+는 (NOUN 115, PROPN 2), 오늘날 (NOUN 111, ADV 2), 현재 (NOUN 111, ADV 6), 이상 (NOUN 88, PROPN 1)

The 10 most frequent ambiguous types: 수 (NOUN 2457, ADJ 2, NUM 1, PROPN 1), 것이 (NOUN 884, PRON 1), 가지 (NOUN 265, VERB 8), 자신의 (NOUN 187, PRON 30), 당시 (NOUN 173, ADV 2), 이후 (NOUN 153, ADV 3), 중 (NOUN 134, PROPN 2), 문제는 (NOUN 115, PROPN 2), 오늘날 (NOUN 112, ADV 2), 현재 (NOUN 110, ADV 6)

수
- NOUN 2457: 학생들은 교과서를 읽으면서 공부하다가 부딛치는 수많은 법학인물들을 이 사전을 통하여 보다 가까이 알 수 있게 될 것이다 .
- ADJ 2: 현재 한국에서 성취된 광통신 최고 속도는 한국전자통신연구소의 565 메가비트 시스템으로서 , 수 년 후에는 2 3 기가비트를 기대해 보고 있다 .
- NUM 1: 그리하여 동학조직은 수운 당시부터 매우 끈끈한 공동체로서 자리잡기 시작하여 그가 처형당한 후에도 수 십년간 지하 조직으로 존립을 가능하게 하였다 .
- PROPN 1: 제국의 경계는 , 동으로 한반도 북부 , 북으로 바이칼호와 이르티시 ( Irtish ) 강변 , 서로는 아랄해 , 남으로는 중국의 웨이수이 ( 수 ) 와 티베트 고원 , 그리고 카라코람산맥을 잇는 거대한 영토를 포함하게 되었다 .
것이
- NOUN 884: 법률과 법학은 용어를 바르고 정확히 사용하는 것이 생명인데 , 이 사전은 그런 면에서 법학도와 법률가들이 꼭 알아야 할 내용이다 .
- PRON 1: 고려시대에 들어서는 연결되어 잊고 , 앉아있는 것조차도 잊어버리는 경지에 도달한 상태의 차라는 것이 있습니다 .
가지
- NOUN 265: 고대 출판문화 발전에는 한 가지 공통점이 있었다 .
- VERB 8: 잘 알려진 바와 같이 이것이 바로 암세포들이며 이러한 암세포들이 일단 발생하면 그 모체는 얼마 가지 않아 사멸되고 만다 .
자신의
- NOUN 187: 다시 말하면 그 생산물은 자신의 필요에 의해서가 아니라 타인을 위한 추상적인 교환가치를 위해 대형생산되는 것이다 .
- PRON 30: 그들은 자신의 불완전성까지도 스스럼 없이 받아주는 고향으로 돌아온 것이다 .
당시
- NOUN 173: 그 살곶이다리는 그 당시 남쪽으로 통하는 제일 큰 다리였습니다 .
- ADV 2: 당시 8월 초순에 위버반도에서 젠투펭귄 40마리를 만난다는 말을 들은 기억이 잇다 .
이후
- NOUN 153: 이후 영국의 골드스미스 , 포우 , 스코트 , 디킨즈 , 프랑스의 뒤마 , 으젠느슈 , 메리메의 작품들이 상품화되어 팔려 나갔다 .
- ADV 3: 노에제사회에서의 헤태리즘은 이후 봉건제사회에 들어서며 한층 더 음란하고 왜곡된 형태로 전개되었습니다 .
중
- NOUN 134: 다만 그 중 이승만은 배재 출신이므로 감리교회 선교사에게 보내서 세례를 받게 했다 .
- PROPN 2: 주말을 이용해 워커힐 호텔에서 만난 한 중 대표단은 수교 발표문에 포함될 공동성명의 내용과 정상회담 개최에 관한 원칙에 합의했다 .
문제는
- NOUN 115: 우리는 역사의 시대구분에 있어서 어느 시기로부터 근대라는 선을 긋느냐 하는 문제는 여러 가지 이론이 있어왔다 .
- PROPN 2: 그리고 나서 이윽고 문제는 하상공으로부터 주석이 달린 도덕경을 전수받았다 .
오늘날
- NOUN 112: 오늘날 출판인들이 입을 모아 하는 말들 중의 하나는 출판의 방향을 예측하기도 어렵고 전문적인 출판이 자리를 잡기가 어렵다고 한다 .
- ADV 2: 오늘날 우리 주변에는 자기 혼자 예수 잘 믿는 사람들이 많습니다 .
현재
- NOUN 110: 지금 현재 대학의 커리큘럼에 문학교육론은 존재합니다만 , 진정한 의미의 문학교육학은 존재하지 않고 있습니다 .
- ADV 6: 현재 우리나라에는 세계적으로 이름을 날리는 유명한 목사들이 많습니다 .
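An "ambiguous type" in the lists above is a word form that occurs with NOUN and with at least one other tag. A minimal sketch of that grouping follows, assuming the input is a flat list of (form, UPOS) pairs. The `ambiguous_types` helper and the toy counts are hypothetical and do not reproduce the treebank's figures.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens, upos):
    """Map each form tagged `upos` and at least one other tag to its tag counts."""
    tags_by_form = defaultdict(Counter)
    for form, tag in tagged_tokens:
        tags_by_form[form][tag] += 1
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }


# Toy input: the forms come from the page, the counts are made up.
toy_tokens = (
    [("수", "NOUN")] * 3
    + [("수", "NUM")]
    + [("가지", "NOUN"), ("가지", "VERB"), ("때", "NOUN")]
)

for form, tags in ambiguous_types(toy_tokens, "NOUN").items():
    print(form, tags)
```

The form 때 is left out of the result because it only ever appears as NOUN in the toy input.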

Morphology

The form / lemma ratio of NOUN is 0.991381 (the average of all parts of speech is 0.998034).
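The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of the page. That this is how the page computes it is an inference from the numbers, not something the page states. A quick arithmetic check:

```python
# Counts taken from the summary line of this page.
noun_types = 36002
noun_lemmas = 36315

# Assumption: "form / lemma ratio" means distinct forms divided by distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 0.991381
```

A ratio below 1 means there are fewer distinct forms than distinct lemmas for this tag.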

The highest number of forms (4) was observed with the lemma “것+은”: 건, 건은, 것은, 것을.

The 2nd highest number of forms (3) was observed with the lemma “대하+어서+는”: 대하여서는, 대해서는, 대해선.

The 3rd highest number of forms (3) was observed with the lemma “등+의”: 드의, 등의, 등이.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 23 different relations: obj (20542; 20% instances), compound (18701; 18% instances), nmod (15343; 15% instances), dislocated (15028; 14% instances), nsubj (14281; 14% instances), obl (7833; 7% instances), conj (5711; 5% instances), amod (2974; 3% instances), dep (1985; 2% instances), csubj (1145; 1% instances), ccomp (404; 0% instances), appos (334; 0% instances), root (287; 0% instances), advcl (254; 0% instances), acl (121; 0% instances), fixed (34; 0% instances), flat (23; 0% instances), vocative (8; 0% instances), iobj (5; 0% instances), xcomp (3; 0% instances), nummod (2; 0% instances), clf (1; 0% instances), parataxis (1; 0% instances)
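The percentages in this list appear to be each relation's count divided by the total number of NOUN tokens, rounded to a whole percent, which is why rare relations show as 0%. The sketch below checks that reading against the counts printed above. The `share` helper is a hypothetical name, and the rounding rule is an assumption.

```python
# Relation counts copied from the list above.
relation_counts = {
    "obj": 20542, "compound": 18701, "nmod": 15343, "dislocated": 15028,
    "nsubj": 14281, "obl": 7833, "conj": 5711, "amod": 2974, "dep": 1985,
    "csubj": 1145, "ccomp": 404, "appos": 334, "root": 287, "advcl": 254,
    "acl": 121, "fixed": 34, "flat": 23, "vocative": 8, "iobj": 5,
    "xcomp": 3, "nummod": 2, "clf": 1, "parataxis": 1,
}

total = sum(relation_counts.values())


def share(count, total):
    """Whole-number percentage of `count` in `total` (assumed rounding rule)."""
    return round(100 * count / total)


print(total)  # equals the NOUN token count reported on this page
for relation in ("obj", "compound", "ccomp"):
    print(relation, f"{share(relation_counts[relation], total)}%")
```

The counts sum to 105020, the NOUN token total, so every NOUN token is accounted for by exactly one incoming relation.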

Parents of NOUN nodes belong to 15 different parts of speech: VERB (44080; 42% instances), NOUN (26884; 26% instances), ADV (10195; 10% instances), CCONJ (9221; 9% instances), SCONJ (7911; 8% instances), ADJ (5110; 5% instances), PROPN (683; 1% instances), NUM (413; 0% instances), no part of speech, i.e. the artificial root (287; 0% instances), PRON (111; 0% instances), X (67; 0% instances), PART (46; 0% instances), SYM (5; 0% instances), AUX (4; 0% instances), INTJ (3; 0% instances)

45834 (44%) NOUN nodes are leaves.

44890 (43%) NOUN nodes have one child.

11467 (11%) NOUN nodes have two children.

2829 (3%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 10.
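The child-degree figures above (leaves, one child, two children, and so on) can be derived from the HEAD column of a dependency tree. A minimal sketch follows, assuming each token is given as an (id, head, upos) triple with head 0 marking the root. The `child_degree_histogram` helper and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(tokens, upos):
    """Count how many nodes tagged `upos` have 0, 1, 2, ... children."""
    # Number of children per node: how often each id appears as a HEAD.
    children = Counter(head for _tok_id, head, _tag in tokens if head != 0)
    return Counter(
        children[tok_id] for tok_id, _head, tag in tokens if tag == upos
    )


# Toy sentence as (id, head, upos); not taken from the treebank.
toy_sentence = [
    (1, 3, "DET"),
    (2, 3, "ADJ"),
    (3, 6, "NOUN"),  # two children: tokens 1 and 2
    (4, 5, "NOUN"),  # leaf
    (5, 6, "NOUN"),  # one child: token 4
    (6, 0, "VERB"),  # root
]

histogram = child_degree_histogram(toy_sentence, "NOUN")
for degree in sorted(histogram):
    print(degree, histogram[degree])
```

On the page's own numbers, the four buckets (45834, 44890, 11467 and 2829) add up to 105020, the total number of NOUN tokens.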

Children of NOUN nodes are attached using 27 different relations: compound (15404; 20% instances), acl (15217; 20% instances), nmod (13569; 17% instances), amod (10568; 14% instances), punct (6286; 8% instances), det (3464; 4% instances), fixed (2890; 4% instances), nummod (2134; 3% instances), conj (2073; 3% instances), obl (869; 1% instances), case (761; 1% instances), appos (663; 1% instances), advmod (653; 1% instances), cc (580; 1% instances), nsubj (454; 1% instances), advcl (432; 1% instances), obj (413; 1% instances), dislocated (341; 0% instances), dep (250; 0% instances), cop (201; 0% instances), ccomp (171; 0% instances), aux (117; 0% instances), xcomp (81; 0% instances), iobj (32; 0% instances), csubj (16; 0% instances), mark (14; 0% instances), discourse (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (26884; 35% instances), VERB (17806; 23% instances), ADJ (7325; 9% instances), PUNCT (6286; 8% instances), PROPN (4598; 6% instances), DET (3464; 4% instances), CCONJ (3314; 4% instances), NUM (2506; 3% instances), ADV (1721; 2% instances), PRON (1701; 2% instances), ADP (687; 1% instances), SCONJ (556; 1% instances), AUX (320; 0% instances), X (311; 0% instances), PART (103; 0% instances), SYM (70; 0% instances), INTJ (3; 0% instances)
