Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NOUN
There are 3800 NOUN
lemmas (28%), 3857 NOUN
types (28%) and 123065 NOUN
tokens (28%).
Out of 14 observed tags, the rank of NOUN
is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent NOUN
lemmas: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上
The 10 most frequent NOUN
types: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上
The 10 most frequent ambiguous lemmas: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)
The 10 most frequent ambiguous types: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)
- 王
- 子
- 君
- 天
- 國
- 下
- 臣
- 公
- 上
- 兵
Morphology
The form / lemma ratio of NOUN
is 1.015000 (the average of all parts of speech is 1.013130).
The 1st highest number of forms (2) was observed with the lemma “內”: 內, 内.
The 2nd highest number of forms (2) was observed with the lemma “冰”: 冰, 氷.
The 3rd highest number of forms (2) was observed with the lemma “勳”: 勛, 勳.
NOUN
occurs with 3 features: Case (32519; 26% instances), NounType (913; 1% instances), Degree (1; 0% instances)
NOUN
occurs with 4 feature-value pairs: Case=Loc
, Case=Tem
, Degree=Pos
, NounType=Clf
NOUN
occurs with 5 feature combinations.
The most frequent feature combination is _
(89632 tokens).
Examples: 王、 人、 子、 君、 國、 臣、 公、 兵、 事、 帝
Relations
NOUN
nodes are attached to their parents using 27 different relations: obj (36544; 30% instances), nsubj (29531; 24% instances), nmod (22803; 19% instances), conj (7535; 6% instances), root (5705; 5% instances), obl:tmod (4058; 3% instances), obl (3480; 3% instances), obl:lmod (3473; 3% instances), flat (3379; 3% instances), clf (2066; 2% instances), compound (1605; 1% instances), nsubj:outer (676; 1% instances), iobj (438; 0% instances), parataxis (402; 0% instances), amod (291; 0% instances), ccomp (213; 0% instances), acl (208; 0% instances), advcl (171; 0% instances), csubj (101; 0% instances), dislocated (91; 0% instances), vocative (71; 0% instances), list (67; 0% instances), flat:foreign (64; 0% instances), compound:redup (58; 0% instances), xcomp (27; 0% instances), nsubj:pass (6; 0% instances), csubj:outer (2; 0% instances)
Parents of NOUN
nodes belong to 13 different parts of speech: VERB (76675; 62% instances), NOUN (34143; 28% instances), (5705; 5% instances), NUM (3191; 3% instances), PROPN (2201; 2% instances), PART (925; 1% instances), PRON (122; 0% instances), AUX (51; 0% instances), ADV (44; 0% instances), ADP (3; 0% instances), INTJ (2; 0% instances), SYM (2; 0% instances), SCONJ (1; 0% instances)
61053 (50%) NOUN
nodes are leaves.
43120 (35%) NOUN
nodes have one child.
13320 (11%) NOUN
nodes have two children.
5572 (5%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 27.
Children of NOUN
nodes are attached using 35 different relations: nmod (27426; 31% instances), amod (12815; 14% instances), case (10081; 11% instances), conj (7311; 8% instances), compound (5226; 6% instances), det (5013; 6% instances), flat (4785; 5% instances), nummod (4310; 5% instances), nsubj (3138; 4% instances), cop (2211; 2% instances), discourse:sp (2116; 2% instances), acl (1832; 2% instances), advmod (1068; 1% instances), csubj (387; 0% instances), cc (372; 0% instances), mark (218; 0% instances), obl:tmod (174; 0% instances), obl (113; 0% instances), discourse (96; 0% instances), nsubj:outer (87; 0% instances), parataxis (83; 0% instances), aux (76; 0% instances), list (71; 0% instances), advcl (67; 0% instances), flat:foreign (64; 0% instances), obl:lmod (62; 0% instances), compound:redup (56; 0% instances), clf (36; 0% instances), expl (34; 0% instances), dislocated (32; 0% instances), flat:vv (31; 0% instances), fixed (15; 0% instances), vocative (9; 0% instances), csubj:outer (2; 0% instances), nsubj:pass (1; 0% instances)
Children of NOUN
nodes belong to 13 different parts of speech: NOUN (34143; 38% instances), VERB (14214; 16% instances), PROPN (13195; 15% instances), PRON (5723; 6% instances), SCONJ (5649; 6% instances), ADP (4702; 5% instances), NUM (4516; 5% instances), PART (3001; 3% instances), AUX (2293; 3% instances), ADV (1805; 2% instances), CCONJ (161; 0% instances), INTJ (11; 0% instances), SYM (5; 0% instances)