Treebank Statistics: UD_Chinese-CFL: POS Tags: NOUN
There are 611 NOUN lemmas (37%), 612 NOUN types (37%) and 1352 NOUN tokens (19%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
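The lemma, type and token counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the function names `upos_counts` and `rank` and the tiny `SAMPLE` sentence are illustrative assumptions, and "type" is taken here to mean a distinct word form.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by counts[...][index] (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

# Hypothetical three-token sentence, invented for illustration only.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "人们", "人", "NOUN", "_", "_", "2", "nsubj", "_", "_"],
    ["2", "旅行", "旅行", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "人", "人", "NOUN", "_", "_", "2", "obj", "_", "_"],
])

counts = upos_counts(SAMPLE)
print(counts["NOUN"], rank(counts, "NOUN", 2))
```

In the sample, the two forms 人们 and 人 share the lemma 人, so NOUN has one lemma, two types and two tokens.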
The 10 most frequent NOUN lemmas: 个、 人、 时候、 天、 次、 朋友、 自行车、 旅行、 事、 时间
The 10 most frequent NOUN types: 个、 人、 时候、 天、 次、 朋友、 自行车、 旅行、 事、 时间
The 10 most frequent ambiguous lemmas: 个 (NOUN 85, DET 1), 人 (NOUN 45, PRON 1), 旅行 (NOUN 16, VERB 10), 生活 (NOUN 13, VERB 2), 以前 (NOUN 5, ADP 2), 以后 (ADP 13, NOUN 4), 上 (ADP 28, VERB 10, NOUN 3), 后来 (NOUN 3, ADV 1), 下 (VERB 6, NOUN 2, ADP 1), 之前 (ADP 3, NOUN 2)
The 10 most frequent ambiguous types: 个 (NOUN 85, DET 1), 人 (NOUN 42, PRON 1), 旅行 (NOUN 16, VERB 10), 生活 (NOUN 13, VERB 2), 以前 (NOUN 5, ADP 2), 印象 (NOUN 5, VERB 1), 以后 (ADP 13, NOUN 4), 上 (ADP 28, VERB 10, NOUN 3), 后来 (NOUN 3, ADV 1), 下 (VERB 6, NOUN 2, ADP 1)
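The ambiguity lists above are consistent with the following procedure: keep every item (lemma or type) that occurs with NOUN and at least one other tag, order items by their NOUN count, and within each item order the tags by frequency. The sketch below assumes that reading; `ambiguous_items` is a hypothetical helper and the sample pairs are invented, not taken from the treebank.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, upos="NOUN", top=10):
    """pairs: iterable of (item, tag). Return the `top` items tagged with
    `upos` and at least one other tag, ordered by their `upos` count."""
    tag_counts = defaultdict(Counter)
    for item, tag in pairs:
        tag_counts[item][tag] += 1
    ambiguous = [
        (item, c) for item, c in tag_counts.items()
        if upos in c and len(c) > 1
    ]
    # Assumed ordering: descending count of the target tag.
    ambiguous.sort(key=lambda ic: ic[1][upos], reverse=True)
    # Within an item, tags are listed from most to least frequent.
    return [(item, c.most_common()) for item, c in ambiguous[:top]]

# Invented counts for illustration only.
pairs = (
    [("个", "NOUN")] * 3 + [("个", "DET")]
    + [("以后", "ADP")] * 2 + [("以后", "NOUN")]
    + [("朋友", "NOUN")] * 2
)
for item, tags in ambiguous_items(pairs):
    print(item, tags)
```

Passing (lemma, tag) pairs yields the lemma list and passing (form, tag) pairs yields the type list, which is why 人 shows NOUN 45 as a lemma but NOUN 42 as a type.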
Morphology
The form / lemma ratio of NOUN is 1.001637 (the average of all parts of speech is 1.009709).
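The reported ratio matches the number of NOUN types divided by the number of NOUN lemmas given at the top of this page. A quick check, assuming that is how the ratio is defined (`form_lemma_ratio` is an illustrative name):

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas

# Counts taken from this page: 612 NOUN types, 611 NOUN lemmas.
ratio = form_lemma_ratio(612, 611)
print(f"{ratio:.6f}")  # 1.001637
```

A ratio this close to 1 means that almost every NOUN lemma appears in a single form; the only exception listed below is 人, with the forms 人 and 人们.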
The 1st highest number of forms (2) was observed with the lemma “人”: 人, 人们.
The 2nd highest number of forms (1) was observed with the lemma “12月”: 12月.
The 3rd highest number of forms (1) was observed with the lemma “2012年”: 2012年.
NOUN does not occur with any features.
Relations
NOUN nodes are attached to their parents using 29 different relations: obj (396; 29% instances), nsubj (189; 14% instances), clf (140; 10% instances), obl:tmod (133; 10% instances), obl (131; 10% instances), nmod (111; 8% instances), conj (44; 3% instances), compound (38; 3% instances), root (33; 2% instances), compound:vo (25; 2% instances), appos (17; 1% instances), advcl (16; 1% instances), parataxis (15; 1% instances), ccomp (13; 1% instances), dislocated (10; 1% instances), xcomp (7; 1% instances), dep (6; 0% instances), obl:patient (5; 0% instances), acl (4; 0% instances), obl:agent (4; 0% instances), amod (3; 0% instances), flat (3; 0% instances), case:loc (2; 0% instances), nsubj:outer (2; 0% instances), case (1; 0% instances), compound:dir (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)
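Each percentage in the list above is the relation count divided by the 1352 NOUN tokens, rounded to the nearest whole percent, which is why relations with only a handful of instances show as 0%. A small sketch of that arithmetic, assuming ordinary rounding (`share` is an illustrative helper, and only a few of the 29 relations are repeated here):

```python
def share(count, total):
    """Percentage of `total`, rounded to the nearest integer."""
    return round(100 * count / total)

NOUN_TOKENS = 1352  # total NOUN tokens reported on this page

# A few counts copied from the list above.
examples = {"obj": 396, "nsubj": 189, "clf": 140, "xcomp": 7, "dep": 6}

for rel, n in examples.items():
    print(f"{rel} ({n}; {share(n, NOUN_TOKENS)}% instances)")
```

The 29 counts sum to 1352, so every NOUN token is accounted for by exactly one incoming relation.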
Parents of NOUN nodes belong to 9 different parts of speech: VERB (865; 64% instances), NOUN (231; 17% instances), NUM (90; 7% instances), ADJ (60; 4% instances), DET (33; 2% instances), PRON (33; 2% instances), no part of speech, i.e. the artificial root (33; 2% instances), PROPN (6; 0% instances), ADP (1; 0% instances)
510 (38%) NOUN nodes are leaves.
484 (36%) NOUN nodes have one child.
225 (17%) NOUN nodes have two children.
133 (10%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 9.
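The child-degree figures above can be derived by counting, for each NOUN node, how many nodes in the same sentence name it as their head. The sketch below assumes each sentence is available as a list of `(id, head, upos)` tuples; `child_degree_buckets` and the four-node sentence are hypothetical.

```python
from collections import Counter

def child_degree_buckets(sentence, upos="NOUN"):
    """sentence: list of (id, head, upos) tuples for one tree; head 0 is the root.
    Return a Counter over the buckets "0", "1", "2" and "3+"."""
    n_children = Counter(head for _, head, _ in sentence)
    buckets = Counter()
    for node_id, _, tag in sentence:
        if tag != upos:
            continue
        degree = n_children[node_id]  # 0 for nodes that head nothing
        buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets

# Invented tree: node 4 (NOUN) has one child (node 3), node 1 (NOUN) is a leaf.
sentence = [(1, 2, "NOUN"), (2, 0, "VERB"), (3, 4, "NUM"), (4, 2, "NOUN")]
print(child_degree_buckets(sentence))
```

Summing the buckets over all sentences should give the four counts reported above, which add up to the 1352 NOUN tokens (510 + 484 + 225 + 133).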
Children of NOUN nodes are attached using 29 different relations: nmod (249; 17% instances), case (173; 12% instances), amod (142; 10% instances), nummod (131; 9% instances), punct (121; 8% instances), det (120; 8% instances), acl (101; 7% instances), case:loc (75; 5% instances), cop (51; 3% instances), compound (50; 3% instances), nsubj (49; 3% instances), advmod (37; 3% instances), conj (34; 2% instances), cc (31; 2% instances), parataxis (21; 1% instances), mark:rel (15; 1% instances), clf (14; 1% instances), obl (10; 1% instances), mark (8; 1% instances), advcl (7; 0% instances), appos (6; 0% instances), discourse:sp (6; 0% instances), dep (4; 0% instances), flat (3; 0% instances), obj (2; 0% instances), xcomp (2; 0% instances), discourse (1; 0% instances), mark:adv (1; 0% instances), obl:tmod (1; 0% instances)
Children of NOUN nodes belong to 14 different parts of speech: NOUN (231; 16% instances), ADP (182; 12% instances), PRON (155; 11% instances), ADJ (145; 10% instances), NUM (133; 9% instances), VERB (129; 9% instances), DET (121; 8% instances), PUNCT (121; 8% instances), PART (87; 6% instances), AUX (51; 3% instances), ADV (37; 3% instances), PROPN (35; 2% instances), CCONJ (30; 2% instances), SCONJ (8; 1% instances)