Treebank Statistics: UD_Chinese-PUD: POS Tags: PART
There are 26 PART
lemmas (0%), 26 PART
types (0%) and 1483 PART
tokens (7%).
Out of 15 observed tags, the rank of PART
is: 11 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent PART
lemmas: 的、 了、 地、 之、 得、 嗎、 人、 者、 黨、 區
The 10 most frequent PART
types: 的、 了、 地、 之、 得、 嗎、 人、 者、 黨、 區
The 10 most frequent ambiguous lemmas: 的 (PART 1361, X 1), 了 (AUX 339, PART 41), 地 (PART 22, NOUN 2), 之 (PART 21, PRON 6), 得 (PART 9, AUX 5, VERB 3), 人 (NOUN 105, PART 3), 者 (NOUN 2, PART 2), 區 (NOUN 2, PART 1), 家 (NOUN 9, PART 1), 法 (PROPN 2, PART 1)
The 10 most frequent ambiguous types: 的 (PART 1361, X 1), 了 (AUX 339, PART 41), 地 (PART 22, NOUN 2), 之 (PART 21, PRON 6), 得 (PART 9, VERB 3), 人 (NOUN 91, PART 3), 者 (NOUN 2, PART 2), 區 (NOUN 2, PART 1), 家 (NOUN 9, PART 1), 法 (PROPN 2, PART 1)
- 的
- 了
- 地
- 之
- 得
- 人
- 者
- 區
- 家
- 法
Morphology
The form / lemma ratio of PART
is 1.000000 (the average of all parts of speech is 1.006233).
The 1st highest number of forms (1) was observed with the lemma “之”: 之.
The 2nd highest number of forms (1) was observed with the lemma “了”: 了.
The 3rd highest number of forms (1) was observed with the lemma “人”: 人.
PART
occurs with 1 features: Case (709; 48% instances)
PART
occurs with 1 feature-value pairs: Case=Gen
PART
occurs with 2 feature combinations.
The most frequent feature combination is _
(774 tokens).
Examples: 的、 了、 地、 得、 之、 嗎、 人、 者、 黨、 區
Relations
PART
nodes are attached to their parents using 13 different relations: case (709; 48% instances), mark:rel (626; 42% instances), discourse:sp (87; 6% instances), mark:adv (22; 1% instances), mark:prt (16; 1% instances), obj (8; 1% instances), appos (4; 0% instances), compound (3; 0% instances), conj (3; 0% instances), nsubj (2; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)
Parents of PART
nodes belong to 9 different parts of speech: VERB (558; 38% instances), NOUN (366; 25% instances), ADJ (222; 15% instances), PROPN (139; 9% instances), PRON (108; 7% instances), ADP (63; 4% instances), X (15; 1% instances), DET (6; 0% instances), NUM (6; 0% instances)
1461 (99%) PART
nodes are leaves.
11 (1%) PART
nodes have one child.
4 (0%) PART
nodes have two children.
7 (0%) PART
nodes have three or more children.
The highest child degree of a PART
node is 5.
Children of PART
nodes are attached using 13 different relations: dep (11; 25% instances), compound (10; 23% instances), punct (6; 14% instances), case (2; 5% instances), case:loc (2; 5% instances), cc (2; 5% instances), conj (2; 5% instances), nmod (2; 5% instances), nsubj (2; 5% instances), nummod (2; 5% instances), clf (1; 2% instances), cop (1; 2% instances), obl (1; 2% instances)
Children of PART
nodes belong to 10 different parts of speech: NOUN (19; 43% instances), PUNCT (6; 14% instances), ADP (4; 9% instances), PROPN (4; 9% instances), VERB (3; 7% instances), CCONJ (2; 5% instances), NUM (2; 5% instances), X (2; 5% instances), AUX (1; 2% instances), PRON (1; 2% instances)