Treebank Statistics: UD_Chinese-CFL: POS Tags: PART
There are 11 PART lemmas (1%), 11 PART types (1%) and 545 PART tokens (8%).
Out of 15 observed tags, the rank of PART is: 14 in number of lemmas, 14 in number of types and 6 in number of tokens.
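The lemma, type and token counts and the ranks above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, assuming that "types" means distinct word forms and that ties in rank are broken by first occurrence; the sample sentences and the helper names (`read_tokens`, `pos_counts`, `rank`) are hypothetical and not part of the treebank tooling.

```python
from collections import defaultdict

# Tiny hand-made CoNLL-U fragment (illustrative only, not taken from the treebank).
SAMPLE = (
    "1\t我\t我\tPRON\t_\t_\t3\tnmod\t_\t_\n"
    "2\t的\t的\tPART\t_\t_\t1\tcase\t_\t_\n"
    "3\t书\t书\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\t好\t好\tADJ\t_\t_\t0\troot\t_\t_\n"
    "2\t吗\t吗\tPART\t_\t_\t1\tdiscourse:sp\t_\t_\n"
)

def read_tokens(text):
    """Yield (form, lemma, upos) for each basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]

def pos_counts(text):
    """Map each UPOS tag to (distinct lemmas, distinct forms, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(text):
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of a tag by one column (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

counts = pos_counts(SAMPLE)
print(counts["PART"])           # (2, 2, 2)
print(rank(counts, "PART", 2))  # 1
```

On the real treebank the same functions would be applied to the full file contents instead of `SAMPLE`.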
The 10 most frequent PART lemmas: 的、 了、 地、 得、 没、 吧、 吗、 呢、 啊、 和
The 10 most frequent PART types: 的、 了、 地、 得、 没、 吧、 吗、 呢、 啊、 和
The 10 most frequent ambiguous lemmas: 的 (PART 409, NOUN 1), 了 (AUX 124, PART 80, VERB 1), 得 (PART 11, VERB 6, AUX 4), 没 (PART 9, ADV 2), 吧 (PART 4, INTJ 1), 和 (CCONJ 28, ADP 13, PART 1)
The 10 most frequent ambiguous types: 的 (PART 409, NOUN 1), 了 (AUX 124, PART 80, VERB 1), 得 (PART 11, VERB 6, AUX 3), 没 (PART 9, ADV 2), 吧 (PART 4, INTJ 1), 和 (CCONJ 28, ADP 13, PART 1)
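An ambiguous lemma is one that occurs with more than one tag. As a rough sketch of how such a list might be derived, the code below groups per-tag counts by lemma, keeps lemmas that carry PART plus at least one other tag, and orders them by their PART frequency. The counts for 的 and 没 are copied from the figures above; the entry for 吗 is an unambiguous control with a made-up count, and the function names are hypothetical.

```python
from collections import Counter, defaultdict

# (lemma, UPOS, count) triples. 的 and 没 use the counts quoted above;
# the count for 吗 is invented purely to show an unambiguous lemma.
OBSERVED = [
    ("没", "PART", 9), ("没", "ADV", 2),
    ("的", "PART", 409), ("的", "NOUN", 1),
    ("吗", "PART", 5),
]

def ambiguous_for(tag, observed):
    """Return [(lemma, [(upos, count), ...])] for lemmas that share `tag` with other tags."""
    by_lemma = defaultdict(Counter)
    for lemma, upos, n in observed:
        by_lemma[lemma][upos] += n
    rows = [
        (lemma, tags.most_common())
        for lemma, tags in by_lemma.items()
        if tag in tags and len(tags) > 1
    ]
    # Order lemmas by how often they carry the tag of interest.
    rows.sort(key=lambda row: dict(row[1])[tag], reverse=True)
    return rows

def render(rows):
    """Format rows in the 'lemma (TAG n, TAG n)' style used on this page."""
    parts = []
    for lemma, tags in rows:
        inner = ", ".join(f"{t} {n}" for t, n in tags)
        parts.append(f"{lemma} ({inner})")
    return ", ".join(parts)

rows = ambiguous_for("PART", OBSERVED)
print(render(rows))  # 的 (PART 409, NOUN 1), 没 (PART 9, ADV 2)
```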
Morphology
The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.009709).
The 1st highest number of forms (1) was observed with the lemma “了”: 了.
The 2nd highest number of forms (1) was observed with the lemma “吗”: 吗.
The 3rd highest number of forms (1) was observed with the lemma “吧”: 吧.
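A ratio of exactly 1 means that every PART lemma appears in a single form. The sketch below shows one plausible way to compute such a ratio, assuming it is the total number of distinct forms per lemma divided by the number of lemmas; the exact definition used to generate this page is not stated here, and the sample pairs are illustrative.

```python
from collections import defaultdict

# Illustrative (form, lemma) pairs for PART tokens; repeated tokens are expected.
PAIRS = [("的", "的"), ("了", "了"), ("吗", "吗"), ("的", "的")]

def form_lemma_ratio(pairs):
    """Return (ratio, forms_by_lemma) under the assumed definition."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)

ratio, forms_by_lemma = form_lemma_ratio(PAIRS)
print(f"{ratio:.6f}")  # 1.000000
```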
PART occurs with 1 feature: Polarity (9; 2% instances)
PART occurs with 1 feature-value pair: Polarity=Neg
PART occurs with 2 feature combinations.
The most frequent feature combination is _ (536 tokens).
Examples: 的、 了、 地、 得、 吧、 吗、 呢、 啊、 和、 嗬
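The feature figures above follow from tallying the FEATS column of the PART tokens. As a minimal sketch, the code below rebuilds a FEATS column consistent with the reported numbers (536 tokens with `_` and 9 with `Polarity=Neg`) and counts features, feature-value pairs and whole combinations; the `summarize` helper is hypothetical.

```python
from collections import Counter

# FEATS values for the 545 PART tokens, reconstructed from the counts above.
feats_column = ["_"] * 536 + ["Polarity=Neg"] * 9

def summarize(feats_column):
    """Count feature names, feature=value pairs and full combinations."""
    features, pairs = Counter(), Counter()
    combos = Counter(feats_column)
    for feats in feats_column:
        if feats == "_":  # "_" marks an empty feature set in CoNLL-U
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos

features, pairs, combos = summarize(feats_column)
share = round(100 * features["Polarity"] / len(feats_column))
print(features["Polarity"], share)  # 9 2
print(len(combos), combos.most_common(1)[0])  # 2 ('_', 536)
```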
Relations
PART nodes are attached to their parents using 7 different relations: mark:rel (229; 42% instances), case (171; 31% instances), discourse:sp (100; 18% instances), mark:adv (25; 5% instances), compound:ext (10; 2% instances), advmod (9; 2% instances), cc (1; 0% instances)
Parents of PART nodes belong to 8 different parts of speech: VERB (195; 36% instances), ADJ (148; 27% instances), NOUN (87; 16% instances), PRON (84; 15% instances), PROPN (17; 3% instances), DET (7; 1% instances), ADV (6; 1% instances), ADP (1; 0% instances)
545 (100%) PART nodes are leaves.
The highest child degree of a PART node is 0.
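The relation, parent and leaf statistics in this section can all be read off the HEAD and DEPREL columns. The sketch below illustrates the idea on a single hand-made sentence: for each node with a given tag it records the incoming relation, the parent's part of speech, and the node's number of children. The sentence, its annotation and the `attachment_stats` helper are illustrative assumptions, not data from the treebank.

```python
from collections import Counter

# Hand-made (id, form, upos, head, deprel) rows for one sentence; illustrative only.
SENTENCE = [
    (1, "我", "PRON", 3, "nmod"),
    (2, "的", "PART", 1, "case"),
    (3, "书", "NOUN", 4, "nsubj"),
    (4, "好", "ADJ", 0, "root"),
    (5, "吗", "PART", 4, "discourse:sp"),
]

def attachment_stats(sentence, tag):
    """Return (relations, parent tags, child degrees) for nodes tagged `tag`."""
    upos_of = {tid: upos for tid, _, upos, _, _ in sentence}
    # Number of children per node id, counted from the HEAD column.
    children = Counter(head for _, _, _, head, _ in sentence)
    relations, parents, degrees = Counter(), Counter(), []
    for tid, _, upos, head, deprel in sentence:
        if upos != tag:
            continue
        relations[deprel] += 1
        parents[upos_of.get(head, "ROOT")] += 1  # head 0 is the artificial root
        degrees.append(children[tid])
    return relations, parents, degrees

relations, parents, degrees = attachment_stats(SENTENCE, "PART")
leaves = sum(1 for d in degrees if d == 0)
print(dict(relations))       # {'case': 1, 'discourse:sp': 1}
print(dict(parents))         # {'PRON': 1, 'ADJ': 1}
print(leaves, max(degrees))  # 2 0
```

A node is a leaf when its child degree is 0, so a maximum child degree of 0 is equivalent to all nodes of that tag being leaves.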