
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-CFL: POS Tags: PART

There are 11 PART lemmas (1%), 11 PART types (1%) and 545 PART tokens (8%). Out of 15 observed tags, the rank of PART is: 14 in number of lemmas, 14 in number of types and 6 in number of tokens.
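The three counts above can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, assuming the standard ten tab-separated columns (ID, FORM, LEMMA, UPOS, ...); the function name `pos_counts` and the toy input are illustrative and not taken from the treebank or from the script that generated this page.

```python
def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        tok_id, form, lemma, tag = cols[0], cols[1], cols[2], cols[3]
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


# Invented toy input (not from UD_Chinese-CFL); HEAD and DEPREL are left as "_".
ROWS = [
    ["1", "我", "我", "PRON", "_", "_", "_", "_", "_", "_"],
    ["2", "的", "的", "PART", "_", "_", "_", "_", "_", "_"],
    ["3", "书", "书", "NOUN", "_", "_", "_", "_", "_", "_"],
    ["4", "和", "和", "CCONJ", "_", "_", "_", "_", "_", "_"],
    ["5", "你", "你", "PRON", "_", "_", "_", "_", "_", "_"],
    ["6", "的", "的", "PART", "_", "_", "_", "_", "_", "_"],
    ["7", "书", "书", "NOUN", "_", "_", "_", "_", "_", "_"],
    ["8", "吗", "吗", "PART", "_", "_", "_", "_", "_", "_"],
]
SAMPLE = "# text = toy example\n" + "\n".join("\t".join(r) for r in ROWS)

print(pos_counts(SAMPLE, "PART"))  # (2, 2, 3)
```

On the toy input, PART has 2 lemmas, 2 types and 3 tokens, because 的 occurs twice.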

The 10 most frequent PART lemmas: 的、 了、 地、 得、 没、 吧、 吗、 呢、 啊、 和

The 10 most frequent PART types: 的、 了、 地、 得、 没、 吧、 吗、 呢、 啊、 和

The 10 most frequent ambiguous lemmas: 的 (PART 409, NOUN 1), 了 (AUX 124, PART 80, VERB 1), 得 (PART 11, VERB 6, AUX 4), 没 (PART 9, ADV 2), 吧 (PART 4, INTJ 1), 和 (CCONJ 28, ADP 13, PART 1)

The 10 most frequent ambiguous types: 的 (PART 409, NOUN 1), 了 (AUX 124, PART 80, VERB 1), 得 (PART 11, VERB 6, AUX 3), 没 (PART 9, ADV 2), 吧 (PART 4, INTJ 1), 和 (CCONJ 28, ADP 13, PART 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.009709).
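As a rough illustration of the ratio, the sketch below assumes it is the number of distinct word forms divided by the number of distinct lemmas; that definition is an assumption, not confirmed by this page. The (form, lemma) pairs are the 11 PART items listed on this page, under the further assumption that each form is identical to its lemma, as the matching lemma and type lists suggest. The name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# The 11 PART lemmas named on this page; form == lemma is an assumption.
PART_ITEMS = ["的", "了", "地", "得", "没", "吧", "吗", "呢", "啊", "和", "嗬"]
pairs = [(item, item) for item in PART_ITEMS]

print(f"{form_lemma_ratio(pairs):.6f}")  # 1.000000
```

Under these assumptions, 11 forms over 11 lemmas gives 1.000000, matching the figure reported for PART.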

No PART lemma has more than one form. The highest number of forms observed for any lemma is 1, for example with the lemmas “了” (了), “吗” (吗) and “吧” (吧).

PART occurs with 1 feature: Polarity (9; 2% instances)

PART occurs with 1 feature-value pair: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (536 tokens). Examples: 的、 了、 地、 得、 吧、 吗、 呢、 啊、 和、 嗬

Relations

PART nodes are attached to their parents using 7 different relations: mark:rel (229; 42% instances), case (171; 31% instances), discourse:sp (100; 18% instances), mark:adv (25; 5% instances), compound:ext (10; 2% instances), advmod (9; 2% instances), cc (1; 0% instances)
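The percentages above follow from the raw counts. The sketch below recomputes them, assuming each share is the count divided by the total number of PART tokens and rounded to the nearest whole percent; the variable names are illustrative.

```python
# Relation counts for PART nodes, copied from the list above.
relations = {
    "mark:rel": 229,
    "case": 171,
    "discourse:sp": 100,
    "mark:adv": 25,
    "compound:ext": 10,
    "advmod": 9,
    "cc": 1,
}

total = sum(relations.values())  # should equal the 545 PART tokens
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The counts sum to 545, the number of PART tokens, so every PART node is covered by exactly one of the seven relations.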

Parents of PART nodes belong to 8 different parts of speech: VERB (195; 36% instances), ADJ (148; 27% instances), NOUN (87; 16% instances), PRON (84; 15% instances), PROPN (17; 3% instances), DET (7; 1% instances), ADV (6; 1% instances), ADP (1; 0% instances)

545 (100%) PART nodes are leaves.

The highest child degree of a PART node is 0.