home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: AUX

There are 30 AUX lemmas (2%), 32 AUX types (2%) and 471 AUX tokens (5%). Out of 16 observed tags, the rank of AUX is: 9 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent AUX lemmas: 是、 要、 了、 能、 可以、 會、 想、 過、 能夠、 應該

The 10 most frequent AUX types: 是、 要、 了、 能、 可以、 會、 想、 過、 能夠、 應該

The 10 most frequent ambiguous lemmas: 是 (AUX 108, VERB 77), 要 (AUX 66, VERB 11), 了 (PART 65, AUX 39, VERB 2), 可以 (AUX 35, VERB 3), 想 (AUX 28, VERB 3), 過 (AUX 17, VERB 3), 能夠 (AUX 15, VERB 1), 有 (VERB 122, AUX 10, ADJ 1, ADV 1), 別 (AUX 8, DET 1), 用 (AUX 8, VERB 7, ADP 1)

The 10 most frequent ambiguous types: 是 (AUX 108, VERB 77), 要 (AUX 60, VERB 11), 了 (PART 65, AUX 39, VERB 2), 可以 (AUX 35, VERB 3), 想 (AUX 28, VERB 3), 過 (AUX 17, VERB 3), 能夠 (AUX 15, VERB 1), 不用 (AUX 8, VERB 2), 別 (AUX 8, DET 1), 不要 (AUX 6, ADV 1)

Morphology

The form / lemma ratio of AUX is 1.066667 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (2) was observed with the lemma “有”: 有, 沒有.

The 2nd highest number of forms (2) was observed with the lemma “要”: 不要, 要.

The 3rd highest number of forms (1) was observed with the lemma “了”: 了.

AUX occurs with 1 features: Polarity (18; 4% instances)

AUX occurs with 1 feature-value pairs: Polarity=Neg

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (453 tokens). Examples: 是、 要、 了、 能、 可以、 會、 想、 過、 能夠、 應該

Relations

AUX nodes are attached to their parents using 10 different relations: aux (349; 74% instances), cop (92; 20% instances), root (21; 4% instances), aux:pass (2; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), conj (1; 0% instances), xcomp (1; 0% instances)

Parents of AUX nodes belong to 8 different parts of speech: VERB (336; 71% instances), NOUN (71; 15% instances), ADJ (22; 5% instances), (21; 4% instances), PROPN (10; 2% instances), PRON (8; 2% instances), ADV (2; 0% instances), ADP (1; 0% instances)

440 (93%) AUX nodes are leaves.

7 (1%) AUX nodes have one child.

11 (2%) AUX nodes have two children.

13 (3%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 10 different relations: punct (31; 41% instances), discourse:sp (20; 26% instances), parataxis (7; 9% instances), advmod (6; 8% instances), conj (4; 5% instances), nsubj (3; 4% instances), ccomp (2; 3% instances), obj (1; 1% instances), obl (1; 1% instances), xcomp (1; 1% instances)

Children of AUX nodes belong to 8 different parts of speech: PUNCT (31; 41% instances), PART (19; 25% instances), ADV (10; 13% instances), VERB (7; 9% instances), ADJ (3; 4% instances), PRON (3; 4% instances), NOUN (2; 3% instances), INTJ (1; 1% instances)