home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: AUX

There are 22 AUX lemmas (0%), 38 AUX types (1%) and 955 AUX tokens (4%). Out of 15 observed tags, the rank of AUX is: 13 in number of lemmas, 10 in number of types and 8 in number of tokens.

The 10 most frequent AUX lemmas: 了、 是、 被、 會、 能、 著、 可以、 為、 可能、 要

The 10 most frequent AUX types: 了、 是、 被、 會、 著、 能、 可以、 為、 可能、 要

The 10 most frequent ambiguous lemmas: 了 (AUX 339, PART 41), 是 (AUX 167, VERB 98), 被 (AUX 79, ADP 22, X 1), 會 (AUX 74, VERB 1), 為 (VERB 95, ADP 35, AUX 29), 可能 (AUX 24, NOUN 8, VERB 1), 過 (AUX 18, VERB 2), 必須 (AUX 10, VERB 1), 想 (AUX 10, VERB 2), 需要 (VERB 10, AUX 6)

The 10 most frequent ambiguous types: 了 (AUX 339, PART 41), 是 (AUX 148, VERB 67), 被 (AUX 79, ADP 22, X 1), 會 (AUX 66, VERB 1), 為 (VERB 63, ADP 35, AUX 29), 可能 (AUX 24, NOUN 8, VERB 1), 過 (AUX 18, VERB 2), 必須 (AUX 10, VERB 1), 想 (AUX 8, VERB 2), 需要 (VERB 10, AUX 6)

Morphology

The form / lemma ratio of AUX is 1.727273 (the average of all parts of speech is 1.006233).

The 1st highest number of forms (9) was observed with the lemma “是”: 不是, 也是, 仍是, 就是, 才是, 是, 是否是, 還是, 都是.

The 2nd highest number of forms (3) was observed with the lemma “會”: 不會, 會, 會不會.

The 3rd highest number of forms (3) was observed with the lemma “能”: 不能, 未能, 能.

AUX occurs with 4 features: Aspect (397; 42% instances), Voice (79; 8% instances), Polarity (35; 4% instances), Mood (2; 0% instances)

AUX occurs with 5 feature-value pairs: Aspect=Perf, Aspect=Prog, Mood=Int, Polarity=Neg, Voice=Pass

AUX occurs with 6 feature combinations. The most frequent feature combination is _ (442 tokens). Examples: 是、 會、 能、 可以、 為、 可能、 要、 可、 能夠、 必須

Relations

AUX nodes are attached to their parents using 3 different relations: aux (680; 71% instances), cop (196; 21% instances), aux:pass (79; 8% instances)

Parents of AUX nodes belong to 9 different parts of speech: VERB (754; 79% instances), NOUN (149; 16% instances), ADJ (22; 2% instances), NUM (11; 1% instances), PROPN (8; 1% instances), PRON (7; 1% instances), X (2; 0% instances), DET (1; 0% instances), PART (1; 0% instances)

953 (100%) AUX nodes are leaves.

2 (0%) AUX nodes have one child.

The highest child degree of a AUX node is 1.

Children of AUX nodes are attached using 1 different relations: punct (2; 100% instances)

Children of AUX nodes belong to 1 different parts of speech: PUNCT (2; 100% instances)