This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-Beginner: POS Tags: AUX

There are 16 AUX lemmas (1%), 16 AUX types (1%) and 1029 AUX tokens (5%). Out of 15 observed tags, the rank of AUX is: 13 in number of lemmas, 13 in number of types and 7 in number of tokens.
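As a rough illustration of how such counts can be obtained, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag from CoNLL-U text. The two sample sentences and the helper names `read_tokens` and `upos_counts` are hypothetical, made up for this example; they are not taken from the treebank or from the script that generated this page.

```python
# Hypothetical two-sentence sample in CoNLL-U format (10 tab-separated columns).
SAMPLE = "\n".join([
    "# text = 我想去",
    "1\t我\t我\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\t想\t想\tAUX\t_\t_\t3\taux\t_\t_",
    "3\t去\t去\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "# text = 他是学生",
    "1\t他\t他\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\t是\t是\tAUX\t_\t_\t3\tcop\t_\t_",
    "3\t学生\t学生\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
])


def read_tokens(conllu_text):
    """Yield the 10 CoNLL-U columns of every ordinary word line."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments and blank lines do not have 10 columns; multiword
        # ranges ("1-2") and empty nodes ("1.1") have non-integer IDs.
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for cols in read_tokens(conllu_text):
        if cols[3] == upos:        # column 4 is UPOS
            forms.add(cols[1])     # column 2 is FORM
            lemmas.add(cols[2])    # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(forms), tokens


print(upos_counts(SAMPLE, "AUX"))  # (2, 2, 2)
```

Run over the full treebank files instead of the sample, a count of this kind would be expected to give figures comparable to those above, assuming types are counted as distinct word forms.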

The 10 most frequent AUX lemmas: 是、 要、 了、 想、 会、 过、 可以、 能、 应该、 有

The 10 most frequent AUX types: 是、 要、 了、 想、 会、 过、 可以、 能、 应该、 有

The 10 most frequent ambiguous lemmas: 是 (AUX 277, VERB 48, CCONJ 2, ADV 1), 要 (AUX 205, VERB 1), 了 (PART 421, AUX 156, VERB 1), 想 (AUX 79, VERB 23), 会 (AUX 73, NOUN 19, VERB 8), 过 (AUX 58, VERB 14), 可以 (AUX 50, ADJ 10), 有 (VERB 180, AUX 17, ADJ 1, ADP 1, SCONJ 1), 得 (PART 61, AUX 12, ADV 2, VERB 1), 用 (VERB 21, AUX 10, NOUN 1)

The 10 most frequent ambiguous types: 是 (AUX 277, VERB 48, CCONJ 2, ADV 1), 要 (AUX 205, VERB 1), 了 (PART 421, AUX 156, VERB 1), 想 (AUX 79, VERB 23), 会 (AUX 73, NOUN 19, VERB 8), 过 (AUX 58, VERB 14), 可以 (AUX 50, ADJ 10), 有 (VERB 180, AUX 17, ADJ 1, ADP 1, SCONJ 1), 得 (PART 61, AUX 12, ADV 2, VERB 1), 用 (VERB 21, AUX 10, NOUN 1)

Morphology

The form / lemma ratio of AUX is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “了”: 了.

The 2nd highest number of forms (1) was observed with the lemma “会”: 会.

The 3rd highest number of forms (1) was observed with the lemma “可以”: 可以.

AUX occurs with 1 feature: Aspect (4; 0% instances)

AUX occurs with 1 feature-value pair: Aspect=Perf

AUX occurs with 2 feature combinations. The most frequent feature combination is _ (1025 tokens). Examples: 是、 要、 了、 想、 会、 过、 可以、 能、 应该、 有

Relations

AUX nodes are attached to their parents using 8 different relations: aux (713; 69% instances), cop (275; 27% instances), root (18; 2% instances), conj (12; 1% instances), parataxis (8; 1% instances), ccomp (1; 0% instances), csubj (1; 0% instances), flat (1; 0% instances)
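The percentages can be checked against the raw counts. The sketch below uses the relation counts listed above and assumes that each share is the count divided by the total number of AUX tokens, rounded to the nearest integer; that rounding rule is an assumption, not something this page states.

```python
# Relation counts for AUX nodes, copied from the list above.
AUX_RELATIONS = {
    "aux": 713, "cop": 275, "root": 18, "conj": 12,
    "parataxis": 8, "ccomp": 1, "csubj": 1, "flat": 1,
}


def percentages(counts):
    """Share of each relation, rounded to the nearest whole percent."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


total = sum(AUX_RELATIONS.values())
shares = percentages(AUX_RELATIONS)
print(total)   # 1029
print(shares)
```

Under that assumption the counts add up to the 1029 AUX tokens and the rounded shares agree with the percentages shown. Relations with a single instance round down to 0%.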

Parents of AUX nodes belong to 9 different parts of speech: VERB (749; 73% instances), NOUN (162; 16% instances), ADJ (60; 6% instances), artificial root, no POS (18; 2% instances), PRON (17; 2% instances), AUX (12; 1% instances), PROPN (6; 1% instances), NUM (3; 0% instances), PART (2; 0% instances)

976 (95%) AUX nodes are leaves.

23 (2%) AUX nodes have one child.

3 (0%) AUX nodes have two children.

27 (3%) AUX nodes have three or more children.

The highest child degree of an AUX node is 6.
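A child-degree breakdown like the one above can be derived from the HEAD column of a CoNLL-U file. The following is a minimal sketch under that assumption; the sample sentences and the function name `child_degrees` are hypothetical and are not the code used to produce this page.

```python
from collections import Counter

# Hypothetical sample: one AUX leaf (想) and one AUX root with a child (是).
SAMPLE = "\n".join([
    "# text = 我想去",
    "1\t我\t我\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\t想\t想\tAUX\t_\t_\t3\taux\t_\t_",
    "3\t去\t去\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "# text = 是。",
    "1\t是\t是\tAUX\t_\t_\t0\troot\t_\t_",
    "2\t。\t。\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
])


def child_degrees(conllu_text, upos):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    # Sentences are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()]
        # Keep ordinary word lines only (10 columns, integer ID).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # Column 7 is HEAD: count how often each ID is named as a head.
        n_children = Counter(r[6] for r in rows)
        degrees += [n_children[r[0]] for r in rows if r[3] == upos]
    return degrees


degrees = child_degrees(SAMPLE, "AUX")
# Bucket as on this page: 0 (leaf), 1, 2, and "3 or more" (stored as 3).
buckets = Counter(min(d, 3) for d in degrees)
print(degrees)       # [0, 1]
print(max(degrees))  # 1
```

Counting is done per sentence because token IDs restart at 1 in each sentence, so heads from different sentences must not be mixed.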

Children of AUX nodes are attached using 14 different relations: punct (30; 22% instances), advmod (23; 17% instances), obj (22; 16% instances), nsubj (21; 16% instances), conj (17; 13% instances), discourse:sp (6; 4% instances), ccomp (4; 3% instances), advcl (2; 1% instances), discourse (2; 1% instances), mark (2; 1% instances), obl:arg (2; 1% instances), cc (1; 1% instances), flat (1; 1% instances), obl:lmod (1; 1% instances)

Children of AUX nodes belong to 10 different parts of speech: PUNCT (30; 22% instances), ADV (23; 17% instances), NOUN (22; 16% instances), PRON (19; 14% instances), VERB (13; 10% instances), AUX (12; 9% instances), PART (10; 7% instances), PROPN (3; 2% instances), ADJ (1; 1% instances), CCONJ (1; 1% instances)