Treebank Statistics: UD_Japanese-PUD: POS Tags: AUX
There are 26 AUX
lemmas (1%), 55 AUX
types (1%) and 3322 AUX
tokens (12%).
Out of 16 observed tags, the rank of AUX
is: 7 in number of lemmas, 7 in number of types and 3 in number of tokens.
The 10 most frequent AUX
lemmas: た, 為る, だ, れる, ます, ない, られる, 出来る, 様, せる
The 10 most frequent AUX
types: た, し, で, れ, な, さ, する, に, ない, だ
The 10 most frequent ambiguous lemmas: た (AUX 940, SCONJ 1), 為る (AUX 864, VERB 70, SCONJ 2), だ (AUX 703, CCONJ 1), 出来る (AUX 44, VERB 14), 様 (AUX 42, NOUN 22), せる (AUX 32, SCONJ 1), です (AUX 30, CCONJ 1), ず (AUX 29, SCONJ 6), 無い (ADJ 43, AUX 27), そう (ADV 3, AUX 2)
The 10 most frequent ambiguous types: し (AUX 491, VERB 50, SCONJ 7), で (ADP 312, AUX 254, SCONJ 19), さ (AUX 182, PART 14, VERB 5), する (AUX 181, VERB 12), に (ADP 982, AUX 152, SCONJ 27, CCONJ 1), ない (AUX 90, ADJ 20), だ (AUX 73, CCONJ 1), よう (AUX 42, NOUN 22), なかっ (AUX 34, ADJ 5), できる (AUX 25, VERB 6)
- し
- で
- さ
- する
- に
- ない
- だ
- よう
- なかっ
- できる
Morphology
The form / lemma ratio of AUX
is 2.115385 (the average of all parts of speech is 1.068660).
The 1st highest number of forms (7) was observed with the lemma “だ”: だ, だっ, だろう, で, な, なら, に.
The 2nd highest number of forms (6) was observed with the lemma “為る”: さ, し, しよう, す, する, せ.
The 3rd highest number of forms (4) was observed with the lemma “た”: た, たら, たろう, だ.
AUX
occurs with 1 features: Polarity (144; 4% instances)
AUX
occurs with 1 feature-value pairs: Polarity=Neg
AUX
occurs with 2 feature combinations.
The most frequent feature combination is _
(3178 tokens).
Examples: た, し, で, れ, な, さ, する, に, だ, ます
Relations
AUX
nodes are attached to their parents using 8 different relations: aux (2763; 83% instances), cop (347; 10% instances), fixed (205; 6% instances), acl (2; 0% instances), root (2; 0% instances), appos (1; 0% instances), dep (1; 0% instances), nmod (1; 0% instances)
Parents of AUX
nodes belong to 12 different parts of speech: VERB (2369; 71% instances), NOUN (474; 14% instances), ADJ (285; 9% instances), ADP (80; 2% instances), SCONJ (45; 1% instances), AUX (36; 1% instances), ADV (9; 0% instances), PRON (8; 0% instances), PROPN (7; 0% instances), PART (6; 0% instances), (2; 0% instances), CCONJ (1; 0% instances)
3114 (94%) AUX
nodes are leaves.
181 (5%) AUX
nodes have one child.
14 (0%) AUX
nodes have two children.
13 (0%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 7.
Children of AUX
nodes are attached using 7 different relations: fixed (244; 95% instances), punct (4; 2% instances), dep (3; 1% instances), compound (2; 1% instances), nmod (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances)
Children of AUX
nodes belong to 6 different parts of speech: VERB (187; 73% instances), AUX (36; 14% instances), ADP (19; 7% instances), SCONJ (7; 3% instances), NOUN (4; 2% instances), PUNCT (4; 2% instances)