Treebank Statistics: UD_Japanese-GSD: POS Tags: AUX
There are 44 AUX
lemmas (0%), 131 AUX
types (1%) and 21158 AUX
tokens (11%).
Out of 16 observed tags, the rank of AUX
is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.
The 10 most frequent AUX
lemmas: た, 為る, だ, れる, ます, です, ない, られる, ず, 様
The 10 most frequent AUX
types: た, し, で, れ, さ, する, な, に, ます, です
The 10 most frequent ambiguous lemmas: た (AUX 5321, SCONJ 10), 為る (AUX 5115, VERB 675, SCONJ 12, CCONJ 3), だ (AUX 4004, CCONJ 31), です (AUX 828, CCONJ 2), ず (AUX 320, SCONJ 13), 様 (AUX 257, NOUN 172), 出来る (AUX 237, VERB 91), 無い (ADJ 325, AUX 188), そう (AUX 57, ADV 30), 頂く (AUX 52, VERB 15)
The 10 most frequent ambiguous types: た (AUX 5186, SCONJ 10), し (AUX 2933, VERB 417, SCONJ 54), で (ADP 2600, AUX 1641, SCONJ 163, CCONJ 24, VERB 2), さ (AUX 1095, VERB 98, PART 90), する (AUX 1029, VERB 144, CCONJ 3), な (AUX 974, PART 28, CCONJ 2), に (ADP 6428, AUX 808, SCONJ 137, CCONJ 3), です (AUX 652, CCONJ 2), ない (AUX 562, ADJ 161), だ (AUX 372, CCONJ 28)
- た
- し
- で
- さ
- する
- な
- に
- です
- ない
- だ
Morphology
The form / lemma ratio of AUX
is 2.977273 (the average of all parts of speech is 1.115220).
The 1st highest number of forms (10) was observed with the lemma “頂く”: いただい, いただき, いただく, いただけ, いただける, 頂い, 頂き, 頂く, 頂け, 頂ける.
The 2nd highest number of forms (9) was observed with the lemma “だ”: じゃ, だ, だっ, だろ, だろう, で, な, なら, に.
The 3rd highest number of forms (8) was observed with the lemma “為る”: さ, し, しよう, す, する, すれ, せ, せよ.
AUX
occurs with 1 features: Polarity (946; 4% instances)
AUX
occurs with 1 feature-value pairs: Polarity=Neg
AUX
occurs with 2 feature combinations.
The most frequent feature combination is _
(20212 tokens).
Examples: た, し, で, れ, さ, する, な, に, ます, です
Relations
AUX
nodes are attached to their parents using 5 different relations: aux (17235; 81% instances), cop (2441; 12% instances), fixed (1467; 7% instances), acl (8; 0% instances), root (7; 0% instances)
Parents of AUX
nodes belong to 13 different parts of speech: VERB (14646; 69% instances), NOUN (3200; 15% instances), ADJ (1707; 8% instances), SCONJ (582; 3% instances), ADP (562; 3% instances), AUX (207; 1% instances), ADV (110; 1% instances), PROPN (53; 0% instances), PRON (40; 0% instances), PART (29; 0% instances), NUM (12; 0% instances), (7; 0% instances), CCONJ (3; 0% instances)
20188 (95%) AUX
nodes are leaves.
790 (4%) AUX
nodes have one child.
140 (1%) AUX
nodes have two children.
40 (0%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 5.
Children of AUX
nodes are attached using 8 different relations: fixed (1196; 98% instances), nmod (7; 1% instances), punct (7; 1% instances), advcl (2; 0% instances), advmod (2; 0% instances), nsubj (2; 0% instances), mark (1; 0% instances), obl (1; 0% instances)
Children of AUX
nodes belong to 8 different parts of speech: VERB (817; 67% instances), AUX (207; 17% instances), ADP (160; 13% instances), SCONJ (14; 1% instances), NOUN (10; 1% instances), PUNCT (7; 1% instances), ADV (2; 0% instances), PART (1; 0% instances)