Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: AUX
There are 79 AUX lemmas (0%), 269 AUX types (1%) and 18393 AUX tokens (12%).
Out of 17 observed tags, the rank of AUX is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.
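The three counts above can be illustrated with a minimal sketch. It assumes that "lemmas" means distinct LEMMA values, "types" means distinct word forms, and "tokens" means occurrences, all restricted to one UPOS tag. The rows and the function name `pos_counts` are hypothetical toy material, not data or tooling from the treebank.

```python
# Toy (FORM, LEMMA, UPOS) rows; illustrative only, not taken from the treebank.
ROWS = [
    ("食べ", "食べる", "VERB"),
    ("まし", "ます", "AUX"),
    ("た", "た", "AUX"),
    ("行き", "行く", "VERB"),
    ("ます", "ます", "AUX"),
]

def pos_counts(rows, upos):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in rows:
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens

aux_counts = pos_counts(ROWS, "AUX")
print(aux_counts)
```

In the toy data the lemma ます appears under two forms (まし and ます), which is why the type count exceeds the lemma count.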
The 10 most frequent AUX lemmas: た, だ, ている, れる, ます, である, です, ない, られる, のだ
The 10 most frequent AUX types: た, れ, ている, な, てい, に, ます, です, である, まし
The 10 most frequent ambiguous lemmas: ている (AUX 2072, VERB 4), 様 (AUX 257, NOUN 137), そう (AUX 57, ADV 30), つう (ADP 18, AUX 7, PART 6), たり (PART 78, ADP 4, AUX 4), や (ADP 608, AUX 2, PART 1)
The 10 most frequent ambiguous types: ている (AUX 1193, VERB 2), な (AUX 973, PART 28), てい (AUX 814, VERB 2), に (ADP 5333, AUX 804, SCONJ 32, X 1), ない (AUX 411, ADJ 156), で (ADP 2583, AUX 403, SCONJ 62, VERB 2, CCONJ 1), よう (AUX 256, NOUN 128), せ (AUX 127, VERB 6), ん (AUX 110, SCONJ 3), なかっ (AUX 101, ADJ 26)
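As a rough sketch of how such a list could be derived, the snippet below treats a type as ambiguous when the same word form occurs with more than one UPOS tag, and reports per-tag counts in descending order. The rows and the function name `ambiguous_types` are hypothetical; the counts are toy values, not treebank figures.

```python
from collections import Counter, defaultdict

# Toy (FORM, UPOS) rows; illustrative only, not taken from the treebank.
ROWS = [
    ("に", "ADP"), ("に", "AUX"), ("に", "ADP"),
    ("た", "AUX"), ("た", "AUX"),
    ("ない", "AUX"), ("ない", "AUX"), ("ない", "ADJ"),
]

def ambiguous_types(rows):
    """Map each form seen with 2+ tags to its (tag, count) list, most frequent first."""
    by_form = defaultdict(Counter)
    for form, tag in rows:
        by_form[form][tag] += 1
    return {
        form: tags.most_common()
        for form, tags in by_form.items()
        if len(tags) > 1
    }

ambiguous = ambiguous_types(ROWS)
for form, tags in ambiguous.items():
    print(form, ", ".join(f"{tag} {n}" for tag, n in tags))
```

The same grouping applied to the LEMMA column instead of FORM would give the ambiguous lemmas.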
Morphology
The form / lemma ratio of AUX is 3.405063 (the average of all parts of speech is 1.095294).
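The reported ratio is consistent with dividing the number of AUX types by the number of AUX lemmas given at the top of the page. The check below assumes that this is how the figure is obtained; the variable names are illustrative.

```python
# Counts as reported above for AUX in this treebank.
aux_types = 269   # distinct word forms
aux_lemmas = 79   # distinct lemmas

# Assumption: form / lemma ratio = types / lemmas.
ratio = aux_types / aux_lemmas
print(f"{ratio:.6f}")
```

A ratio this far above the cross-POS average reflects the inflectional and orthographic variation listed below for lemmas such as ていく.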
The 1st highest number of forms (13) was observed with the lemma “ていく”: ていか, ていき, ていく, ていけ, ていける, ていこう, ていっ, てゆか, てゆく, て行き, て行く, でいく, でいっ.
The 2nd highest number of forms (11) was observed with the lemma “てもらう”: てもらい, てもらう, てもらえ, てもらえる, てもらおう, てもらっ, て貰い, て貰え, て貰っ, でもらい, でもらっ.
The 3rd highest number of forms (10) was observed with the lemma “ていただく”: ていただい, ていただき, ていただく, ていただけ, ていただける, て頂い, て頂き, て頂く, て頂け, て頂ける.
AUX occurs with 1 feature: Polarity (863; 5% instances)
AUX occurs with 1 feature-value pair: Polarity=Neg
AUX occurs with 2 feature combinations.
The most frequent feature combination is _ (17530 tokens).
Examples: た, れ, ている, な, てい, に, ます, です, である, まし
Relations
AUX nodes are attached to their parents using 4 different relations: aux (17151; 93% instances), cop (1212; 7% instances), fixed (22; 0% instances), root (8; 0% instances)
Parents of AUX nodes belong to 10 different parts of speech: VERB (13545; 74% instances), NOUN (2361; 13% instances), ADJ (2194; 12% instances), NUM (106; 1% instances), PROPN (74; 0% instances), ADV (47; 0% instances), PRON (35; 0% instances), AUX (22; 0% instances), (8; 0% instances), INTJ (1; 0% instances)
18369 (100%) AUX nodes are leaves.
9 (0%) AUX nodes have one child.
3 (0%) AUX nodes have two children.
12 (0%) AUX nodes have three or more children.
The highest child degree of an AUX node is 5.
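The child-degree figures above can be sketched as follows: for every node, count how many tokens name it as their HEAD, then look only at the nodes tagged AUX. The sentence below is a hypothetical toy example in (ID, HEAD, UPOS) form, and `child_degrees` is an illustrative name, not part of any UD tool.

```python
from collections import Counter

# Toy (ID, HEAD, UPOS) rows for one sentence; HEAD 0 marks the root.
# Illustrative only, not taken from the treebank.
SENT = [
    (1, 2, "NOUN"),
    (2, 0, "VERB"),
    (3, 2, "AUX"),
    (4, 3, "AUX"),
    (5, 2, "PUNCT"),
]

def child_degrees(sentence, upos):
    """Return the number of children of each node with the given UPOS tag."""
    children = Counter(head for _, head, _ in sentence)
    return [children[tid] for tid, _, tag in sentence if tag == upos]

degrees = child_degrees(SENT, "AUX")
leaves = degrees.count(0)
print(degrees, leaves, max(degrees))
```

Here token 3 has one child (token 4, for example via a fixed relation) and token 4 is a leaf. Over a whole treebank the degrees would be collected sentence by sentence, since token IDs restart in each sentence.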
Children of AUX nodes are attached using 8 different relations: fixed (28; 43% instances), punct (21; 32% instances), nmod (7; 11% instances), advcl (3; 5% instances), advmod (2; 3% instances), nsubj (2; 3% instances), mark (1; 2% instances), obl (1; 2% instances)
Children of AUX nodes belong to 10 different parts of speech: AUX (22; 34% instances), PUNCT (21; 32% instances), NOUN (8; 12% instances), SCONJ (4; 6% instances), VERB (3; 5% instances), ADP (2; 3% instances), ADV (2; 3% instances), NUM (1; 2% instances), PART (1; 2% instances), PROPN (1; 2% instances)