Treebank Statistics: UD_Japanese-GSD: POS Tags: VERB
There are 2831 VERB lemmas (13%), 4286 VERB types (18%) and 21226 VERB tokens (11%).
Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 3 in number of tokens.
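As an illustration of how lemma, type and token counts of this kind can be derived, here is a minimal sketch that reads CoNLL-U text and counts distinct lemmas, distinct word forms (types) and tokens for one UPOS tag. The function name `count_upos` and the three-sentence sample are hypothetical; the sample is made up for the example and is not taken from the treebank, and this is not necessarily how the statistics above were produced.

```python
from collections import namedtuple

Counts = namedtuple("Counts", "lemmas types tokens")


def count_upos(conllu_text, upos):
    """Count distinct lemmas, distinct forms (types) and tokens tagged `upos`."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return Counts(len(lemmas), len(types), tokens)


def _row(*cols):
    """Build one tab-separated CoNLL-U token line."""
    return "\t".join(cols)


# Made-up toy input (assumption: not real treebank data).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    _row("1", "行っ", "行く", "VERB", "_", "_", "0", "root", "_", "_"),
    _row("2", "た", "た", "AUX", "_", "_", "1", "aux", "_", "_"),
    "",
    "# sent_id = toy-2",
    _row("1", "行く", "行く", "VERB", "_", "_", "0", "root", "_", "_"),
    "",
    "# sent_id = toy-3",
    _row("1", "言っ", "言う", "VERB", "_", "_", "0", "root", "_", "_"),
    "",
])

verb_counts = count_upos(SAMPLE, "VERB")
print(verb_counts)
```

In the toy input, 行っ and 行く are two types of the single lemma 行く, which is why the type count exceeds the lemma count.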
The 10 most frequent VERB lemmas: 居る, 有る, 成る, 為る, 言う, 因る, 行く, 行う, 思う, 来る
The 10 most frequent VERB types: いる, ある, い, なっ, いう, し, あり, あっ, なる, おり
The 10 most frequent ambiguous lemmas: 為る (AUX 5115, VERB 675, SCONJ 12, CCONJ 3), 言う (VERB 665, SCONJ 5), 因る (VERB 428, CCONJ 2), 出来る (AUX 237, VERB 91), 使用 (VERB 79, NOUN 27), 利用 (VERB 78, NOUN 25), 参加 (VERB 41, NOUN 38), 存在 (VERB 41, NOUN 21), 発表 (VERB 36, NOUN 25), 登場 (NOUN 36, VERB 34)
The 10 most frequent ambiguous types: ある (VERB 1008, DET 33), し (AUX 2933, VERB 417, SCONJ 54), あり (VERB 327, NOUN 2), あっ (VERB 251, INTJ 2), なる (VERB 236, AUX 4), つい (VERB 169, ADV 1), なり (VERB 166, NOUN 2, AUX 1), する (AUX 1029, VERB 144, CCONJ 3), よっ (VERB 136, CCONJ 2), き (VERB 131, NOUN 1)
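The ambiguity lists above pair each form (or lemma) with every tag it was observed under. A minimal sketch of that grouping follows, assuming the input is a sequence of `(form, upos)` pairs. The helper name `ambiguous_forms` and the toy rows are hypothetical. Results are ordered by frequency under the tag of interest, which matches the ordering seen in the lists above.

```python
from collections import Counter, defaultdict


def ambiguous_forms(rows, upos):
    """Return forms seen with `upos` and at least one other tag.

    rows: iterable of (form, upos_tag) pairs.
    Result: list of (form, [(tag, count), ...]) sorted by the count
    under `upos`, highest first; tags within a form are sorted by count.
    """
    by_form = defaultdict(Counter)
    for form, tag in rows:
        by_form[form][tag] += 1
    ambiguous = [
        (form, tags) for form, tags in by_form.items()
        if upos in tags and len(tags) > 1
    ]
    ambiguous.sort(key=lambda item: -item[1][upos])
    return [(form, tags.most_common()) for form, tags in ambiguous]


# Made-up toy observations (assumption: not real treebank counts).
ROWS = (
    [("し", "AUX")] * 3
    + [("し", "VERB")]
    + [("ある", "VERB")] * 2
    + [("ある", "DET")]
    + [("行く", "VERB")]
)

result = ambiguous_forms(ROWS, "VERB")
for form, tags in result:
    print(form, tags)
```

Forms that occur only as VERB, such as 行く in the toy rows, are not ambiguous and are left out.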
Morphology
The form / lemma ratio of VERB is 1.513953 (the average of all parts of speech is 1.115220).
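The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of this page. The short check below reproduces that arithmetic; the variable names are illustrative only.

```python
verb_types = 4286   # distinct VERB word forms reported on this page
verb_lemmas = 2831  # distinct VERB lemmas reported on this page

# Average number of distinct forms per lemma.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # prints 1.513953
```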
The 1st highest number of forms (20) was observed with the lemma “取る”: とっ, とら, とり, とる, とろう, 取っ, 取ら, 取り, 取る, 取れ, 取ろう, 執っ, 執ら, 執り, 執る, 採ら, 採る, 撮ら, 撮り, 撮れ.
The 2nd highest number of forms (16) was observed with the lemma “行く”: いか, いき, いく, いけ, いける, いこう, いっ, ゆか, ゆく, 行か, 行き, 行く, 行け, 行ける, 行こう, 行っ.
The 3rd highest number of forms (16) was observed with the lemma “言う”: いい, いう, いえ, いえよう, いえる, いっ, いわ, 云う, 云える, 言い, 言う, 言え, 言えよう, 言える, 言っ, 言わ.
VERB does not occur with any features.
Relations
VERB nodes are attached to their parents using 14 different relations: fixed (5194; 24% instances), acl (5124; 24% instances), root (5097; 24% instances), advcl (4966; 23% instances), ccomp (272; 1% instances), compound (237; 1% instances), csubj (143; 1% instances), nmod (126; 1% instances), obl (40; 0% instances), obj (9; 0% instances), nsubj (7; 0% instances), csubj:outer (6; 0% instances), dep (3; 0% instances), nsubj:outer (2; 0% instances)
Parents of VERB nodes belong to 14 different parts of speech: NOUN (5488; 26% instances), none, i.e. the sentence root (5097; 24% instances), VERB (4901; 23% instances), SCONJ (2883; 14% instances), ADP (1378; 6% instances), AUX (817; 4% instances), ADJ (366; 2% instances), PROPN (190; 1% instances), ADV (50; 0% instances), PRON (26; 0% instances), PART (14; 0% instances), NUM (13; 0% instances), CCONJ (2; 0% instances), INTJ (1; 0% instances)
5698 (27%) VERB nodes are leaves.
1042 (5%) VERB nodes have one child.
2561 (12%) VERB nodes have two children.
11925 (56%) VERB nodes have three or more children.
The highest child degree of a VERB node is 12.
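The four child-degree figures above add up to the total VERB token count (5698 + 1042 + 2561 + 11925 = 21226). A minimal sketch of how such a breakdown could be computed is given below, assuming each sentence is available as a list of `(id, upos, head)` tuples. The function name `child_degree_buckets` and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Bucket nodes tagged `upos` by number of children: '0', '1', '2', '3+'.

    sentences: list of sentences, each a list of (id, upos_tag, head) tuples,
    where head is the id of the parent token (0 for the sentence root).
    """
    buckets = Counter()
    for sent in sentences:
        # Number of children per token id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                degree = n_children[tok_id]
                buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


# Made-up toy trees (assumption: not real treebank sentences).
SENTENCES = [
    # VERB (id 3) with three children: ids 1, 4 and 5.
    [(1, "NOUN", 3), (2, "ADP", 1), (3, "VERB", 0), (4, "AUX", 3), (5, "PUNCT", 3)],
    # VERB with no children (a leaf).
    [(1, "VERB", 0)],
    # VERB (id 2) with one child.
    [(1, "NOUN", 2), (2, "VERB", 0)],
]

buckets = child_degree_buckets(SENTENCES, "VERB")
print(dict(buckets))
```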
Children of VERB nodes are attached using 20 different relations: aux (14646; 24% instances), obl (11295; 18% instances), punct (8243; 13% instances), mark (6333; 10% instances), advcl (6323; 10% instances), nsubj (5435; 9% instances), obj (4941; 8% instances), case (1518; 2% instances), advmod (1486; 2% instances), cc (544; 1% instances), compound (526; 1% instances), ccomp (389; 1% instances), nsubj:outer (298; 0% instances), nmod (127; 0% instances), dep (18; 0% instances), amod (9; 0% instances), discourse (8; 0% instances), csubj (3; 0% instances), det (1; 0% instances), nummod (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (21201; 34% instances), AUX (14646; 24% instances), PUNCT (8243; 13% instances), SCONJ (5799; 9% instances), VERB (4901; 8% instances), ADV (1510; 2% instances), ADP (1499; 2% instances), PROPN (1398; 2% instances), ADJ (1150; 2% instances), PRON (573; 1% instances), PART (554; 1% instances), CCONJ (544; 1% instances), NUM (95; 0% instances), SYM (26; 0% instances), INTJ (4; 0% instances), DET (1; 0% instances)