Treebank Statistics: UD_Chinese-PUD: POS Tags: VERB
There are 1393 VERB lemmas (24%), 1408 VERB types (24%) and 3528 VERB tokens (16%).
Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.
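Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that "types" means distinct word forms and that multiword-token ranges and empty nodes are skipped; the function name `upos_counts` and the two-sentence `SAMPLE` are hypothetical and not taken from the treebank.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment for illustration only (10 tab-separated columns).
SAMPLE = (
    "1\t他\t他\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\t有\t有\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t書\t書\tNOUN\t_\t_\t2\tobj\t_\t_\n"
    "\n"
    "1\t她\t她\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\t沒有\t有\tVERB\tPolarity=Neg\t_\t0\troot\t_\t_\n"
)

def upos_counts(conllu_text):
    """Return {UPOS: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

counts = upos_counts(SAMPLE)
print(counts["VERB"])  # (lemmas, types, tokens) for VERB in the sample
```

Ranking a tag then amounts to sorting the tags by one of the three numbers and taking the position of VERB in that order.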
The 10 most frequent VERB lemmas: 有、 在、 是、 為、 於、 說、 開始、 來、 到、 讓
The 10 most frequent VERB types: 在、 有、 於、 是、 為、 說、 沒有、 開始、 來、 到
The 10 most frequent ambiguous lemmas: 在 (ADP 272, VERB 118, ADV 25), 是 (AUX 167, VERB 98), 為 (VERB 95, ADP 35, AUX 29), 於 (VERB 80, ADP 45), 開始 (VERB 27, NOUN 1), 來 (ADV 40, VERB 26, ADP 2), 到 (VERB 26, ADP 4, CCONJ 4), 由 (VERB 20, ADP 1), 發生 (VERB 19, NOUN 2), 建立 (VERB 16, NOUN 1)
The 10 most frequent ambiguous types: 在 (ADP 272, VERB 118, ADV 25), 於 (VERB 80, ADP 45), 是 (AUX 148, VERB 67), 為 (VERB 63, ADP 35, AUX 29), 開始 (VERB 27, NOUN 1), 來 (ADV 40, VERB 26, ADP 2), 到 (VERB 26, ADP 4, CCONJ 4), 由 (VERB 20, ADP 1), 發生 (VERB 19, NOUN 2), 建立 (VERB 16, NOUN 1)
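The ambiguity lists above pair each lemma or type with the tags it was observed with. A minimal sketch of that tabulation follows, assuming that an item counts as ambiguous when it occurs with VERB and at least one other tag, and that the list is ordered by VERB frequency (which is what the figures above suggest). The function name `ambiguous_items` and the input pairs are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, focus="VERB"):
    """Group (item, upos) pairs and keep items seen with `focus` plus another tag."""
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    ambiguous = {k: c for k, c in by_item.items() if focus in c and len(c) > 1}
    # Order by how often the item carries the focus tag, most frequent first.
    return sorted(ambiguous.items(), key=lambda kv: -kv[1][focus])

# Hypothetical observations, not treebank data.
pairs = [
    ("在", "ADP"), ("在", "ADP"), ("在", "VERB"),
    ("開始", "VERB"), ("開始", "VERB"), ("開始", "NOUN"),
    ("說", "VERB"),
]
result = ambiguous_items(pairs)
for item, tags in result:
    # most_common() lists the tags by descending frequency.
    print(item, ", ".join(f"{t} {n}" for t, n in tags.most_common()))
```

Passing (lemma, tag) pairs gives the ambiguous lemmas; passing (form, tag) pairs gives the ambiguous types.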
Morphology
The form / lemma ratio of VERB is 1.010768 (the average of all parts of speech is 1.006233).
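The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of the page. A small check, assuming that this is how the figure is derived:

```python
verb_types = 1408   # distinct VERB word forms reported above
verb_lemmas = 1393  # distinct VERB lemmas reported above

ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 1.010768
```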
The 1st highest number of forms (9) was observed with the lemma “是”: 不是, 也是, 像是, 就是, 是, 是不是, 是否, 正是, 都是.
The 2nd highest number of forms (6) was observed with the lemma “為”: 一分為二, 以為, 引以為豪, 為, 為期, 認為.
The 3rd highest number of forms (2) was observed with the lemma “有”: 有, 沒有.
VERB occurs with 2 features: Voice (90; 3% instances), Polarity (48; 1% instances)
VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau
VERB occurs with 3 feature combinations.
The most frequent feature combination is _ (3390 tokens).
Examples: 在、 有、 於、 是、 為、 說、 開始、 來、 到、 認為
Relations
VERB nodes are attached to their parents using 24 different relations: root (890; 25% instances), advcl (482; 14% instances), xcomp (443; 13% instances), acl:relcl (409; 12% instances), ccomp (336; 10% instances), dep (308; 9% instances), mark:prt (240; 7% instances), obj (84; 2% instances), nsubj (69; 2% instances), csubj (62; 2% instances), conj (51; 1% instances), compound (39; 1% instances), amod (36; 1% instances), acl (17; 0% instances), nmod (16; 0% instances), obl (15; 0% instances), appos (10; 0% instances), nsubj:pass (8; 0% instances), dislocated (5; 0% instances), parataxis (3; 0% instances), obl:patient (2; 0% instances), iobj (1; 0% instances), obl:agent (1; 0% instances), obl:tmod (1; 0% instances)
Parents of VERB nodes belong to 11 different parts of speech: VERB (1968; 56% instances), [no parent, i.e. root] (890; 25% instances), NOUN (530; 15% instances), ADJ (89; 3% instances), PROPN (23; 1% instances), ADP (10; 0% instances), X (7; 0% instances), PRON (4; 0% instances), NUM (3; 0% instances), PART (3; 0% instances), DET (1; 0% instances)
420 (12%) VERB nodes are leaves.
518 (15%) VERB nodes have one child.
605 (17%) VERB nodes have two children.
1985 (56%) VERB nodes have three or more children.
The highest child degree of a VERB node is 11.
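The child-degree figures above can be derived from the HEAD column of each sentence: a node's degree is the number of tokens whose head is that node. The sketch below assumes one sentence is given as `(id, upos, head)` tuples; the helper names and the example sentence are hypothetical. It also checks that the four reported counts sum to the 3528 VERB tokens and round to the stated percentages.

```python
from collections import Counter

def verb_child_degrees(sentence):
    """Return the number of children of each VERB node in one sentence."""
    n_children = Counter(head for _, _, head in sentence)
    # Counter yields 0 for ids that never appear as a head (leaves).
    return [n_children[i] for i, upos, _ in sentence if upos == "VERB"]

def bucket(degrees):
    """Group degrees into 0, 1, 2 and 3-or-more (stored under key 3)."""
    return Counter(min(d, 3) for d in degrees)

# Hypothetical tree: token 2 is the root VERB with four children,
# token 4 is a VERB leaf.
sentence = [(1, "PRON", 2), (2, "VERB", 0), (3, "NOUN", 2),
            (4, "VERB", 2), (5, "PUNCT", 2)]
degrees = verb_child_degrees(sentence)
print(degrees, dict(bucket(degrees)))

# Consistency check on the figures reported above.
reported = {0: 420, 1: 518, 2: 605, 3: 1985}
total = sum(reported.values())
percentages = {k: round(100 * v / total) for k, v in reported.items()}
print(total, percentages)
```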
Children of VERB nodes are attached using 39 different relations: punct (1763; 16% instances), nsubj (1535; 14% instances), obj (1438; 13% instances), advmod (998; 9% instances), aux (667; 6% instances), obl (651; 6% instances), xcomp (504; 5% instances), advcl (494; 5% instances), mark:rel (424; 4% instances), ccomp (402; 4% instances), dep (322; 3% instances), mark:prt (321; 3% instances), mark (269; 3% instances), obl:tmod (200; 2% instances), compound (109; 1% instances), aux:pass (79; 1% instances), discourse:sp (72; 1% instances), nsubj:pass (71; 1% instances), conj (51; 0% instances), csubj (45; 0% instances), obl:patient (39; 0% instances), cc (37; 0% instances), appos (30; 0% instances), mark:adv (22; 0% instances), obl:agent (22; 0% instances), clf (20; 0% instances), acl:relcl (19; 0% instances), amod (18; 0% instances), case (16; 0% instances), iobj (15; 0% instances), case:loc (10; 0% instances), flat:name (10; 0% instances), cop (8; 0% instances), det (8; 0% instances), nummod (7; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), discourse (1; 0% instances), vocative (1; 0% instances)
Children of VERB nodes belong to 15 different parts of speech: NOUN (2985; 28% instances), VERB (1968; 18% instances), PUNCT (1763; 16% instances), ADV (1056; 10% instances), AUX (754; 7% instances), PART (558; 5% instances), PROPN (524; 5% instances), PRON (465; 4% instances), ADP (265; 2% instances), ADJ (177; 2% instances), X (66; 1% instances), NUM (43; 0% instances), CCONJ (37; 0% instances), SCONJ (23; 0% instances), DET (18; 0% instances)