Treebank Statistics: UD_Chinese-Beginner: POS Tags: PRON
There are 26 PRON lemmas (1%), 26 PRON types (1%) and 2157 PRON tokens (11%).
Out of 15 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 5 in number of tokens.
The 10 most frequent PRON lemmas: 我、 你、 他、 我们、 她、 他们、 你们、 什么、 怎么、 谁
The 10 most frequent PRON types: 我、 你、 他、 我们、 她、 他们、 你们、 什么、 怎么、 谁
The 10 most frequent ambiguous lemmas: 什么 (PRON 58, ADV 4, NOUN 1), 怎么 (PRON 56, ADV 19), 自己 (PRON 9, NOUN 3, ADV 2), 干吗 (PRON 6, VERB 1), 它 (NOUN 3, PRON 2, PART 1), 其 (DET 1, PRON 1), 几 (NUM 41, DET 1, PRON 1), 别 (VERB 41, ADJ 8, PRON 1), 这么 (ADV 38, PRON 1)
The 10 most frequent ambiguous types: 什么 (PRON 58, ADV 4, NOUN 1), 怎么 (PRON 56, ADV 19), 自己 (PRON 9, NOUN 3, ADV 2), 干吗 (PRON 6, VERB 1), 它 (NOUN 3, PRON 2, PART 1), 其 (DET 1, PRON 1), 几 (NUM 41, DET 1, PRON 1), 别 (VERB 41, ADJ 8, PRON 1), 这么 (ADV 38, PRON 1)
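The ambiguity counts above pair each lemma with the UPOS tags it occurs with. A minimal sketch of how such counts could be derived from CoNLL-U data follows; the sample sentences and the function names `upos_by_lemma` and `ambiguous` are hypothetical and are not taken from the treebank or its tooling.

```python
from collections import Counter, defaultdict

# Hypothetical toy sample in CoNLL-U format (not taken from the treebank).
SAMPLE = """\
1\t你\t你\tPRON\t_\tPerson=2\t3\tnsubj\t_\t_
2\t怎么\t怎么\tADV\t_\t_\t3\tadvmod\t_\t_
3\t去\t去\tVERB\t_\t_\t0\troot\t_\t_

1\t怎么\t怎么\tPRON\t_\tPronType=Int\t0\troot\t_\t_
2\t了\t了\tPART\t_\t_\t1\tdiscourse\t_\t_
"""

def upos_by_lemma(conllu_text):
    """Map each lemma to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        lemma, upos = cols[2], cols[3]
        counts[lemma][upos] += 1
    return counts

def ambiguous(counts, tag):
    """Keep lemmas that occur with `tag` and with at least one other tag."""
    return {lemma: c for lemma, c in counts.items() if tag in c and len(c) > 1}

counts = upos_by_lemma(SAMPLE)
amb = ambiguous(counts, "PRON")
for lemma, c in amb.items():
    print(lemma, dict(c))
```

Counting types instead of lemmas would only require reading column 1 (FORM) in place of column 2 (LEMMA).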
Morphology
The form / lemma ratio of PRON is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “为什么”: 为什么.
The 2nd highest number of forms (1) was observed with the lemma “什么”: 什么.
The 3rd highest number of forms (1) was observed with the lemma “他”: 他.
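As a rough illustration, the sketch below assumes that the form / lemma ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. That reading is consistent with the 26 types and 26 lemmas reported above, but the exact definition used to generate these statistics is an assumption here. The helper `form_lemma_ratio` and both sample lists are hypothetical.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) for all tokens carrying one UPOS tag."""
    forms_per_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_per_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_per_lemma.values())
    return n_forms / len(forms_per_lemma)

# Hypothetical tokens without inflection: every lemma has exactly one form.
uninflected = [("我", "我"), ("我", "我"), ("你", "你"), ("什么", "什么")]

# Hypothetical contrast with an inflecting language: "I" has two forms.
inflected = [("I", "I"), ("me", "I"), ("you", "you")]

ratio_uninflected = form_lemma_ratio(uninflected)
ratio_inflected = form_lemma_ratio(inflected)
print(ratio_uninflected, ratio_inflected)
```

Under this reading, a ratio of exactly 1 means that no lemma was observed with more than one word form, which matches the three "highest number of forms" entries above, each of which is 1.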
PRON occurs with 3 features: Person (1959; 91% instances), Number (360; 17% instances), PronType (167; 8% instances)
PRON occurs with 6 feature-value pairs: Number=Plur, Person=1, Person=2, Person=3, PronType=Ind, PronType=Int
PRON occurs with 9 feature combinations.
The most frequent feature combination is Person=1 (634 tokens).
Examples: 我、 我们、 咱们
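Feature, feature-value and feature-combination counts like those above can be obtained by parsing the FEATS column of CoNLL-U, where pairs are separated by `|` and `_` marks an empty feature set. The following is a minimal sketch; the `FEATS` list is a hypothetical sample and `parse_feats` is an illustrative helper, not part of any UD tool.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Number=Plur|Person=1' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values of five PRON tokens (not actual treebank data).
FEATS = ["Person=1", "Number=Plur|Person=1", "Person=2", "PronType=Int", "_"]

feature_counts = Counter()      # how many tokens carry each feature
pair_counts = Counter()         # how many tokens carry each feature=value pair
combination_counts = Counter()  # how many tokens carry each full combination

for feats in FEATS:
    parsed = parse_feats(feats)
    feature_counts.update(parsed.keys())
    pair_counts.update(f"{name}={value}" for name, value in parsed.items())
    combination_counts[feats] += 1

print(feature_counts)
print(pair_counts)
print(combination_counts.most_common(1))
```

Note that the empty feature set `_` counts as a combination of its own in this sketch; whether the statistics above count it the same way is not stated in the source.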
Relations
PRON nodes are attached to their parents using 13 different relations: nsubj (1390; 64% instances), nmod (397; 18% instances), obj (185; 9% instances), obl (77; 4% instances), obl:arg (73; 3% instances), root (12; 1% instances), det (10; 0% instances), ccomp (5; 0% instances), conj (3; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), iobj (1; 0% instances), nsubj:outer (1; 0% instances)
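The percentages attached to the relation counts can be reproduced from the counts themselves. The sketch below copies the 13 counts listed above and divides each by the 2157 PRON tokens; rounding to whole percent with `round` is an assumption about how the figures were produced, and the `shares` helper is hypothetical.

```python
# Relation counts copied from the statistics above.
RELATION_COUNTS = {
    "nsubj": 1390, "nmod": 397, "obj": 185, "obl": 77, "obl:arg": 73,
    "root": 12, "det": 10, "ccomp": 5, "conj": 3, "parataxis": 2,
    "appos": 1, "iobj": 1, "nsubj:outer": 1,
}
TOTAL_PRON_TOKENS = 2157

def shares(counts, total):
    """Return each count as a percentage of total, rounded to whole percent."""
    return {rel: round(100 * n / total) for rel, n in counts.items()}

pct = shares(RELATION_COUNTS, TOTAL_PRON_TOKENS)
for rel, n in RELATION_COUNTS.items():
    print(f"{rel} ({n}; {pct[rel]}% instances)")
```

The counts sum to 2157, so every PRON token is covered by exactly one incoming relation, and relations with fewer than about 11 instances round down to 0%.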
Parents of PRON nodes belong to 9 different parts of speech: VERB (1498; 69% instances), NOUN (491; 23% instances), ADJ (125; 6% instances), AUX (19; 1% instances), [root] (12; 1% instances), PRON (7; 0% instances), NUM (2; 0% instances), PROPN (2; 0% instances), PART (1; 0% instances)
1898 (88%) PRON nodes are leaves.
234 (11%) PRON nodes have one child.
11 (1%) PRON nodes have two children.
14 (1%) PRON nodes have three or more children.
The highest child degree of a PRON node is 7.
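The leaf and child-degree figures above follow from the HEAD column alone: a node's child degree is the number of tokens in the same sentence whose HEAD points to it. A minimal sketch under that reading is given below; the sentence and the `child_degrees` helper are hypothetical illustrations.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
SENTENCE = [
    (1, "PRON", 2),  # 他
    (2, "VERB", 0),  # 看
    (3, "PRON", 5),  # 我
    (4, "PART", 3),  # 的
    (5, "NOUN", 2),  # 书
]

def child_degrees(sentence, tag):
    """Count how many nodes with UPOS `tag` have 0, 1, 2, ... children."""
    # A Counter returns 0 for ids that never appear as a head, i.e. leaves.
    n_children = Counter(head for _, _, head in sentence)
    return Counter(
        n_children[tok_id] for tok_id, upos, _ in sentence if upos == tag
    )

degrees = child_degrees(SENTENCE, "PRON")
print(degrees)               # distribution of child degrees for PRON nodes
print(max(degrees, default=0))  # highest child degree observed
```

For a whole treebank the same function would be applied per sentence and the resulting counters summed, since token ids restart at 1 in every sentence.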
Children of PRON nodes are attached using 23 different relations: case (211; 67% instances), conj (21; 7% instances), cop (16; 5% instances), punct (15; 5% instances), nsubj (13; 4% instances), appos (8; 3% instances), advmod (6; 2% instances), cc (3; 1% instances), obl:tmod (3; 1% instances), acl (2; 1% instances), nmod (2; 1% instances), obl:arg (2; 1% instances), parataxis (2; 1% instances), advcl (1; 0% instances), amod (1; 0% instances), aux (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances), nsubj:outer (1; 0% instances), obj (1; 0% instances), vocative (1; 0% instances)
Children of PRON nodes belong to 13 different parts of speech: PART (137; 44% instances), ADP (74; 24% instances), NOUN (40; 13% instances), AUX (17; 5% instances), PUNCT (15; 5% instances), ADV (9; 3% instances), VERB (8; 3% instances), PRON (7; 2% instances), CCONJ (3; 1% instances), ADJ (1; 0% instances), DET (1; 0% instances), PROPN (1; 0% instances), SCONJ (1; 0% instances)