home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: PRON

There are 33 PRON lemmas (1%), 37 PRON types (1%) and 710 PRON tokens (3%). Out of 15 observed tags, the rank of PRON is: 10 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: 他、 我、 她、 它、 這、 其、 自己、 此、 你、 什麼

The 10 most frequent PRON types: 他、 他們、 她、 我、 這、 其、 它、 我們、 自己、 此

The 10 most frequent ambiguous lemmas: 這 (DET 107, PRON 45), 此 (PRON 20, DET 9), 之 (PART 21, PRON 6), 那 (DET 21, PRON 6, ADV 2), 怎麼 (ADV 2, PRON 2), 個人 (ADJ 2, PRON 1)

The 10 most frequent ambiguous types: 這 (DET 107, PRON 45), 此 (PRON 20, DET 9), 之 (PART 21, PRON 6), 那 (DET 21, PRON 6, ADV 2), 怎麼 (ADV 2, PRON 2), 個人 (ADJ 2, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.121212 (the average of all parts of speech is 1.006233).

The 1st highest number of forms (2) was observed with the lemma “他”: 他, 他們.

The 2nd highest number of forms (2) was observed with the lemma “她”: 她, 她們.

The 3rd highest number of forms (2) was observed with the lemma “它”: 它, 它們.

PRON occurs with 2 features: Person (543; 76% instances), Number (144; 20% instances)

PRON occurs with 4 feature-value pairs: Number=Plur, Person=1, Person=2, Person=3

PRON occurs with 7 feature combinations. The most frequent feature combination is Person=3 (323 tokens). Examples: 他、 她、 其、 他們、 它、 它們、 她們、 牠們

Relations

PRON nodes are attached to their parents using 15 different relations: nsubj (393; 55% instances), nmod (105; 15% instances), compound (73; 10% instances), obj (53; 7% instances), obl (49; 7% instances), nsubj:pass (9; 1% instances), appos (8; 1% instances), obl:patient (6; 1% instances), det (3; 0% instances), root (3; 0% instances), ccomp (2; 0% instances), obl:agent (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), iobj (1; 0% instances)

Parents of PRON nodes belong to 10 different parts of speech: VERB (465; 65% instances), NOUN (212; 30% instances), ADJ (17; 2% instances), ADP (3; 0% instances), PRON (3; 0% instances), PROPN (3; 0% instances), (3; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)

529 (75%) PRON nodes are leaves.

163 (23%) PRON nodes have one child.

11 (2%) PRON nodes have two children.

7 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 13 different relations: case (154; 74% instances), punct (11; 5% instances), case:loc (9; 4% instances), appos (7; 3% instances), cop (7; 3% instances), nsubj (7; 3% instances), conj (5; 2% instances), acl:relcl (3; 1% instances), mark (2; 1% instances), acl (1; 0% instances), advmod (1; 0% instances), det (1; 0% instances), mark:rel (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: PART (108; 52% instances), ADP (56; 27% instances), NOUN (14; 7% instances), PUNCT (11; 5% instances), AUX (7; 3% instances), VERB (4; 2% instances), PRON (3; 1% instances), SCONJ (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), PROPN (1; 0% instances), X (1; 0% instances)