home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: DET

There are 38 DET lemmas (2%), 38 DET types (2%) and 260 DET tokens (3%). Out of 16 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: 這、 那、 什麼、 其他、 每、 任何、 這些、 所有、 多、 這個

The 10 most frequent DET types: 這、 那、 什麼、 其他、 每、 任何、 這些、 所有、 多、 這個

The 10 most frequent ambiguous lemmas: 這 (DET 52, PRON 19), 那 (DET 28, ADV 9, PRON 3), 什麼 (DET 22, PRON 10), 這些 (DET 13, PRON 3), 多 (DET 8, ADJ 5, ADV 4), 這個 (DET 8, PRON 1), 下 (VERB 10, DET 6, ADP 5, ADV 4), 此 (DET 6, PRON 2), 甚麼 (DET 5, PRON 4, NOUN 1), 一些 (DET 4, ADV 1)

The 10 most frequent ambiguous types: 這 (DET 52, PRON 19), 那 (DET 28, ADV 9, PRON 3), 什麼 (DET 22, PRON 10), 這些 (DET 13, PRON 3), 多 (DET 8, ADJ 5, ADV 4), 這個 (DET 8, PRON 1), 下 (VERB 10, DET 6, ADP 5, ADV 4), 此 (DET 6, PRON 2), 甚麼 (DET 5, PRON 4, NOUN 1), 一些 (DET 4, ADV 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (1) was observed with the lemma “一些”: 一些.

The 2nd highest number of forms (1) was observed with the lemma “上”: 上.

The 3rd highest number of forms (1) was observed with the lemma “下”: 下.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 8 different relations: det (251; 97% instances), amod (2; 1% instances), obj (2; 1% instances), conj (1; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (247; 95% instances), VERB (7; 3% instances), PROPN (3; 1% instances), ADJ (1; 0% instances), NUM (1; 0% instances), (1; 0% instances)

210 (81%) DET nodes are leaves.

49 (19%) DET nodes have one child.

0 (0%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 7 different relations: clf (43; 83% instances), acl (2; 4% instances), advmod (2; 4% instances), discourse:sp (2; 4% instances), cc (1; 2% instances), nummod (1; 2% instances), punct (1; 2% instances)

Children of DET nodes belong to 7 different parts of speech: NOUN (43; 83% instances), ADV (2; 4% instances), PART (2; 4% instances), VERB (2; 4% instances), CCONJ (1; 2% instances), NUM (1; 2% instances), PUNCT (1; 2% instances)