Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: DET
There are 17 DET
lemmas (2%), 17 DET
types (2%) and 84 DET
tokens (4%).
Out of 15 observed tags, the rank of DET
is: 8 in number of lemmas, 8 in number of types and 7 in number of tokens.
The 10 most frequent DET
lemmas: những, này, các, đó, bất cứ, mọi, cả, một vài, cả hai, tất cả
The 10 most frequent DET
types: những, này, các, đó, bất cứ, mọi, cả, một vài, cả hai, tất cả
The 10 most frequent ambiguous lemmas: đó (DET 10, PRON 10), ai (PRON 6, DET 1), kia (DET 1, PRON 1), nay (ADV 1, DET 1), từng (ADV 1, DET 1)
The 10 most frequent ambiguous types: đó (DET 10, PRON 8), ai (PRON 6, DET 1), kia (DET 1, PRON 1), nay (ADV 1, DET 1), từng (ADV 1, DET 1)
- đó
- ai
- kia
- nay
- từng
Morphology
The form / lemma ratio of DET
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “ai”: ai.
The 2nd highest number of forms (1) was observed with the lemma “bất cứ”: bất cứ.
The 3rd highest number of forms (1) was observed with the lemma “các”: các.
DET
occurs with 3 features: PronType (36; 43% instances), Number (12; 14% instances), Deixis (11; 13% instances)
DET
occurs with 7 feature-value pairs: Deixis=Prox
, Deixis=Remt
, Number=Plur
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Tot
DET
occurs with 8 feature combinations.
The most frequent feature combination is _
(36 tokens).
Examples: những, bất cứ, mọi, cả, mấy, từng
Relations
DET
nodes are attached to their parents using 4 different relations: det (81; 96% instances), compound (1; 1% instances), nmod:poss (1; 1% instances), nsubj (1; 1% instances)
Parents of DET
nodes belong to 2 different parts of speech: NOUN (78; 93% instances), PRON (6; 7% instances)
81 (96%) DET
nodes are leaves.
3 (4%) DET
nodes have one child.
The highest child degree of a DET
node is 1.
Children of DET
nodes are attached using 1 different relations: clf (3; 100% instances)
Children of DET
nodes belong to 1 different parts of speech: NOUN (3; 100% instances)