Treebank Statistics: UD_Xavante-XDT: POS Tags: DET
There are 4 DET
lemmas (1%), 5 DET
types (1%) and 65 DET
tokens (4%).
Out of 15 observed tags, the rank of DET
is: 11 in number of lemmas, 10 in number of types and 8 in number of tokens.
The 10 most frequent DET
lemmas: hã, ã, tahata, õ
The 10 most frequent DET
types: hã, Tahata, Ãhã, Õhõ, ã
The 10 most frequent ambiguous lemmas: hã (DET 61, PART 27), õ (PART 15, ADV 4, DET 1)
The 10 most frequent ambiguous types: hã (DET 61, PART 28), Ãhã (DET 1, PRON 1)
- hã
- Ãhã
Morphology
The form / lemma ratio of DET
is 1.250000 (the average of all parts of speech is 1.290598).
The 1st highest number of forms (2) was observed with the lemma “ã”: Ãhã, ã.
The 2nd highest number of forms (1) was observed with the lemma “hã”: hã.
The 3rd highest number of forms (1) was observed with the lemma “tahata”: Tahata.
DET
occurs with 2 features: Deixis (3; 5% instances), Emph (2; 3% instances)
DET
occurs with 3 feature-value pairs: Deixis=Prox
, Deixis=Remt
, Emph=Yes
DET
occurs with 4 feature combinations.
The most frequent feature combination is _
(62 tokens).
Examples: hã, Tahata
Relations
DET
nodes are attached to their parents using 2 different relations: det (64; 98% instances), nsubj (1; 2% instances)
Parents of DET
nodes belong to 3 different parts of speech: NOUN (61; 94% instances), VERB (3; 5% instances), PROPN (1; 2% instances)
65 (100%) DET
nodes are leaves.
The highest child degree of a DET
node is 0.