Treebank Statistics: UD_Akuntsu-TuDeT: POS Tags: DET
There are 9 DET
lemmas (2%), 12 DET
types (2%) and 45 DET
tokens (3%).
Out of 13 observed tags, the rank of DET
is: 6 in number of lemmas, 7 in number of types and 8 in number of tokens.
The 10 most frequent DET
lemmas: jẽ, ke, jẽrom, ẽ, eme, jõ, ta, atʃo, ẽrom
The 10 most frequent DET
types: jẽ, ke, jẽrom, kebõ, ẽ, eme, jõ, atʃorom, jẽbõ, ta
The 10 most frequent ambiguous lemmas: jẽ (DET 16, NOUN 1), ẽ (DET 4, INTJ 2), jõ (ADV 2, DET 2), ta (DET 2, VERB 2), atʃo (VERB 4, DET 1), ẽrom (DET 1, PRON 1)
The 10 most frequent ambiguous types: ẽ (DET 4, INTJ 2), jõ (ADV 2, DET 2), atʃorom (DET 1, VERB 1), ta (DET 1, VERB 1)
- ẽ
- jõ
- atʃorom
- ta
Morphology
The form / lemma ratio of DET
is 1.333333 (the average of all parts of speech is 1.442049).
The 1st highest number of forms (3) was observed with the lemma “jẽ”: jẽ, jẽbõ, jẽrom.
The 2nd highest number of forms (2) was observed with the lemma “ke”: ke, kebõ.
The 3rd highest number of forms (2) was observed with the lemma “ta”: ta, tarom.
DET
occurs with 2 features: Deixis (42; 93% instances), Case (6; 13% instances)
DET
occurs with 4 feature-value pairs: Case=All
, Case=Dat
, Deixis=Dist
, Deixis=Prox
DET
occurs with 6 feature combinations.
The most frequent feature combination is Deixis=Prox
(20 tokens).
Examples: jẽ, ẽ, eme, kebõ, jẽbõ
Relations
DET
nodes are attached to their parents using 8 different relations: nsubj (16; 36% instances), nmod (15; 33% instances), parataxis (5; 11% instances), obl (3; 7% instances), advmod (2; 4% instances), det (2; 4% instances), appos (1; 2% instances), dislocated (1; 2% instances)
Parents of DET
nodes belong to 6 different parts of speech: NOUN (26; 58% instances), VERB (15; 33% instances), ADV (1; 2% instances), NUM (1; 2% instances), PRON (1; 2% instances), PROPN (1; 2% instances)
34 (76%) DET
nodes are leaves.
9 (20%) DET
nodes have one child.
2 (4%) DET
nodes have two children.
The highest child degree of a DET
node is 2.
Children of DET
nodes are attached using 6 different relations: punct (4; 31% instances), advmod (3; 23% instances), det (2; 15% instances), dislocated (2; 15% instances), nsubj (1; 8% instances), nummod (1; 8% instances)
Children of DET
nodes belong to 5 different parts of speech: PUNCT (4; 31% instances), ADV (3; 23% instances), PRON (3; 23% instances), NOUN (2; 15% instances), NUM (1; 8% instances)