home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cappadocian-AMGiC: POS Tags: DET

There are 8 DET lemmas (4%), 12 DET types (5%) and 29 DET tokens (6%). Out of 15 observed tags, the rank of DET is: 7 in number of lemmas, 5 in number of types and 7 in number of tokens.

The 10 most frequent DET lemmas: (ο), (o), tiás, tu, téna, xer, énas, ís

The 10 most frequent DET types: tu, ta, čin, t, éna, či, da, tes, tiyá, tus

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: tu (DET 8, PRON 5), ta (DET 5, PRON 3), či (PRON 4, DET 2), da (PRON 4, DET 1), tus (DET 1, PRON 1)

Morphology

The form / lemma ratio of DET is 1.500000 (the average of all parts of speech is 1.238806).

The 1st highest number of forms (6) was observed with the lemma “(o)”: da, t, ta, tes, tu, čin.

The 2nd highest number of forms (5) was observed with the lemma “(ο)”: ta, tu, tus, či, čin.

The 3rd highest number of forms (1) was observed with the lemma “tiás”: tiyá.

DET occurs with 5 features: Case (28; 97% instances), Gender (28; 97% instances), Number (28; 97% instances), Definite (27; 93% instances), PronType (27; 93% instances)

DET occurs with 11 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, PronType=Art

DET occurs with 11 feature combinations. The most frequent feature combination is Case=Acc|Definite=Def|Gender=Neut|Number=Sing|PronType=Art (8 tokens). Examples: tu, t, da, ta

Relations

DET nodes are attached to their parents using 1 different relations: det (29; 100% instances)

Parents of DET nodes belong to 1 different parts of speech: NOUN (29; 100% instances)

29 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.