home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: DET

There are 15 DET lemmas (1%), 29 DET types (1%) and 1202 DET tokens (12%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 11 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: an, un, e, ho, ma, o, he, holl, bep, hon

The 10 most frequent DET types: ar, an, ur, r, al, un, e, ho, n, o

The 10 most frequent ambiguous lemmas: an (DET 862, X 2), e (ADP 244, AUX 214, DET 37), ma (DET 27, SCONJ 17, X 8), o (AUX 40, DET 22, INTJ 2), holl (DET 14, PRON 8, ADV 1), a (AUX 303, ADP 57, DET 6, PRON 5, X 2), pep (DET 5, X 3), da (ADP 247, X 10, DET 3), kement (DET 2, SCONJ 1), peseurt (ADJ 1, DET 1)

The 10 most frequent ambiguous types: ar (DET 384, X 12), an (DET 258, X 4), ur (DET 62, X 8), e (AUX 208, ADP 173, DET 33, PRON 2), o (AUX 34, DET 22, PRON 17), he (DET 16, PRON 2), ma (DET 14, SCONJ 8, X 8), holl (DET 14, PRON 8, ADV 1), a (AUX 296, ADP 55, DET 6, PRON 5, X 2), pep (DET 4, X 3)

Morphology

The form / lemma ratio of DET is 1.933333 (the average of all parts of speech is 1.395664).

The 1st highest number of forms (7) was observed with the lemma “an”: ‘r, al, an, ar, l, n, r.

The 2nd highest number of forms (4) was observed with the lemma “ma”: am, m, ma, va.

The 3rd highest number of forms (4) was observed with the lemma “un”: ‘n, ul, un, ur.

DET occurs with 4 features: Poss (152; 13% instances), Gender[psor] (54; 4% instances), Number (20; 2% instances), PronType (1; 0% instances)

DET occurs with 6 feature-value pairs: Gender[psor]=Fem, Gender[psor]=Masc, Number=Plur, Number=Sing, Poss=Yes, PronType=Int

DET occurs with 7 feature combinations. The most frequent feature combination is _ (1029 tokens). Examples: ar, an, ur, r, al, un, n, holl, ul, ‘n

Relations

DET nodes are attached to their parents using 2 different relations: det (1201; 100% instances), nmod (1; 0% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (1159; 96% instances), PROPN (12; 1% instances), ADJ (9; 1% instances), NUM (9; 1% instances), PRON (9; 1% instances), VERB (4; 0% instances)

1186 (99%) DET nodes are leaves.

15 (1%) DET nodes have one child.

1 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 5 different relations: case (7; 41% instances), dep (6; 35% instances), cc (2; 12% instances), nmod:gen (1; 6% instances), punct (1; 6% instances)

Children of DET nodes belong to 5 different parts of speech: ADP (7; 41% instances), X (6; 35% instances), CCONJ (2; 12% instances), PROPN (1; 6% instances), PUNCT (1; 6% instances)