This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-TSA: POS Tags: AUX

There are 3 AUX lemmas (1%), 7 AUX types (1%) and 37 AUX tokens (4%). Out of 14 observed tags, the rank of AUX is: 11 in number of lemmas, 9 in number of types and 8 in number of tokens.

The most frequent AUX lemmas (all 3 observed): jam, kam, u

The most frequent AUX types (all 7 observed): janë, është, u, kanë, ishte, ka, qenë

The 10 most frequent ambiguous lemmas: kam (VERB 8, AUX 7)

The 10 most frequent ambiguous types: u (AUX 7, PRON 1), kanë (AUX 5, VERB 2), ka (AUX 2, VERB 2)

Morphology

The form / lemma ratio of AUX is 2.333333 (the average of all parts of speech is 1.167464).

The 1st highest number of forms (4) was observed with the lemma “jam”: ishte, janë, qenë, është.

The 2nd highest number of forms (2) was observed with the lemma “kam”: ka, kanë.

The 3rd highest number of forms (1) was observed with the lemma “u”: u.
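
The form / lemma ratio reported in this section can be reproduced from the per-lemma form lists. The following is a minimal sketch, assuming the ratio is the total number of distinct forms divided by the number of lemmas; the function name `form_lemma_ratio` is hypothetical and is not taken from the tool that generated this page.

```python
# Forms observed per AUX lemma, copied from the statistics on this page.
forms_by_lemma = {
    "jam": ["ishte", "janë", "qenë", "është"],
    "kam": ["ka", "kanë"],
    "u": ["u"],
}


def form_lemma_ratio(forms_by_lemma):
    """Return the number of distinct forms divided by the number of lemmas.

    Assumption: this mirrors how the page computes its ratio.
    """
    n_forms = sum(len(set(forms)) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


ratio = form_lemma_ratio(forms_by_lemma)
print(f"{ratio:.6f}")  # 7 forms / 3 lemmas
```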

AUX occurs with 7 features: Aspect (28; 76% of instances), Mood (28; 76% of instances), Number (28; 76% of instances), Person (28; 76% of instances), Tense (28; 76% of instances), Voice (28; 76% of instances), VerbForm (2; 5% of instances)

AUX occurs with 9 feature-value pairs: Aspect=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=3, Tense=Past, Tense=Pres, VerbForm=Part, Voice=Act

AUX occurs with 5 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act (15 tokens). Examples: janë, kanë
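
A feature combination such as the one above is stored in the FEATS column of a CoNLL-U file as `Feature=Value` pairs joined by `|`, with `_` standing for an empty set. A minimal sketch of splitting such a string into individual features follows; the helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a feature -> value dict."""
    if feats == "_":  # "_" marks a token without features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Most frequent AUX feature combination, copied from this page.
combo = "Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act"
feats = parse_feats(combo)

print(sorted(feats))   # feature names in the combination
print(feats["Tense"])  # value of a single feature
```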

Relations

AUX nodes are attached to their parents using 2 different relations: cop (20; 54% of instances), aux (17; 46% of instances)

Parents of AUX nodes belong to 6 different parts of speech: VERB (15; 41% of instances), ADJ (9; 24% of instances), NOUN (9; 24% of instances), PRON (2; 5% of instances), ADV (1; 3% of instances), DET (1; 3% of instances)
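
The percentages in this section are each count's share of the 37 AUX tokens, rounded to a whole number. A minimal sketch that recomputes them from the raw counts is given below; the function name `shares` is hypothetical, and rounding to the nearest integer is an assumption about how the page was generated.

```python
def shares(counts):
    """Map each key to its share of the total, as a whole-number percentage."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


# Counts copied from this page.
relation_counts = {"cop": 20, "aux": 17}
parent_pos_counts = {"VERB": 15, "ADJ": 9, "NOUN": 9, "PRON": 2, "ADV": 1, "DET": 1}

relation_shares = shares(relation_counts)
parent_shares = shares(parent_pos_counts)

print(relation_shares)
print(parent_shares)
```

Both sets of counts sum to 37, which matches the number of AUX tokens reported for this treebank.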

All 37 AUX nodes (100%) are leaves.

The highest child degree of an AUX node is 0.
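
The child degree of a node is the number of tokens whose HEAD points at it, and a leaf is a node with child degree 0. The sketch below shows how that could be counted; the three-token sentence is hypothetical and is not taken from UD_Albanian-TSA, and the function name `child_degrees` is likewise an illustration only.

```python
from collections import Counter

# Hypothetical sentence as (ID, UPOS, HEAD) triples; HEAD 0 marks the root.
tokens = [
    (1, "PRON", 3),
    (2, "AUX", 3),
    (3, "ADJ", 0),
]


def child_degrees(tokens):
    """Map each token ID to the number of tokens attached to it."""
    degree = Counter(head for _, _, head in tokens if head != 0)
    return {tok_id: degree[tok_id] for tok_id, _, _ in tokens}


degrees = child_degrees(tokens)
aux_degrees = [degrees[tok_id] for tok_id, upos, _ in tokens if upos == "AUX"]

print(max(aux_degrees))                  # highest child degree among AUX nodes
print(all(d == 0 for d in aux_degrees))  # True when every AUX node is a leaf
```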