This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-STAF: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

1255 tokens (35%) have a non-empty value of Gender. 683 types (56%) occur at least once with a non-empty value of Gender. 550 lemmas (56%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (595; 17% of instances), DET (235; 7% of instances), PRON (232; 7% of instances), ADJ (162; 5% of instances), PROPN (31; 1% of instances).
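
As a rough illustration of how per-tag counts like these could be derived, the sketch below reads CoNLL-U text and counts, for each UPOS tag, the tokens whose FEATS column contains Gender. The helper names (`parse_feats`, `read_tokens`, `gender_counts_by_upos`) and the five-token sample are hypothetical and are not taken from UD_Albanian-STAF or from the script that generated this page.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for _form, _lemma, upos, feats in read_tokens(conllu_text):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, total

# Hypothetical toy annotation, for illustration only.
ROWS = [
    ("1", "Vedati", "Vedat", "PROPN", "_", "Case=Nom|Gender=Masc|Number=Sing", "2", "nsubj", "_", "_"),
    ("2", "ka", "kam", "VERB", "_", "Number=Sing|Person=3", "0", "root", "_", "_"),
    ("3", "sytë", "sy", "NOUN", "_", "Case=Acc|Gender=Masc|Number=Plur", "2", "obj", "_", "_"),
    ("4", "e", "e", "DET", "_", "Gender=Masc|Number=Plur|PronType=Art", "5", "det:adj", "_", "_"),
    ("5", "bardhë", "bardhë", "ADJ", "_", "Gender=Masc|Number=Plur", "3", "amod", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `sum(with_gender.values())` by `sum(total.values())` would give the overall share of tokens with a non-empty Gender value.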

NOUN

595 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (488; 82%), Definite=Def (349; 59%).
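
Co-occurrence figures of this kind can be approximated by tallying every other feature=value pair on tokens of a given tag that carry Gender. The sketch below assumes tokens are already available as `(upos, feats_dict)` pairs; the function name `cooccurring_features` and the sample tokens are hypothetical, and the exact counting rules of the page generator (for instance, how it reports absent features as EMPTY) are not reproduced here.

```python
from collections import Counter

def cooccurring_features(tokens, upos, feature="Gender"):
    """Count other feature=value pairs on `upos` tokens that have `feature`.

    Returns (counts, n), where n is the number of matching tokens.
    """
    counts = Counter()
    n = 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return counts, n

# Hypothetical tokens, for illustration only.
TOKENS = [
    ("NOUN", {"Gender": "Masc", "Number": "Sing", "Definite": "Def"}),
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Number": "Plur"}),                    # no Gender: ignored
    ("ADJ", {"Gender": "Fem", "Number": "Sing"}),    # other tag: ignored
]

counts, n = cooccurring_features(TOKENS, "NOUN")
for pair, count in counts.most_common():
    print(pair, count, f"{100 * count / n:.0f}%")
```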

NOUN tokens may have the following values of Gender:

Paradigm sytë (Gender values: Masc, Fem)
Case=Acc|Number=Sing: sytë
Case=Acc|Number=Plur: sytë
Case=Nom|Number=Plur: sytë

Gender seems to be a lexical feature of NOUN: 99% of lemmas (378) occur with only one value of Gender.
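
The lexical-feature observation rests on a simple check: collect the set of Gender values seen for each lemma and count how many lemmas have exactly one. A minimal sketch follows, assuming tokens are given as `(lemma, upos, gender)` triples with `None` for a missing Gender; the function name `single_gender_share` and the placeholder lemmas are hypothetical.

```python
from collections import defaultdict

def single_gender_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Placeholder lemmas, for illustration only.
TOKENS = [
    ("lemma1", "NOUN", "Masc"),
    ("lemma1", "NOUN", "Masc"),
    ("lemma2", "NOUN", "Fem"),
    ("lemma3", "NOUN", "Masc"),
    ("lemma3", "NOUN", "Fem"),   # lemma3 occurs with both values
    ("lemma4", "NOUN", None),    # no Gender: not counted
]

single, total = single_gender_share(TOKENS, "NOUN")
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests the feature is a property of the lemma rather than of the individual word form.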

DET

235 DET tokens (78% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=EMPTY (235; 100%), Number=Sing (185; 79%), PronType=Art (157; 67%).

DET tokens may have the following values of Gender:

Paradigm e (Gender values: Masc, Fem)
Case=Acc|Number=Sing|PronType=Art: e (Masc), e (Fem)
Case=Acc|Number=Plur|PronType=Art: e (Masc), e (Fem)
Case=Gen|Number=Plur|PronType=Art
Case=Nom|Number=Sing: e
Case=Nom|Number=Sing|PronType=Art: e
Case=Nom|Number=Plur|PronType=Art: e (Masc), e (Fem)
Number=Sing: e (Masc), e (Fem)
Number=Plur: e (Masc), e (Fem)
Number=Plur|PronType=Art: e

PRON

232 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (190; 82%), PronType=Prs (152; 66%).

PRON tokens may have the following values of Gender:

Paradigm ai (Gender values: Masc, Fem)
Case=Acc|Number=Sing|Person=3|PronType=Prs: e, atë, i, të (Masc), e (Fem)
Case=Acc|Number=Sing|PronType=Dem: atë
Case=Acc|Number=Sing|PronType=Prs: i
Case=Acc|Number=Plur|Person=3|PronType=Prs: i (Masc), i (Fem)
Case=Dat|Number=Sing|Person=3|PronType=Prs: atij, i (Masc), i (Fem)
Case=Nom|Number=Sing|Person=3|PronType=Dem: ai
Case=Nom|Number=Sing|Person=3|PronType=Prs: ai
Case=Nom|Number=Sing|PronType=Dem: atë

ADJ

162 ADJ tokens (91% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (152; 94%), Number=Sing (130; 80%).

ADJ tokens may have the following values of Gender:

Paradigm bardhë (Gender values: Masc, Fem)
Case=Acc|Number=Sing: bardhë
Case=Gen|Number=Sing: bardhë (Masc), bardhë (Fem)
Case=Nom|Number=Sing: bardhë (Masc), bardhë (Fem)
Case=Nom|Number=Plur: bardha

PROPN

31 PROPN tokens (79% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (30; 97%), Definite=Def (24; 77%).

PROPN tokens may have the following values of Gender:

Paradigm Vedat (Gender values: Masc, Fem)
Case=Gen|Number=Sing: Vedatit
Case=Nom|Number=Sing: Vedati
Case=Nom|Number=Plur: Vedat

Gender seems to be a lexical feature of PROPN: 94% of lemmas (16) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: ADJ –[det:adj]–> DET (109; 91%), NOUN –[amod]–> ADJ (107; 95%), NOUN –[det]–> PRON (26; 51%), NOUN –[det:poss]–> PRON (24; 71%), NOUN –[nmod:poss]–> NOUN (24; 51%), NOUN –[conj]–> NOUN (16; 55%), ADJ –[det]–> DET (14; 88%), PRON –[det]–> DET (11; 52%), ADJ –[obl]–> NOUN (7; 64%), PRON –[det:pron]–> DET (7; 78%).
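
Agreement counts like these can be sketched by walking each dependency edge and comparing the Gender of parent and child. The code below is an illustration under stated assumptions: it only considers edges where both nodes carry Gender and uses those edges as the denominator, which may differ from how the percentages on this page were computed. The function name `agreement_counts`, the token dictionaries, and the sample sentence are hypothetical.

```python
from collections import Counter

def agreement_counts(sentence):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    `sentence` is a list of dicts with keys id, upos, gender, head, deprel.
    Returns (agree, both): edges that agree, and edges where both nodes have Gender.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0, which is not a token
            continue
        if child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, both

# Hypothetical three-token sentence, for illustration only.
SENTENCE = [
    {"id": 1, "upos": "NOUN", "gender": "Masc", "head": 0, "deprel": "root"},
    {"id": 2, "upos": "ADJ", "gender": "Masc", "head": 1, "deprel": "amod"},
    {"id": 3, "upos": "DET", "gender": "Fem", "head": 2, "deprel": "det:adj"},
]

agree, both = agreement_counts(SENTENCE)
for (parent, deprel, child), n in both.items():
    matched = agree[(parent, deprel, child)]
    print(f"{parent} -[{deprel}]-> {child}: {matched} of {n} agree")
```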