Treebank Statistics: UD_Albanian-STAF: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
678 tokens (19%) have a non-empty value of Definite.
504 types (41%) occur at least once with a non-empty value of Definite.
401 lemmas (41%) occur at least once with a non-empty value of Definite.
The feature is used with 4 part-of-speech tags: NOUN (587; 16% of all tokens), DET (45; 1%), PROPN (30; 1%), PRON (16; 0%).
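The summary counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `definite_stats` are hypothetical, and the five-token sample is a toy annotation written for illustration, not a sentence taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Definite=Def' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def definite_stats(conllu_text):
    """Count tokens, types and lemmas that carry a non-empty Definite value."""
    tokens = 0
    with_definite = 0
    forms, lemmas = set(), set()
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tokens += 1
        value = parse_feats(cols[5]).get("Definite")
        if value is not None:
            with_definite += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
            per_upos[cols[3]] += 1
    return {
        "tokens": tokens,
        "with_definite": with_definite,
        "types": len(forms),
        "lemmas": len(lemmas),
        "per_upos": per_upos,
    }


# Toy input (hypothetical annotation, for illustration only).
ROWS = [
    ["1", "Ernesti", "Ernest", "PROPN", "_",
     "Case=Nom|Definite=Def|Gender=Masc|Number=Sing", "2", "nsubj", "_", "_"],
    ["2", "pa", "shoh", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "një", "një", "DET", "_", "Definite=Ind", "4", "det", "_", "_"],
    ["4", "grua", "grua", "NOUN", "_",
     "Definite=Ind|Gender=Fem|Number=Sing", "2", "obj", "_", "_"],
    ["5", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

stats = definite_stats(SAMPLE)
share = 100 * stats["with_definite"] / stats["tokens"]
print(f'{stats["with_definite"]} tokens ({share:.0f}%) have a non-empty value of Definite')
```

Types are counted here as distinct word forms with case preserved; whether the published figures normalize case is not stated on this page, so treat that as an assumption.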
NOUN
587 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (480; 82%), Gender=Fem (350; 60%).
NOUN tokens may have the following values of Definite:
Def (349; 59% of non-empty Definite): gjenerali, sytë, Nëna, gjendjes, prifti, shtëpia, babai, dorën, kohën, mendjen
Ind (238; 41% of non-empty Definite): ditë, shi, fillim, arsye, fund, gjë, grua, herë, krahasim, mend
EMPTY (38): gjak, lindur, Mysafiri, Zana, atë, babait, brejtja, dajë, djalë, dëshirës
Paradigm njeri | Ind | Def |
---|---|---|
Case=Acc\|Number=Plur | njerëzit | |
Case=Dat\|Number=Sing | njeriu | |
Case=Dat\|Number=Plur | njerëzve | |
Case=Gen\|Number=Plur | njerëzve | |
Case=Nom\|Number=Sing | njeri, njeriu | |
Case=Nom\|Number=Plur | njerëz | njerëzit |
DET
45 DET tokens (15% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=EMPTY (45; 100%), Gender=EMPTY (45; 100%), Number=EMPTY (45; 100%), PronType=EMPTY (45; 100%).
DET tokens may have the following values of Definite:
Ind (45; 100% of non-empty Definite): një, Nja
EMPTY (255): e, të, i, së, një, nja, pak
PROPN
30 PROPN tokens (77% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (29; 97%), Gender=Masc (20; 67%).
PROPN tokens may have the following values of Definite:
Def (24; 80% of non-empty Definite): Ernesti, Ernestit, Linda, Vedati, Dizit, Ervehenë, Hadi, Hadin, Lorin, Marga
Ind (6; 20% of non-empty Definite): Shqipëri, Berti, Ernest, Vajazan
EMPTY (9): Bamit, Dizi, Dizin, Ernesti, Lindën, Nerminja, Odise, Varrit, shtunë
Paradigm Ernest | Ind | Def |
---|---|---|
Case=Dat | | Ernestit |
Case=Gen | | Ernestit |
Case=Nom | Ernest | Ernesti |
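
A paradigm table of this kind groups the forms of one lemma by their remaining features (rows) and by the value of Definite (columns). The sketch below illustrates that grouping under stated assumptions: the helper name `paradigm` and the tuple layout `(form, lemma, feats)` are hypothetical, and the input is a toy list that reuses only the Case=Nom forms shown above plus one unrelated token to demonstrate the lemma filter.

```python
from collections import defaultdict


def paradigm(tokens, lemma, feature="Definite"):
    """Group forms of `lemma` by their other features and by the value of `feature`.

    `tokens` is an iterable of (form, lemma, feats_dict) tuples.
    Returns {other_features_string: {feature_value: sorted list of forms}}.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        # Keep only tokens of the requested lemma with a non-empty value.
        if tok_lemma != lemma or feature not in feats:
            continue
        row = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[row][feats[feature]].add(form)
    return {row: {val: sorted(forms) for val, forms in cols.items()}
            for row, cols in table.items()}


# Toy input (illustrative only; the lemma given for the second name is an assumption).
TOKENS = [
    ("Ernest", "Ernest", {"Case": "Nom", "Definite": "Ind"}),
    ("Ernesti", "Ernest", {"Case": "Nom", "Definite": "Def"}),
    ("Linda", "Linda", {"Case": "Nom", "Definite": "Def"}),
]

ernest = paradigm(TOKENS, "Ernest")
for row, cols in ernest.items():
    print(row, "|", ", ".join(cols.get("Ind", [])), "|", ", ".join(cols.get("Def", [])), "|")
```

Which features appear in the row labels of the published tables is decided by the generating tool; this sketch simply uses every feature other than Definite.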
PRON
16 PRON tokens (4% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Person=EMPTY (16; 100%), Gender=Masc (13; 81%), Number=Sing (11; 69%), PronType=EMPTY (9; 56%).
PRON tokens may have the following values of Definite:
Def (10; 63% of non-empty Definite): tjerë, Ç’, ka, mi, njena, sajin, tjerash, tjerëve
Ind (6; 38% of non-empty Definite): tjetër, më, Asnjeri
EMPTY (414): e, i, më, që, unë, ai, kjo, tij, ky, ajo
Paradigm tjetër | Ind | Def |
---|---|---|
Case=Acc\|Gender=Masc\|Number=Sing | tjetër | |
Case=Acc\|Gender=Masc\|Number=Plur | | tjerëve |
Case=Acc\|Gender=Fem\|Number=Sing | tjetër | |
Case=Nom\|Gender=Masc\|Number=Plur | | tjerë |
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod:poss]–> NOUN (30; 67%),
NOUN –[nmod]–> NOUN (29; 59%),
NOUN –[conj]–> NOUN (22; 76%),
NOUN –[nmod:poss]–> PROPN (4; 80%),
NOUN –[nsubj]–> NOUN (2; 67%),
NOUN –[amod]–> PRON (1; 100%),
PROPN –[nmod:poss]–> NOUN (1; 100%).
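
Agreement counts like those listed above can be sketched as follows. This is an illustration rather than the tool that produced this page: the function names are hypothetical, the sample tree is a toy annotation, and the percentage is assumed to be taken over parent-child pairs in which both nodes have a non-empty value of Definite.

```python
from collections import Counter


def read_sentences(conllu_text):
    """Yield each sentence as {token ID: (UPOS, feats dict, HEAD, DEPREL)}."""
    sent = {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            if sent:
                yield sent
                sent = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep basic tokens only: integer ID and integer HEAD.
        if len(cols) != 10 or not cols[0].isdigit() or not cols[6].isdigit():
            continue
        feats = {} if cols[5] == "_" else dict(
            pair.split("=", 1) for pair in cols[5].split("|"))
        sent[int(cols[0])] = (cols[3], feats, int(cols[6]), cols[7])


def definite_agreement(conllu_text):
    """Count (parent UPOS, relation, child UPOS) pairs where both have Definite."""
    both, agree = Counter(), Counter()
    for sent in read_sentences(conllu_text):
        for upos, feats, head, deprel in sent.values():
            parent = sent.get(head)  # None for the root (HEAD = 0)
            if parent is None:
                continue
            child_val = feats.get("Definite")
            parent_val = parent[1].get("Definite")
            if child_val is None or parent_val is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if child_val == parent_val:
                agree[key] += 1
    return both, agree


# Toy tree (hypothetical annotation, for illustration only).
ROWS = [
    ["1", "shtëpia", "shtëpi", "NOUN", "_",
     "Case=Nom|Definite=Def|Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    ["2", "e", "i", "DET", "_", "_", "3", "det", "_", "_"],
    ["3", "gjeneralit", "gjeneral", "NOUN", "_",
     "Case=Gen|Definite=Def|Gender=Masc|Number=Sing", "1", "nmod:poss", "_", "_"],
    ["4", "ditë", "ditë", "NOUN", "_", "Definite=Ind|Number=Sing", "1", "nmod", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

both, agree = definite_agreement(SAMPLE)
for (parent, rel, child), n in agree.most_common(10):
    pct = round(100 * n / both[(parent, rel, child)])
    print(f"{parent} –[{rel}]–> {child} ({n}; {pct}%)")
```

Pairs in which either node lacks the feature are skipped entirely, so a relation only appears in the output when at least one agreeing pair was observed.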