This page pertains to UD version 2.

Treebank Statistics: UD_Lithuanian-ALKSNIS: Features: Gender

This feature is universal. It occurs with 4 different values: Com, Fem, Masc, Neut.

34426 tokens (49%) have a non-empty value of Gender. 14372 types (80%) occur at least once with a non-empty value of Gender. 6232 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (21127; 30% of instances), ADJ (4648; 7% of instances), VERB (3471; 5% of instances), DET (1780; 3% of instances), PROPN (1574; 2% of instances), PRON (1476; 2% of instances), NUM (338; 0% of instances), AUX (10; 0% of instances), X (2; 0% of instances).
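Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that produced this page: the three-token `SAMPLE` is hypothetical rather than taken from the treebank, and the helper names `parse_feats` and `gender_by_upos` are introduced here for illustration only.

```python
from collections import Counter

# Hypothetical three-token sample in CoNLL-U format (not taken from the treebank).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
1\tsvarbus\tsvarbus\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Masc|Number=Sing\t2\tamod\t_\t_
2\tdarželis\tdarželis\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_
3\tyra\tbūti\tAUX\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count tokens per UPOS tag, and tokens per UPOS tag that carry Gender."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender


total, with_gender = gender_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], "of", total[upos])
```

Dividing a tag's `with_gender` count by its `total` count gives the per-tag coverage reported in the sections below, while dividing by the sum of all `total` counts gives the share of all tokens.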

NOUN

21127 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (13634; 65%).

NOUN tokens may have the following values of Gender:

Paradigm darželis (Masc / Fem)
Case=Acc|Number=Plur: darželius
Case=Dat|Number=Plur: darželiams
Case=Gen|Number=Sing: darželio / darželio
Case=Gen|Number=Plur: darželių

Gender seems to be a lexical feature of NOUN. 100% of lemmas (3399) occur with only one value of Gender.
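The "lexical feature" check amounts to collecting the set of Gender values seen with each lemma and counting how many lemmas have exactly one. A minimal sketch follows, assuming the tokens have already been reduced to (lemma, FEATS) pairs; the list `NOUN_TOKENS` and the function `gender_values_per_lemma` are hypothetical and exist only for this illustration.

```python
from collections import defaultdict

# Hypothetical (lemma, FEATS) pairs for NOUN tokens; not taken from the treebank.
NOUN_TOKENS = [
    ("darželis", "Case=Acc|Gender=Masc|Number=Plur"),
    ("darželis", "Case=Gen|Gender=Masc|Number=Sing"),
    ("mokykla", "Case=Nom|Gender=Fem|Number=Sing"),
]


def gender_values_per_lemma(tokens):
    """Map each lemma to the set of Gender values it occurs with."""
    values = defaultdict(set)
    for lemma, feats in tokens:
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "Gender":
                values[lemma].add(value)
    return values


values = gender_values_per_lemma(NOUN_TOKENS)
# Lemmas that were only ever seen with a single Gender value.
single = sum(1 for seen in values.values() if len(seen) == 1)
share = 100 * single / len(values)
print(f"{share:.0f}% of lemmas ({single}) occur with only one value of Gender")
```

A share close to 100% suggests the feature is a property of the lemma (lexical) rather than something that varies across its inflected forms.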

ADJ

4648 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4360; 94%), Definite=Ind (4300; 93%), Number=Sing (2627; 57%).

ADJ tokens may have the following values of Gender:

Paradigm svarbus (Masc / Fem / Neut)
Case=Acc|Degree=Pos|Number=Sing: svarbų / svarbią
Case=Acc|Degree=Pos|Number=Plur: svarbius / svarbias
Case=Acc|Degree=Sup|Number=Plur: svarbiausius / svarbiausias
Case=Dat|Degree=Sup|Number=Plur: svarbiausioms
Case=Gen|Degree=Pos|Number=Sing: svarbaus
Case=Gen|Degree=Pos|Number=Plur: svarbių
Case=Gen|Degree=Sup|Number=Plur: svarbiausių / svarbiausių
Case=Ins|Degree=Pos|Number=Plur: svarbiais
Case=Ins|Degree=Cmp|Number=Plur: svarbesniais
Case=Ins|Degree=Sup|Number=Sing: svarbiausiu
Case=Ins|Degree=Sup|Number=Plur: svarbiausiomis
Case=Nom|Degree=Pos|Number=Sing: svarbus / svarbi
Case=Nom|Degree=Pos|Number=Plur: svarbūs / svarbios
Case=Nom|Degree=Cmp|Number=Sing: svarbesnis
Case=Nom|Degree=Cmp|Number=Plur: svarbesni
Case=Nom|Degree=Sup|Number=Sing: svarbiausias / svarbiausia
Case=Nom|Degree=Sup|Number=Plur: svarbiausi / svarbiausios
Degree=Pos: svarbu
Degree=Sup: svarbiausia

VERB

3471 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (3471; 100%), Mood=EMPTY (3445; 99%), Polarity=Pos (3273; 94%), VerbForm=Part (3266; 94%), Reflex=EMPTY (3202; 92%), Definite=Ind (3137; 90%), Aspect=EMPTY (2953; 85%), Voice=Pass (2237; 64%).

VERB tokens may have the following values of Gender:

Paradigm galėti (Masc / Fem / Neut)
Aspect=Perf|Case=Nom|Number=Sing|Tense=Past|Voice=Act: galėjęs
Case=Acc|Number=Plur|Tense=Pres|Voice=Act: galinčias
Case=Acc|Number=Plur|Tense=Pres|Voice=Pass: galimus
Case=Gen|Number=Plur|Tense=Pres|Voice=Act: galinčių
Case=Ins|Number=Sing|Tense=Pres|Voice=Pass: galima
Case=Nom|Number=Sing|Tense=Pres|Voice=Pass: galimas
Case=Nom|Number=Plur|Tense=Pres|Voice=Pass: galimi
Tense=Pres|Voice=Pass: galima

DET

1780 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Definite=Ind (1773; 100%), Number=Sing (995; 56%), PronType=Dem (973; 55%).

DET tokens may have the following values of Gender:

Paradigm tas (Masc / Fem / Neut)
Case=Acc|Definite=Ind|Hyph=Yes|Number=Sing
Case=Acc|Definite=Ind|Hyph=Yes|Number=Plur: tuos
Case=Acc|Definite=Ind|Number=Sing
Case=Acc|Definite=Ind|Number=Plur: tuos / tas
Case=Dat|Definite=Ind|Hyph=Yes|Number=Plur: tiems
Case=Dat|Definite=Ind|Number=Sing: tam, tuo
Case=Dat|Definite=Ind|Number=Plur: tiems
Case=Gen|Definite=Ind|Hyph=Yes|Number=Sing: to
Case=Gen|Definite=Ind|Number=Sing: to / tos
Case=Gen|Definite=Ind|Number=Plur
Case=Ins|Definite=Ind|Hyph=Yes|Number=Sing: tuo / ta
Case=Ins|Definite=Ind|Hyph=Yes|Number=Plur: tomis
Case=Ins|Definite=Ind|Number=Sing: tuo / ta
Case=Ins|Definite=Ind|Number=Plur: tais / tomis
Case=Loc|Definite=Ind|Hyph=Yes|Number=Sing: toje
Case=Loc|Definite=Ind|Number=Sing: tame / toje
Case=Loc|Definite=Ind|Number=Plur: tose
Case=Nom|Definite=Def|Number=Sing: Tasai / toji
Case=Nom|Definite=Ind|Hyph=Yes|Number=Sing: tas / ta
Case=Nom|Definite=Ind|Hyph=Yes|Number=Plur: tie / tos
Case=Nom|Definite=Ind|Number=Sing: tas / ta
Case=Nom|Definite=Ind|Number=Plur: tie / tos
Definite=Def: tatai
Definite=Ind: tai

PROPN

1574 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1540; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Klaipėda (Masc / Fem)
Case=Loc: Klaipėdoje
Case=Nom: Klaipėda

Gender seems to be a lexical feature of PROPN. 100% of lemmas (545) occur with only one value of Gender.

PRON

1476 PRON tokens (61% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Definite=Ind (1470; 100%), PronType=Prs (905; 61%), Person=3 (898; 61%).

PRON tokens may have the following values of Gender:

Paradigm vienas (Masc / Fem / Neut)
Case=Acc|Number=Sing: vieną / vieną
Case=Acc|Number=Plur: vienus
Case=Dat|Number=Sing: vienam
Case=Dat|Number=Plur: vieniems
Case=Gen|Number=Sing: vieno / vienos
Case=Gen|Number=Plur: vienų
Case=Ins|Hyph=Yes|Number=Sing: vienu
Case=Ins|Number=Sing: vienu
Case=Loc|Number=Sing: viename / vienoje
Case=Loc|Number=Plur: Vienose
Case=Nom|Hyph=Yes|Number=Sing: vienas
Case=Nom|Hyph=Yes|Number=Plur: vieni
Case=Nom|Number=Sing: vienas / viena
Case=Nom|Number=Plur: vieni
viena

NUM

338 NUM tokens (20% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (319; 94%), Definite=EMPTY (190; 56%), NumType=Card (190; 56%).

NUM tokens may have the following values of Gender:

Paradigm pirmas (Masc / Fem / Neut)
Case=Acc|Definite=Def|Number=Sing|NumForm=Word: pirmąjį / pirmąją
Case=Acc|Definite=Ind|Hyph=Yes|Number=Sing|NumForm=Word: pirmą
Case=Acc|Definite=Ind|Number=Sing|NumForm=Word: pirmą
Case=Dat|Definite=Def|Number=Sing|NumForm=Word: Pirmajai
Case=Gen|Definite=Def|Number=Sing|NumForm=Word: pirmojo
Case=Gen|Definite=Def|Number=Plur|NumForm=Word: pirmųjų
Case=Gen|Definite=Ind|Number=Sing|NumForm=Word: pirmo / pirmos
Case=Gen|Definite=Ind|Number=Plur|NumForm=Word: pirmo, pirmųjų
Case=Ins|Definite=Def|Number=Plur|NumForm=Word: pirmaisiais
Case=Loc|Definite=Def|Number=Sing|NumForm=Word: pirmajame / pirmojoje
Case=Loc|Definite=Def|Number=Plur|NumForm=Word: pirmuosiuose
Case=Loc|Definite=Ind|Number=Sing|NumForm=Word: pirmame
Case=Loc|Definite=Ind|Number=Plur|NumForm=Word: Pirmuose
Case=Nom|Definite=Def|Number=Sing|NumForm=Word: pirmasis / pirmoji
Case=Nom|Definite=Def|Number=Sing: pirmoji
Case=Nom|Definite=Def|Number=Plur|NumForm=Word: pirmieji / pirmosios
Case=Nom|Definite=Ind|Number=Sing|NumForm=Word: pirmas / pirmoji
Case=Nom|Definite=Ind|Number=Plur|NumForm=Word: pirmi
Definite=Ind|NumForm=Word: pirma

AUX

10 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (10; 100%), Person=EMPTY (10; 100%), Polarity=Pos (10; 100%), VerbForm=Part (9; 90%), Number=Sing (8; 80%), Aspect=EMPTY (7; 70%), Tense=Pres (6; 60%).

AUX tokens may have the following values of Gender:

Paradigm būti (Masc / Fem / Neut)
Aspect=Perf|Case=Nom|Definite=Ind|Number=Sing|Tense=Past|VerbForm=Part|Voice=Act: buvęs / buvusi
Aspect=Perf|Definite=Ind|Tense=Past|VerbForm=Part|Voice=Act: buvę
Case=Nom|Definite=Ind|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Act: esąs
Case=Nom|Definite=Ind|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Act: esą
Number=Sing|VerbForm=Conv: būdama

X

2 X tokens (0% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Abbr=EMPTY (2; 100%), Hyph=EMPTY (2; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where the parent and child nodes agree in Gender: NOUN –[amod]–> ADJ (3658; 99%), NOUN –[acl]–> VERB (1532; 84%), NOUN –[conj]–> NOUN (1259; 59%), NOUN –[det]–> DET (896; 100%), NOUN –[obl:arg]–> NOUN (651; 52%), NOUN –[nmod]–> PRON (499; 56%), NOUN –[nmod]–> PROPN (488; 67%), VERB –[nsubj:pass]–> NOUN (413; 98%), ADJ –[conj]–> ADJ (278; 96%), NOUN –[obl]–> NOUN (221; 53%).
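Agreement figures like these can be approximated by comparing the Gender value of each node with that of its head. The sketch below is an illustration under stated assumptions, not the procedure used to build this page: the sentence in `ROWS` is hypothetical, the function `gender_agreement` is introduced here, and the denominator (relations in which both nodes have a Gender value) is an assumption about how the percentages are defined.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, feats, head, deprel); not from the treebank.
ROWS = [
    (1, "ADJ", "Case=Nom|Gender=Fem|Number=Sing", 2, "amod"),
    (2, "NOUN", "Case=Nom|Gender=Fem|Number=Sing", 0, "root"),
    (3, "NOUN", "Case=Gen|Gender=Masc|Number=Sing", 2, "nmod"),
]


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_agreement(rows):
    """Count relations where both nodes have Gender, and those where it matches."""
    by_id = {row[0]: row for row in rows}
    both, agree = Counter(), Counter()
    for _tid, upos, feats, head, deprel in rows:
        if head == 0:
            continue  # the root has no parent node
        parent = by_id[head]
        child_gender = parse_feats(feats).get("Gender")
        parent_gender = parse_feats(parent[2]).get("Gender")
        if child_gender is None or parent_gender is None:
            continue  # assumed: only pairs with Gender on both sides are counted
        key = (parent[1], deprel, upos)  # (parent UPOS, relation, child UPOS)
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, both


agree, both = gender_agreement(ROWS)
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```

Keying the counters on the (parent tag, relation, child tag) triple mirrors the way the relations are listed above, so sorting `agree` by count would yield a ranking of the same shape.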