Treebank Statistics: UD_Pashto-Sikaram: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
464 tokens (47%) have a non-empty value of Gender.
322 types (73%) occur at least once with a non-empty value of Gender.
270 lemmas (74%) occur at least once with a non-empty value of Gender.
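The token, type and lemma coverage figures above can in principle be recomputed from a CoNLL-U file by inspecting the FEATS column of each token. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats`, `read_tokens` and `feature_coverage` are hypothetical, and `SAMPLE` is a made-up four-token fragment whose lemmas and dependency columns are assumptions for illustration only.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(conllu_text, feature="Gender"):
    """Return (with_feature, total) counts for tokens, types and lemmas."""
    tokens = list(read_tokens(conllu_text))
    marked = [tok for tok in tokens if feature in tok[3]]
    return {
        "tokens": (len(marked), len(tokens)),
        "types": (len({t[0] for t in marked}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in marked}), len({t[1] for t in tokens})),
    }


# Hypothetical toy input; NOT an excerpt from UD_Pashto-Sikaram.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tژبه\tژبه\tNOUN\t_\tGender=Fem\t4\tnsubj\t_\t_",
    "2\tژبې\tژبه\tNOUN\t_\tGender=Fem\t4\tobj\t_\t_",
    "3\tوخت\tوخت\tNOUN\t_\tGender=Masc\t4\tobl\t_\t_",
    "4\tلري\tلرل\tVERB\t_\t_\t0\troot\t_\t_",
])

coverage = feature_coverage(SAMPLE)
for level, (marked, total) in coverage.items():
    print(f"{level}: {marked} of {total} ({100 * marked / total:.0f}%)")
```

On a real treebank, the same function would be applied to the full text of the `.conllu` file; a type here is simply a distinct word form, which is an assumption about how the page counts types.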
The feature is used with 8 part-of-speech tags: NOUN (232; 23% instances), ADJ (101; 10% instances), VERB (46; 5% instances), PROPN (28; 3% instances), DET (24; 2% instances), AUX (20; 2% instances), PRON (7; 1% instances), NUM (6; 1% instances).
NOUN
232 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (177; 76%), Case=Nom (117; 50%).
NOUN tokens may have the following values of Gender:
Fem (101; 44% of non-empty Gender): ژبه, ژبې, ژباړې, سیمې, نړۍ, اترو, برخې, خبرو, خبرې, خوا
Masc (131; 56% of non-empty Gender): ارزښت, اثر, دود, وخت, شمېر, انسانانو, خلکو, زر, شرق, غرب
Gender seems to be a lexical feature of NOUN. 100% of lemmas (145) occur with only one value of Gender.
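The "lexical feature" judgement above rests on how many lemmas occur with a single Gender value. A minimal sketch of that check follows, assuming the input is a list of (lemma, Gender) pairs with `None` for tokens without Gender; the function name `single_value_share` and the toy pairs are hypothetical, and the page's own tooling may count differently.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (lemmas_with_one_value, lemmas_with_any_value)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender is not None:  # ignore tokens with an empty Gender
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical toy data, not treebank counts.
PAIRS = [
    ("lemma_a", "Fem"),
    ("lemma_a", "Fem"),
    ("lemma_b", "Masc"),
    ("lemma_c", "Masc"),
    ("lemma_c", "Fem"),   # lemma_c inflects for Gender
    ("lemma_d", None),    # never marked for Gender, so not counted
]

single, total = single_value_share(PAIRS)
print(f"{100 * single / total:.0f}% of lemmas ({single}) occur with only one value")
```

A share close to 100% suggests that Gender is an inherent property of the lemma, while a lower share suggests that the word inflects for Gender.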
ADJ
101 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (75; 74%), Case=Nom (67; 66%).
ADJ tokens may have the following values of Gender:
Fem (36; 36% of non-empty Gender): لیکنۍ, نړيواله, ژوندۍ, ښه, ادبي, اصلي, اغېزناکه, اوسنۍ, اوچته, ايرانۍ
Masc (65; 64% of non-empty Gender): بېل, جوړ, راټول, شهکار, فرهنګي, لږ, نور, هنري, ټولنیز, ګڼ
Paradigm نړيوال | Masc | Fem |
---|---|---|
Case=Abl\|Number=Plur | نړيوالو | |
Case=Acc\|Number=Plur | نړيوالو | |
Case=Loc\|Number=Sing | | نړيواله |
Case=Nom\|Number=Sing | | نړيواله |
Gender seems to be a lexical feature of ADJ. 91% of lemmas (67) occur with only one value of Gender.
VERB
46 VERB tokens (51% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=Past (45; 98%), Number=Sing (33; 72%), Aspect=Perf (31; 67%), Case=EMPTY (31; 67%), Mood=Ind (31; 67%), Person=3 (31; 67%), VerbForm=Fin (31; 67%).
VERB tokens may have the following values of Gender:
Fem (15; 33% of non-empty Gender): شوې, کړه, درلوده, رسولې, روزونکې, شوه, وهله, وهلې, وکړه, وګټله
Masc (31; 67% of non-empty Gender): شو, کړل, کښل, راتلی, راوغزول, راوژباړه, رسېدلی, شوى, شوي, لوستى
EMPTY (44): لري, وي, کوي, ګڼل, اواروي, برېښي, خپروي, شته, شو, شي
Paradigm کول | Masc | Fem |
---|---|---|
Aspect=Imp\|Mood=Ind\|Number=Sing\|Person=3\|VerbForm=Fin | کاوه | |
Aspect=Perf\|Case=Nom\|Number=Sing\|VerbForm=Part | کړى | کړې |
Aspect=Perf\|Case=Nom\|Number=Plur\|VerbForm=Part | کړي | |
Aspect=Perf\|Mood=Ind\|Number=Sing\|Person=3\|Variant=Long\|VerbForm=Fin | وکړ | وکړه |
Aspect=Perf\|Mood=Ind\|Number=Sing\|Person=3\|VerbForm=Fin | کړ | کړه |
Aspect=Perf\|Mood=Ind\|Number=Plur\|Person=3\|VerbForm=Fin | کړل | |
Gender seems to be a lexical feature of VERB. 91% of lemmas (20) occur with only one value of Gender.
PROPN
28 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (27; 96%).
PROPN tokens may have the following values of Gender:
Fem (11; 39% of non-empty Gender): اردو, مریم, امريکا, انګرېزۍ, براون, جانې, فرانسې
Masc (17; 61% of non-empty Gender): پیتر, ټیګور, افغان, ايرانیانو, ایګوازو, حبیبي, سامه, سمیس, طلوع, نوبل
Gender seems to be a lexical feature of PROPN. 100% of lemmas (19) occur with only one value of Gender.
DET
24 DET tokens (47% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (23; 96%), Number=Sing (13; 54%), Poss=EMPTY (13; 54%), Reflex=EMPTY (13; 54%).
DET tokens may have the following values of Gender:
Fem (10; 42% of non-empty Gender): هرې, هره, خپله, دې, ټوله, ټولې
Masc (14; 58% of non-empty Gender): خپل, هر, کوم, خپلو
EMPTY (27): هغه, دغه, هماغه, داسې, ستا, څو, دا, دې, زما, هماغسې
Paradigm خپل | Masc | Fem |
---|---|---|
Case=Acc\|Number=Sing | خپل | |
Case=Acc\|Number=Plur | خپلو | |
Case=Loc\|Number=Sing | | خپله |
Case=Nom\|Number=Sing | خپل | |
Case=Nom\|Number=Plur | خپل | |
AUX
20 AUX tokens (43% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (19; 95%), Mood=Ind (17; 85%), Person=3 (17; 85%), VerbForm=Fin (17; 85%), Aspect=EMPTY (16; 80%), Tense=Pres (13; 65%).
AUX tokens may have the following values of Gender:
Fem (7; 35% of non-empty Gender): ده, وه
Masc (13; 65% of non-empty Gender): دى, دی, شوى, شو, وو, کولی
EMPTY (26): به, وي, کېږي, شوای, شې, شي, وای, ونه, ونۀ
Paradigm یم | Masc | Fem |
---|---|---|
Number=Sing\|Tense=Past | | وه |
Number=Sing\|Tense=Pres | دى, دی | ده |
Number=Plur\|Tense=Past | وو | |
PRON
7 PRON tokens (12% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (7; 100%), Number=Sing (6; 86%), Variant=EMPTY (6; 86%), Case=Acc (5; 71%), Deixis=Remt (5; 71%), Person=3 (5; 71%), PronType=Prs (5; 71%).
PRON tokens may have the following values of Gender:
Fem (3; 43% of non-empty Gender): هغې, دې
Masc (4; 57% of non-empty Gender): هغۀ, ټولو
EMPTY (50): يې, چې, ور, څه, یې, ما, ځان, دا, دوی, هغوى
Paradigm هغه | Masc | Fem |
---|---|---|
| | هغۀ | هغې |
NUM
6 NUM tokens (100% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (6; 100%), Case=Acc (4; 67%).
NUM tokens may have the following values of Gender:
Fem (4; 67% of non-empty Gender): يوې, يوه
Masc (2; 33% of non-empty Gender): يوه, یو
Paradigm یو | Masc | Fem |
---|---|---|
Case=Acc | يوه | يوې |
Case=Nom | یو | يوه |
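A paradigm table like the one above groups the forms of one lemma by their remaining features, with one column per Gender value. A rough sketch of that grouping follows, assuming each token is a (form, lemma, features) triple. The name `build_paradigm` is hypothetical, and `TOKENS` copies only the four form and Case combinations shown in the table for یو, omitting any other features the real annotation carries.

```python
from collections import defaultdict

LEMMA = "یو"

# The four cells of the table above, one token per cell (simplified features).
TOKENS = [
    ("يوه", LEMMA, {"Case": "Acc", "Gender": "Masc"}),
    ("يوې", LEMMA, {"Case": "Acc", "Gender": "Fem"}),
    ("یو", LEMMA, {"Case": "Nom", "Gender": "Masc"}),
    ("يوه", LEMMA, {"Case": "Nom", "Gender": "Fem"}),
]


def build_paradigm(tokens, lemma, feature="Gender"):
    """Map each remaining feature bundle to {feature value: sorted forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # Row label: all features except the one spread across columns.
        row = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[row][feats[feature]].add(form)
    return {
        row: {value: sorted(forms) for value, forms in columns.items()}
        for row, columns in table.items()
    }


paradigm = build_paradigm(TOKENS, LEMMA)
for row, columns in paradigm.items():
    print(row, columns.get("Masc", []), columns.get("Fem", []))
```

A cell stays empty whenever the treebank contains no token of the lemma with that combination of features, which is why many cells in the paradigm tables are blank.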
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (57; 100%),
VERB –[obj]–> NOUN (24; 57%),
ADJ –[conj]–> ADJ (14; 100%),
VERB –[xcomp]–> ADJ (12; 75%),
NOUN –[conj]–> NOUN (10; 71%),
NOUN –[nmod]–> PROPN (7; 70%),
ADJ –[nsubj]–> NOUN (5; 100%),
NOUN –[fixed]–> NOUN (5; 100%),
NOUN –[nummod]–> NUM (5; 100%),
ADJ –[conj]–> NOUN (4; 100%).
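Agreement counts like those listed above can be gathered by walking each dependency edge and comparing the Gender of the parent and the child. The sketch below assumes that the percentage is taken over edges where both nodes have a non-empty Gender; the page does not state its denominator, so this is an assumption. The function name `agreement_by_relation` and the three-token sentence are hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Gender"):
    """Return {(parent_upos, deprel, child_upos): (agreeing, comparable)}."""
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 marks the root, which has no parent
        child_value = child["feats"].get(feature)
        parent_value = parent["feats"].get(feature)
        if child_value is None or parent_value is None:
            continue  # only compare edges where both ends carry the feature
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child_value == parent_value:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Hypothetical toy sentence: an adjective modifying a noun object of a verb.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "obj", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Gender": "Masc"}},
]

for (parent, rel, child), (ok, n) in agreement_by_relation(SENTENCE).items():
    print(f"{parent} -[{rel}]-> {child} ({ok}; {100 * ok / n:.0f}%)")
```

Summing these per-sentence results over a whole treebank and sorting by the agreeing count would give a ranking in the same shape as the list above.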