home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bhojpuri-BHTB: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

3957 tokens (59%) have a non-empty value of Gender. 1340 types (80%) occur at least once with a non-empty value of Gender. 1291 lemmas (79%) occur at least once with a non-empty value of Gender. The feature is used with 14 part-of-speech tags: NOUN (1621; 24% instances), ADP (571; 9% instances), VERB (513; 8% instances), PROPN (347; 5% instances), AUX (205; 3% instances), DET (187; 3% instances), PRON (172; 3% instances), ADJ (114; 2% instances), PART (96; 1% instances), NUM (90; 1% instances), CCONJ (32; 0% instances), ADV (4; 0% instances), INTJ (4; 0% instances), SCONJ (1; 0% instances).

NOUN

1621 NOUN tokens (87% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (1559; 96%), Number=Sing (1502; 93%), Case=Nom (940; 58%).

NOUN tokens may have the following values of Gender:

Paradigm बिआहMascFem
Case=Acc|Number=Singबिआहबिआह
Case=Nom|Number=Singबिआहबिआह
Case=Nom|Number=Plurबिआह

Gender seems to be lexical feature of NOUN. 93% lemmas (722) occur only with one value of Gender.

ADP

571 ADP tokens (58% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Post (555; 97%), Number=Sing (474; 83%), Case=Acc (346; 61%).

ADP tokens may have the following values of Gender:

Paradigm काMascFem
Case=Acc|Number=Singके
Case=Acc|Number=Plurके
Case=Nom|Number=Singका, केके
Case=Nom|Number=Sing|Person=3|Polite=Formके
Case=Nom|Number=Plurके
Number=Plur|Person=3के

Gender seems to be lexical feature of ADP. 94% lemmas (33) occur only with one value of Gender.

VERB

513 VERB tokens (67% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (399; 78%), Person=3 (371; 72%), Voice=EMPTY (335; 65%), Aspect=EMPTY (321; 63%), VerbForm=EMPTY (310; 60%).

VERB tokens may have the following values of Gender:

Paradigm होMascFem
Aspect=Perf|Number=Sing|VerbForm=Partहोखी
Aspect=Perf|Number=Sing|VerbForm=Part|Voice=Actहोखी
Aspect=Perf|Number=Plur|VerbForm=Partहोई
Aspect=Perf|Number=Plur|VerbForm=Part|Voice=Actहोखे
Case=Acc|Number=Singहोखी
Case=Acc|Number=Sing|Person=3होखे
Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Actहोई
Number=Sing|Person=3|VerbForm=Inf|Voice=Actहोखे
Number=Sing|Person=3|Voice=Actहो
Number=Sing|Voice=Actहो
Number=Plur|Person=3|Voice=Actहो

Gender seems to be lexical feature of VERB. 91% lemmas (195) occur only with one value of Gender.

PROPN

347 PROPN tokens (82% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (345; 99%), Person=3 (344; 99%), Case=Nom (235; 68%).

PROPN tokens may have the following values of Gender:

Paradigm भोजपुरीMascFem
Case=Accभोजपुरीभोजपुरी
Case=Nomभोजपुरी

Gender seems to be lexical feature of PROPN. 96% lemmas (155) occur only with one value of Gender.

AUX

205 AUX tokens (58% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (192; 94%), Voice=EMPTY (174; 85%), Polite=EMPTY (167; 81%), Person=3 (150; 73%), Aspect=EMPTY (123; 60%), VerbForm=EMPTY (112; 55%), Case=Nom (106; 52%).

AUX tokens may have the following values of Gender:

Paradigm बाMascFem
Aspect=Perf|Number=Plur|VerbForm=Partबाड़े
Case=Nom|Number=Sing|Person=3बा, बाड़न, बाड़बा, बाड़ी

DET

187 DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: NumType=EMPTY (187; 100%), PronType=EMPTY (181; 97%), Person=3 (180; 96%), Number=Sing (171; 91%), Case=Nom (163; 87%).

DET tokens may have the following values of Gender:

Paradigm MascFem

Gender seems to be lexical feature of DET. 93% lemmas (41) occur only with one value of Gender.

PRON

172 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (161; 94%), Aspect=EMPTY (160; 93%), VerbForm=EMPTY (160; 93%), Case=Nom (143; 83%), PronType=EMPTY (129; 75%), Person=3 (121; 70%).

PRON tokens may have the following values of Gender:

Paradigm हमरMascFem
Case=Acc,Gen|Person=1|Poss=Yes|PronType=Prsहमरा
Case=Acc|Number=Sing|Person=3हमरा
Case=Nom|Number=Sing|Person=3हमरा

ADJ

114 ADJ tokens (46% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (107; 94%), Person=3 (97; 85%), Case=Nom (96; 84%).

ADJ tokens may have the following values of Gender:

Paradigm बेहूदाMascFem
Case=Accबेहूदा
Case=Nom|Person=3बेहूदा

Gender seems to be lexical feature of ADJ. 99% lemmas (70) occur only with one value of Gender.

PART

96 PART tokens (50% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Person=3 (91; 95%), Case=Nom (87; 91%), Number=Sing (80; 83%).

PART tokens may have the following values of Gender:

Paradigm MascFem
Number=Sing
Number=Plur

NUM

90 NUM tokens (60% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (90; 100%), Case=Nom (85; 94%), Person=3 (85; 94%), Number=Sing (70; 78%).

NUM tokens may have the following values of Gender:

Paradigm एगोMascFem
एगोएगो

Gender seems to be lexical feature of NUM. 93% lemmas (25) occur only with one value of Gender.

CCONJ

32 CCONJ tokens (21% of all CCONJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which CCONJ and Gender co-occurred: Case=Nom (30; 94%), Number=Sing (30; 94%), Person=3 (30; 94%).

CCONJ tokens may have the following values of Gender:

ADV

4 ADV tokens (13% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Number=Sing (3; 75%).

ADV tokens may have the following values of Gender:

INTJ

4 INTJ tokens (80% of all INTJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which INTJ and Gender co-occurred: Case=Acc (4; 100%), Number=Sing (4; 100%).

INTJ tokens may have the following values of Gender:

SCONJ

1 SCONJ tokens (1% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: VERB –[compound]–> NOUN (173; 51%), NOUN –[nmod]–> NOUN (164; 56%), NOUN –[compound]–> NOUN (156; 64%), PROPN –[compound]–> PROPN (78; 54%), NOUN –[compound]–> DET (54; 57%), NOUN –[compound]–> NUM (35; 61%), NOUN –[compound]–> ADJ (33; 62%), NOUN –[nmod]–> VERB (23; 72%), NOUN –[obl]–> NOUN (19; 66%), NOUN –[conj]–> NOUN (17; 65%).