Treebank Statistics: UD_Beja-Autogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
4528 tokens (38%) have a non-empty value of Gender.
936 types (49%) occur at least once with a non-empty value of Gender.
1 lemma (0) occurs at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: DET (1733; 15% instances), NOUN (1396; 12% instances), VERB (1063; 9% instances), SCONJ (157; 1% instances), PRON (110; 1% instances), AUX (59; 0% instances), INTJ (5; 0% instances), ADJ (3; 0% instances), NUM (1; 0% instances), PART (1; 0% instances).
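Per-tag counts like these can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout; the function names (`parse_feats`, `gender_by_upos`, `row`) and the toy rows are illustrative assumptions and are not taken from the treebank or from the script that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (tokens with Gender, all tokens), each counted per UPOS tag."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep plain word lines only; skip ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


def row(idx, form, upos, feats, head, deprel):
    """Build one 10-column CoNLL-U line for the toy sample below."""
    return "\t".join(
        [str(idx), form, "_", upos, "_", feats, str(head), deprel, "_", "_"]
    )


# Invented placeholder rows, not taken from UD_Beja-Autogramm.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "x", "DET", "Gender=Masc", 2, "det"),
    row(2, "y", "NOUN", "Gender=Masc", 3, "nsubj"),
    row(3, "z", "VERB", "_", 0, "root"),
    "",
])

with_gender, total = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Run over the real treebank files instead of `SAMPLE`, the first counter should correspond to the first number given for each tag above, provided the page counts tokens in the same way.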
DET
1733 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (1454; 84%), PronType=EMPTY (1450; 84%), Definite=Def (1008; 58%), Case=EMPTY (1000; 58%), Number=EMPTY (930; 54%).
DET tokens may have the following values of Gender:
Fem (697; 40% of non-empty Gender): =t, ti=, t=, toː=, tuː=, oːt, eːt, toːt, teː=, tuːt
Masc (1036; 60% of non-empty Gender): i=, oː=, w=, uː=, oːn, uːn, j=, =b, eːn, aː=
EMPTY (4): -a, -aː, =eː, mhasi
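The percentages in the DET listing follow from the raw counts. As a small worked check, assuming each share is rounded to the nearest whole percent (the helper name `shares` is illustrative):

```python
def shares(counts):
    """Express each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {value: round(100 * n / total) for value, n in counts.items()}


# Counts reported above for DET tokens with a non-empty Gender value.
det_gender = {"Fem": 697, "Masc": 1036}
print(shares(det_gender))

# Coverage: DET tokens with Gender over all DET tokens (1733 + 4 EMPTY).
det_coverage = round(100 * 1733 / (1733 + 4))
print(det_coverage)
```

The coverage value shows why the page reports 100% even though 4 DET tokens have no Gender: 1733 out of 1737 is about 99.8%, which rounds up.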
NOUN
1396 NOUN tokens (81% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (1164; 83%).
NOUN tokens may have the following values of Gender:
Fem (442; 32% of non-empty Gender): naː, lhaweː, na, takat, giɖʔa, sala, mʔari, ʃiha, ʔabaː, ʔaba
Masc (954; 68% of non-empty Gender): tak, doːr, mhiːn, jhaːm, mijʔat, dhaj, kʷiːkʷʔaːj, gaw, haˈwaːd, ʔar
EMPTY (323): kaːm, ʔar, ʔoːr, meːk, tʔiit, =na, =naː, na, kam, ʔaraːw
VERB
1063 VERB tokens (44% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (1037; 98%), Number=Sing (939; 88%), VerbClass=1 (676; 64%).
VERB tokens may have the following values of Gender:
Fem (247; 23% of non-empty Gender): tindi, tini, tidi, tiːd, tiːfi, akai, ʔeːta, tikati, ʔeːti, ʔabkin
Masc (816; 77% of non-empty Gender): indi, ini, jʔi, iːfi, iːbri, ikati, jʔiːni, id, isni, iːkti
EMPTY (1347): eːn, jʔeːn, jʔeːtiːt, akeːna, diːtiːt, ani, diːt, rhan, akajeː, difeː
SCONJ
157 SCONJ tokens (27% of all SCONJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which SCONJ and Gender co-occurred: PronType=Rel (121; 77%).
SCONJ tokens may have the following values of Gender:
Fem (54; 34% of non-empty Gender): =eːt, =jeːt, =t, ti=
Masc (103; 66% of non-empty Gender): =eːb, =jeːb, =b, ji=
EMPTY (435): =hoːb, =ajt, =eːk, =aj, =it, =jeːk, =eː, =i, =jeː, =ji
PRON
110 PRON tokens (13% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (103; 94%), Number=Sing (66; 60%), Person=EMPTY (57; 52%).
PRON tokens may have the following values of Gender:
Fem (28; 25% of non-empty Gender): ti=, -t, t=, ambataː, =oːki, imbateː, ombatoː
Masc (82; 75% of non-empty Gender): wi=, baruːk, umbaruːk, i=, w=, baroːk, baruː, baraː, barhi, barijoːk
EMPTY (710): =heːb, =i, =oː, =eː, ani, =hoːk, kna, =oːk, =oːn, aneːb
AUX
59 AUX tokens (21% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (58; 98%), Person=EMPTY (51; 86%), VerbType=EMPTY (49; 83%).
AUX tokens may have the following values of Gender:
Fem (8; 14% of non-empty Gender): tiki, taki, tidʔi, tindi, tirib, tiːha, tki
Masc (51; 86% of non-empty Gender): =wa, iki, ihi, indi, hijaː, irib, iːkti, dʔijaːb, idi, ini
EMPTY (225): =u, =i, =a, andi, akajeː, aki, nijad, dannʔi, ijajna, adi
INTJ
5 INTJ tokens (8% of all INTJ tokens) have a non-empty value of Gender.
INTJ tokens may have the following values of Gender:
Masc (5; 100% of non-empty Gender): jhaː
EMPTY (61): iraːnaj, əəə, ahaː, mmm, iraːni, jaːbi, hawawawawa, jhaː, mmmmm, nʔalla
ADJ
3 ADJ tokens (2% of all ADJ tokens) have a non-empty value of Gender.
ADJ tokens may have the following values of Gender:
Fem (1; 33% of non-empty Gender): kʷaɖaːɖat
Masc (2; 67% of non-empty Gender): nifri, sasuːbajaːb
EMPTY (146): daːji, kass, malia, sagi, daːwri, koː, dabal, daːjiː, raw, hadal
NUM
1 NUM token (2% of all NUM tokens) has a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): gaːt
EMPTY (58): mhaj, gaːl, mhall, alif, asarama, gali, mhali, gaːt, awwal, faɖig
PART
1 PART token (0% of all PART tokens) has a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Aspect=EMPTY (1; 100%), Mood=EMPTY (1; 100%), Polarity=EMPTY (1; 100%).
PART tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): ʔaʃaj
EMPTY (320): bak, ka=, han, ontʔa, ki=, baː=, bi=, jaː, bass, tʔa
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (1202; 82%),
NOUN –[acl:relcl]–> SCONJ (109; 73%),
VERB –[dislocated:subj]–> NOUN (16; 62%),
AUX –[compound:svc]–> VERB (7; 100%),
SCONJ –[fixed]–> DET (6; 100%),
NOUN –[discourse]–> DET (3; 60%),
VERB –[dep:redup]–> VERB (3; 100%),
VERB –[dep]–> PRON (3; 60%),
VERB –[dislocated:subj]–> PRON (3; 75%),
PART –[det]–> DET (2; 67%).
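Agreement counts of this kind can be approximated from a CoNLL-U file by comparing each node's Gender with that of its head. The sketch below makes an assumption this page does not confirm: that the percentage is the number of agreeing pairs divided by the number of pairs in which both parent and child carry a Gender value. The function names and the toy rows are illustrative only.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_sentences(conllu_text):
    """Yield one {id: (upos, gender, head, deprel)} dict per sentence."""
    sent = {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            if sent:
                yield sent
                sent = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep plain word lines only; skip ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        head = int(cols[6]) if cols[6].isdigit() else 0
        gender = parse_feats(cols[5]).get("Gender")
        sent[int(cols[0])] = (cols[3], gender, head, cols[7])


def gender_agreement(conllu_text):
    """Count parent-child pairs where both have Gender, and those that agree."""
    agree, both = Counter(), Counter()
    for sent in read_sentences(conllu_text):
        for upos, gender, head, deprel in sent.values():
            parent = sent.get(head)
            if gender is None or parent is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if gender == parent[1]:
                agree[key] += 1
    return agree, both


def row(idx, upos, feats, head, deprel):
    """Build one 10-column CoNLL-U line with a placeholder word form."""
    return "\t".join(
        [str(idx), "x", "_", upos, "_", feats, str(head), deprel, "_", "_"]
    )


# Invented placeholder rows, not taken from UD_Beja-Autogramm.
SAMPLE = "\n".join([
    row(1, "DET", "Gender=Masc", 2, "det"),
    row(2, "NOUN", "Gender=Masc", 0, "root"),
    "",
    row(1, "DET", "Gender=Fem", 2, "det"),
    row(2, "NOUN", "Gender=Masc", 0, "root"),
    row(3, "PRON", "_", 2, "dep"),
    "",
])

agree, both = gender_agreement(SAMPLE)
for key, n in both.items():
    parent_upos, deprel, child_upos = key
    pct = round(100 * agree[key] / n)
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {pct}%)")
```

Pairs in which either node lacks Gender are skipped entirely, so a low percentage here signals genuine mismatches rather than missing annotation.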