Treebank Statistics: UD_Hausa-NorthernAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1220 tokens (29%) have a non-empty value of Gender.
338 types (41%) occur at least once with a non-empty value of Gender.
223 lemmas (40%) occur at least once with a non-empty value of Gender.
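The token, type and lemma counts above can be approximated with a short script over a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the names `parse_feats`, `gender_coverage` and `TOY` are hypothetical, the rows in `TOY` are illustrative rather than copied from the treebank, and a "type" is assumed to be a distinct word form.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Person=3' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_coverage(conllu_text, feature="Gender"):
    """Return (with_feature, total) counts for tokens, types and lemmas."""
    n_tokens = n_tokens_with = 0
    forms, forms_with = set(), set()
    lemmas, lemmas_with = set(), set()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, feats = cols[1], cols[2], parse_feats(cols[5])
        n_tokens += 1
        forms.add(form)
        lemmas.add(lemma)
        if feature in feats:
            n_tokens_with += 1
            forms_with.add(form)
            lemmas_with.add(lemma)
    return {
        "tokens": (n_tokens_with, n_tokens),
        "types": (len(forms_with), len(forms)),
        "lemmas": (len(lemmas_with), len(lemmas)),
    }


# Toy rows for illustration only (assumed annotation, not treebank data).
TOY = "\n".join([
    "1\tkàreː\tkàreː\tNOUN\t_\tGender=Masc\t3\tnsubj\t_\t_",
    "2\tyaː\tyaː\tAUX\t_\tGender=Masc|Person=3\t3\taux\t_\t_",
    "3\tzoː\tzoː\tVERB\t_\t_\t0\troot\t_\t_",
    "4\truwaː\truwaː\tNOUN\t_\t_\t3\tobl\t_\t_",
    "",
    "1\tkàreː\tkàreː\tNOUN\t_\tGender=Masc\t0\troot\t_\t_",
])

print(gender_coverage(TOY))
```

Each pair is (items with a non-empty Gender, all items); dividing the first by the second gives the kind of percentage reported above.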
The feature is used with 8 part-of-speech tags: AUX (421; 10% instances), NOUN (420; 10% instances), PRON (196; 5% instances), VERB (137; 3% instances), ADJ (19; 0% instances), DET (18; 0% instances), PROPN (5; 0% instances), PART (4; 0% instances).
AUX
421 AUX tokens (61% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (421; 100%), Person=3 (334; 79%), Aspect=Perf (294; 70%).
AUX tokens may have the following values of Gender:
Fem (122; 29% of non-empty Gender): tanàː, tac, tà, taː, takè, tay, tas, ta’, tab, tak
Masc (299; 71% of non-empty Gender): yac, shinàː, yaː, kaː, shì, yaz, kà, yat, yay, yah
EMPTY (269): ankà, sunkà, à, ìn, anàː, sunàː, naː, akà, sun, inàː
NOUN
420 NOUN tokens (80% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (419; 100%), Definite=EMPTY (234; 56%).
NOUN tokens may have the following values of Gender:
Fem (110; 26% of non-empty Gender): dàːmisàː, kuːraː, gàyyaː, duːniyàː, yâː, raːnaː, shìgat, kwaːnaː, raːnak, rân
Masc (310; 74% of non-empty Gender): kàreː, gidaː, sâː, yaːɗaː, zàkaràː, ɓiki, mùtun, àbin, mùzuːruː, sarkin
EMPTY (106): ruwaː, ƴan, mutàːneː, ayaː, baːyuː, cinàn, giːwàːyeː, ruwan, zàːrùmmai, kuɗɗiː
| Paradigm ɗaː | Masc | Fem |
|---|---|---|
| _ | ɗaː | |
| Definite=Cons | ɗan | ƴa' |
| Definite=Def | ɗan | |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (125) occur with only one value of Gender.
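The lexical-feature check can be illustrated by counting, for each lemma, how many distinct Gender values it appears with. This is a minimal sketch under stated assumptions: `single_value_share` and the toy pairs are hypothetical, and the denominator is assumed to be the lemmas that occur with Gender at least once, which may differ from what the statistics script actually uses.

```python
from collections import defaultdict


def single_value_share(pairs):
    """pairs: iterable of (lemma, gender), with gender None when the feature is empty.

    Returns (lemmas_with_exactly_one_value, lemmas_with_any_value).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy observations for illustration only (assumed, not treebank counts).
toy_pairs = [
    ("kàreː", "Masc"),
    ("kàreː", "Masc"),
    ("ɗaː", "Masc"),
    ("ɗaː", "Fem"),    # a lemma seen with both values
    ("raːnaː", "Fem"),
    ("ruwaː", None),   # never carries Gender, so it is not counted
]

single, total = single_value_share(toy_pairs)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests the value is a property of the lemma rather than of the individual word form.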
PRON
196 PRON tokens (70% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (195; 99%), PronType=Prs (175; 89%), Person=3 (134; 68%), Case=EMPTY (114; 58%).
PRON tokens may have the following values of Gender:
Fem (64; 33% of non-empty Gender): ita, =tà, wàccan, tà, =ta, matà, ta, =kì, wàgga, keː
Masc (132; 67% of non-empty Gender): shiː, =tai, shì, shi, =nai, =kà, mai, =ka, kai, kà
EMPTY (86): =sù, niː, suː, musù, sù, koːmiː, min, =na, =su, ni
| Paradigm wani | Masc | Fem |
|---|---|---|
| Definite=Spec | wani | |
| | | wata |
Gender seems to be a lexical feature of PRON. 97% of lemmas (37) occur with only one value of Gender.
VERB
137 VERB tokens (19% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (131; 96%), VerbForm=Vnoun (131; 96%).
VERB tokens may have the following values of Gender:
Fem (77; 56% of non-empty Gender): tàhiyàː, zakkùwaː, bìyash, gàmuwaː, gàyyaː., hwaːɗùwaː, kankaryaː, sarɓaː, bìɗaː, cêːwaː
Masc (60; 44% of non-empty Gender): yîː, sôn, cîn, kwaːnaː, gudùː, sauraːreː, taːshìː, hwaɗìː, yîn, zaman
EMPTY (592): cèː, yi, cêː, zakà, zoː, ga, tàhi, ji, taɓà, bìye
| Paradigm ci | Masc | Fem |
|---|---|---|
| Definite=Cons | cîn | |
| | cîː | cânyeːwàː |
Gender seems to be a lexical feature of VERB. 98% of lemmas (40) occur with only one value of Gender.
ADJ
19 ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (19; 100%), Definite=Cons (12; 63%).
ADJ tokens may have the following values of Gender:
Fem (3; 16% of non-empty Gender): wacèː, ƴag, ƴak
Masc (16; 84% of non-empty Gender): ɗan, baƙiː, hwarin, hwariː, namijì, ƙàramiː, baƙin, hìyayyem, jan
EMPTY (8): jaː, maːtaː, jan, kwànce-kwancèn
| Paradigm ɗaː | Masc | Fem |
|---|---|---|
| | ɗan | ƴag, ƴak |
DET
18 DET tokens (20% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (17; 94%), Definite=EMPTY (12; 67%), PronType=Ind (10; 56%).
DET tokens may have the following values of Gender:
Fem (11; 61% of non-empty Gender): wata, wàccân, tan, waccè
Masc (7; 39% of non-empty Gender): wani, wanèː
EMPTY (70): nan, ga, wasu, du’, dug, su, dus, dut, duy, can
| Paradigm wani | Masc | Fem |
|---|---|---|
| Definite=Spec | wani | |
| | | wata |
PROPN
5 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Masc (5; 100% of non-empty Gender): Buːzuː, Bàhaushèː, Bàgawailèː
EMPTY (88): Tudùː, Galmaːwaː, Muːsà, Ìlleːlàː, Allàː, Bàːgai, Dòːdoː, Gidan, Ƙurƙìyaː, Tuːraːwaː
PART
4 PART tokens (2% of all PART tokens) have a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Aspect=EMPTY (4; 100%), Polarity=EMPTY (4; 100%).
PART tokens may have the following values of Gender:
Fem (4; 100% of non-empty Gender): ta
EMPTY (177): ta, gàː, nàː, ba, baːbù, kòː, dai, àkwai, bâː, bàː
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[compound]–> NOUN (20; 65%),
NOUN –[amod]–> ADJ (14; 82%),
VERB –[nmod]–> NOUN (13; 62%),
NOUN –[conj]–> NOUN (8; 80%),
VERB –[conj]–> VERB (6; 60%),
NOUN –[acl:relcl]–> NOUN (5; 63%),
VERB –[det]–> DET (4; 80%),
AUX –[nsubj]–> NOUN (2; 100%),
NOUN –[parataxis]–> NOUN (2; 100%),
PRON –[conj]–> NOUN (2; 67%).
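Agreement figures of this kind can be reproduced in spirit by walking each dependency edge and comparing the Gender of parent and child. The sketch below rests on assumptions: `agreement_by_relation` and the toy sentence are hypothetical, and the percentage is assumed to be taken over edges where both nodes carry a non-empty Gender, which the page does not state explicitly.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """sentence: list of dicts with keys id, upos, head, deprel, gender.

    Returns {(parent_upos, deprel, child_upos): (agreeing, comparable)}.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has no parent node
        if child["gender"] is None or parent["gender"] is None:
            continue  # only compare nodes that both carry Gender
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Toy sentence for illustration only (assumed structure, not treebank data).
toy_sentence = [
    {"id": 1, "upos": "NOUN", "head": 0, "deprel": "root", "gender": "Masc"},
    {"id": 2, "upos": "ADJ", "head": 1, "deprel": "amod", "gender": "Masc"},
    {"id": 3, "upos": "NOUN", "head": 1, "deprel": "conj", "gender": "Fem"},
    {"id": 4, "upos": "ADJ", "head": 1, "deprel": "amod", "gender": "Fem"},
]

for (parent, rel, child), (ok, n) in agreement_by_relation(toy_sentence).items():
    print(f"{parent} -[{rel}]-> {child} ({ok}; {100 * ok // n}%)")
```

Summing these per-sentence results over a whole treebank and sorting by the agreeing count would give a ranking comparable to the list above.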