Treebank Statistics: UD_Macedonian-MTB: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem
, Masc
, Neut
.
This is a layered feature with the following layers: Gender, Gender[psor].
339 tokens (25%) have a non-empty value of Gender
.
234 types (43%) occur at least once with a non-empty value of Gender
.
210 lemmas (44%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: NOUN (177; 13% instances), PRON (68; 5% instances), ADJ (35; 3% instances), PROPN (26; 2% instances), DET (15; 1% instances), VERB (14; 1% instances), NUM (3; 0% instances), AUX (1; 0% instances).
NOUN
177 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (151; 85%), Definite=Ind (112; 63%).
NOUN
tokens may have the following values of Gender
:
Fem
(67; 38% of non-emptyGender
): јакна, година, авантури, бронза, книгата, колата, пари, снимка, собата, тортаMasc
(76; 43% of non-emptyGender
): Натпреварот, крајот, автомобил, дена, испитот, компјутерот, облаците, професорот, син, сладоледNeut
(34; 19% of non-emptyGender
): кино, Детето, дете, злато, место, писмо, Луѓето, Сонцето, време, времетоEMPTY
(2): Рим, Спа
Paradigm сонце | Fem | Neut |
---|---|---|
Сонцето | Сонцето |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (124) occur only with one value of Gender
.
PRON
68 PRON tokens (38% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (68; 100%), Number=Sing (67; 99%), Definite=EMPTY (55; 81%), Person=3 (51; 75%), PronType=Prs (51; 75%).
PRON
tokens may have the following values of Gender
:
Fem
(15; 22% of non-emptyGender
): ја, Таа, сите, ѝMasc
(41; 60% of non-emptyGender
): го, му, Тој, кој, Неговата, каков, којшто, него, нему, јасNeut
(12; 18% of non-emptyGender
): тоа, го, којшто, Што, нешто, ништоEMPTY
(112): се, ми, ме, тие, ти, ги, ние, си, што, Сѐ
Paradigm го | Masc | Neut |
---|---|---|
Definite=Def|Person=3 | го | |
Person=3 | го | Го |
го |
ADJ
35 ADJ tokens (80% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (35; 100%), Degree=Pos (32; 91%), Definite=Ind (21; 60%).
ADJ
tokens may have the following values of Gender
:
Fem
(12; 34% of non-emptyGender
): голема, нова, учебната, добра, мала, мила, минатата, првата, убаваMasc
(16; 46% of non-emptyGender
): утрешниот, вознемирен, главниот, дрзок, зелениот, кинески, минатиот, незадоволен, позабавен, познатNeut
(7; 20% of non-emptyGender
): добро, корисно, одлично, прекрасно, светлото, слободно, слученоEMPTY
(9): болен, глупави, долги, играни, нови, одбрани, последниве, презадоволни, скапа
Paradigm добар | Fem | Neut |
---|---|---|
добра | добро |
PROPN
26 PROPN tokens (96% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (26; 100%), Definite=EMPTY (15; 58%).
PROPN
tokens may have the following values of Gender
:
Fem
(8; 31% of non-emptyGender
): Мери, Марија, Џејн, Браун, ФранцијаMasc
(17; 65% of non-emptyGender
): Петар, Јован, Марко, Бетовен, Вардар, Лудвиг, Париз, Сем, Смит, ТинексNeut
(1; 4% of non-emptyGender
): ИгуацуEMPTY
(1): ван
Gender
seems to be lexical feature of PROPN
. 100% lemmas (17) occur only with one value of Gender
.
DET
15 DET tokens (75% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (15; 100%), Case=EMPTY (11; 73%), Person=EMPTY (10; 67%), Poss=EMPTY (9; 60%), Definite=EMPTY (8; 53%).
DET
tokens may have the following values of Gender
:
Fem
(4; 27% of non-emptyGender
): Оваа, една, некоја, нејзиниотMasc
(6; 40% of non-emptyGender
): мојот, каков, неговиот, својот, твојотNeut
(5; 33% of non-emptyGender
): она, Ова, такво, тоаEMPTY
(5): моите, Тие, некои
Paradigm ова | Fem | Neut |
---|---|---|
Case=Nom | Оваа | |
Ова |
Gender
seems to be lexical feature of DET
. 92% lemmas (11) occur only with one value of Gender
.
VERB
14 VERB tokens (5% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Number=Sing (14; 100%), Mood=EMPTY (13; 93%), Person=EMPTY (12; 86%), VerbForm=Part (12; 86%), Aspect=Perf (10; 71%), Tense=EMPTY (10; 71%).
VERB
tokens may have the following values of Gender
:
Fem
(3; 21% of non-emptyGender
): јакна, прочитанаMasc
(9; 64% of non-emptyGender
): одземен, возбуден, гледал, казнет, можел, напишал, оставил, совладанNeut
(2; 14% of non-emptyGender
): испорачано, слученоEMPTY
(262): дојде, облеков, студеше, сакам, јави, Мислам, дојдеш, воодушеви, гледав, дојди
Gender
seems to be lexical feature of VERB
. 100% lemmas (12) occur only with one value of Gender
.
NUM
3 NUM tokens (43% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: Definite=Ind (3; 100%), NumType=Card (3; 100%), Number=Plur (2; 67%).
NUM
tokens may have the following values of Gender
:
Fem
(1; 33% of non-emptyGender
): еднаMasc
(2; 67% of non-emptyGender
): дваEMPTY
(4): 15, неколку, пет, три
AUX
1 AUX tokens (2% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Aspect=Imp (1; 100%), Mood=Ind (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Tense=EMPTY (1; 100%), VerbForm=Part (1; 100%), Voice=Act (1; 100%).
AUX
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): билEMPTY
(61): ќе, е, беше, бев, биде, сте, Бевме, Сум, би, бидат
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (18; 75%),
NOUN –[det]–> DET (10; 63%),
PROPN –[flat]–> PROPN (3; 75%),
VERB –[nsubj:pass]–> NOUN (3; 100%),
ADJ –[conj]–> ADJ (2; 67%),
ADJ –[nsubj]–> NOUN (2; 100%),
ADJ –[nsubj]–> PRON (2; 67%),
ADJ –[nsubj]–> PROPN (2; 100%),
NOUN –[expl]–> PRON (2; 100%),
PROPN –[appos]–> NOUN (2; 100%).