Treebank Statistics: UD_Swedish-Talbanken: Features: Gender
This feature is universal.
It occurs with 4 different values: Com
, Fem
, Masc
, Neut
.
33466 tokens (35%) have a non-empty value of Gender
.
10175 types (67%) occur at least once with a non-empty value of Gender
.
6792 lemmas (65%) occur at least once with a non-empty value of Gender
.
The feature is used with 6 part-of-speech tags: NOUN (22562; 23% instances), PRON (4074; 4% instances), DET (3714; 4% instances), ADJ (2998; 3% instances), NUM (91; 0% instances), VERB (27; 0% instances).
NOUN
22562 NOUN tokens (98% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Case=Nom (21440; 95%), Number=Sing (15354; 68%), Definite=Ind (14734; 65%).
NOUN
tokens may have the following values of Gender
:
Com
(15734; 70% of non-emptyGender
): del, procent, människor, tid, familjen, kvinnor, man, dag, miljoner, frågaFem
(1; 0% of non-emptyGender
): nuptiamMasc
(1; 0% of non-emptyGender
): consensusNeut
(6826; 30% of non-emptyGender
): år, barn, äktenskapet, barnen, sätt, samhället, arbete, fall, äktenskap, barnetEMPTY
(435): kr, %, dr, s., kap., proc, KPI, milj, mån, kl
Paradigm äktenskap | Neut | Com |
---|---|---|
Case=Gen|Definite=Def|Number=Sing | äktenskapets | äktenskapens |
Case=Gen|Definite=Ind|Number=Sing | äktenskaps | |
Case=Gen|Definite=Ind|Number=Plur | äktenskaps | |
Case=Nom|Definite=Def|Number=Sing | äktenskapet | äktenskapen |
Case=Nom|Definite=Ind|Number=Sing | äktenskap | |
Case=Nom|Definite=Ind|Number=Plur | äktenskap |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (5835) occur only with one value of Gender
.
PRON
4074 PRON tokens (61% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Poss=EMPTY (3709; 91%), Number=Sing (3656; 90%), Definite=Def (2944; 72%), PronType=Prs (2880; 71%), Case=EMPTY (2394; 59%).
PRON
tokens may have the following values of Gender
:
Com
(2341; 57% of non-emptyGender
): man, vi, den, du, sin, han, jag, oss, hon, enNeut
(1733; 43% of non-emptyGender
): det, detta, vad, sitt, något, vårt, allt, vilket, ditt, mycketEMPTY
(2599): som, de, sig, dem, sina, deras, våra, andra, många, alla
Paradigm den | Neut | Com |
---|---|---|
det | den |
DET
3714 DET tokens (76% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (3714; 100%), PronType=Art (3244; 87%), Definite=Ind (2322; 63%).
DET
tokens may have the following values of Gender
:
Com
(2576; 69% of non-emptyGender
): en, den, denna, någon, ingen, vilken, var, all, nånNeut
(1138; 31% of non-emptyGender
): ett, det, detta, något, allt, inget, vilket, vart, vartannatEMPTY
(1163): de, alla, varje, dessa, några, vilka, båda, inga, bägge, vardera
Paradigm en | Neut | Com |
---|---|---|
en | ||
PronType=Art | ett | en |
ADJ
2998 ADJ tokens (35% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Case=Nom (2993; 100%), Degree=Pos (2991; 100%), Number=Sing (2977; 99%), Definite=Ind (2947; 98%), Tense=EMPTY (2569; 86%), VerbForm=EMPTY (2569; 86%).
ADJ
tokens may have the following values of Gender
:
Com
(1938; 65% of non-emptyGender
): stor, annan, själv, sådan, viss, egen, ny, hög, kristen, socialNeut
(1060; 35% of non-emptyGender
): annat, svårt, nytt, möjligt, sådant, viktigt, eget, socialt, stort, övrigtEMPTY
(5673): olika, andra, nya, många, stora, samma, större, första, vissa, hela
Paradigm stor | Neut | Com |
---|---|---|
Definite=Def|Degree=Sup | störste | |
Definite=Ind|Degree=Pos|Number=Sing | stort | stor |
NUM
91 NUM tokens (5% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: Case=Nom (91; 100%), NumType=Card (91; 100%).
NUM
tokens may have the following values of Gender
:
Com
(59; 65% of non-emptyGender
): enNeut
(32; 35% of non-emptyGender
): ettEMPTY
(1649): två, tre, 1, 20, 2, 1970, 3, 10, 1971, 7
Paradigm en | Neut | Com |
---|---|---|
ett | en |
VERB
27 VERB tokens (0% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (27; 100%), Voice=Pass (27; 100%), Tense=Past (20; 74%), VerbForm=Part (20; 74%).
VERB
tokens may have the following values of Gender
:
Com
(21; 78% of non-emptyGender
): vald, vänd, hörselskadad, accepterad, förstärkt, förändrad, ifylld, komplicerad, likställd, lämnadNeut
(6; 22% of non-emptyGender
): förbjudet, opåverkat, reglerat, sysselsatt, tillgodosett, upplagtEMPTY
(9843): har, finns, blir, få, får, ha, är, gäller, ger, går
Gender
seems to be lexical feature of VERB
. 100% lemmas (22) occur only with one value of Gender
.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (3469; 77%),
NOUN –[nmod]–> NOUN (1703; 54%),
NOUN –[conj]–> NOUN (1389; 66%),
NOUN –[nmod:poss]–> NOUN (552; 60%),
NOUN –[nmod:poss]–> PRON (351; 51%),
NOUN –[appos]–> NOUN (180; 63%),
NOUN –[nsubj]–> NOUN (156; 56%),
ADJ –[conj]–> ADJ (107; 71%),
ADJ –[expl]–> PRON (105; 86%),
NOUN –[acl]–> NOUN (55; 57%).