Treebank Statistics: UD_Swedish-LinES: Features: Gender
This feature is universal.
It occurs with 2 different values: Com, Neut.
29909 tokens (33%) have a non-empty value of Gender.
8469 types (59%) occur at least once with a non-empty value of Gender.
5978 lemmas (59%) occur at least once with a non-empty value of Gender.
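The token, type and lemma counts above can be reproduced in spirit with a short script. The following is a minimal sketch, not the tool that generated this page: it assumes the treebank is available as CoNLL-U text, that "types" means distinct word forms (case-sensitive), and that multiword token ranges and empty nodes are skipped. The names `parse_feats`, `read_words`, `coverage` and `SAMPLE` are hypothetical, and the sample sentence is invented for illustration.

```python
def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Com|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_words(conllu_text):
    """Yield (form, lemma, feats) for every syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword token range or empty node
        yield cols[1], cols[2], parse_feats(cols[5])


def coverage(conllu_text, feature="Gender"):
    """Return (with_feature, total) counts for tokens, types and lemmas."""
    words = list(read_words(conllu_text))
    marked = [w for w in words if feature in w[2]]
    return {
        "tokens": (len(marked), len(words)),
        "types": (len({w[0] for w in marked}), len({w[0] for w in words})),
        "lemmas": (len({w[1] for w in marked}), len({w[1] for w in words})),
    }


# Toy sentence invented for illustration; it is not taken from the treebank.
SAMPLE = "\n".join([
    "# text = en man såg ett barn .",
    "1\ten\ten\tDET\t_\tDefinite=Ind|Gender=Com|Number=Sing\t2\tdet\t_\t_",
    "2\tman\tman\tNOUN\t_\tGender=Com|Number=Sing\t3\tnsubj\t_\t_",
    "3\tsåg\tse\tVERB\t_\tTense=Past\t0\troot\t_\t_",
    "4\tett\ten\tDET\t_\tDefinite=Ind|Gender=Neut|Number=Sing\t5\tdet\t_\t_",
    "5\tbarn\tbarn\tNOUN\t_\tGender=Neut|Number=Sing\t3\tobj\t_\t_",
    "6\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])

print(coverage(SAMPLE))
```

Each percentage on this page would then correspond to the first number of a pair divided by the second, under the assumptions stated above.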
The feature is used with 8 part-of-speech tags: NOUN (15768; 17% instances), PRON (7596; 8% instances), DET (4030; 4% instances), ADJ (2477; 3% instances), PROPN (25; 0% instances), NUM (8; 0% instances), X (3; 0% instances), VERB (2; 0% instances).
NOUN
15768 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Case=Nom (15286; 97%), Number=Sing (11470; 73%), Definite=Ind (10310; 65%).
NOUN tokens may have the following values of Gender:
Com (11068; 70% of non-empty Gender): far, gång, man, sidan, del, mor, väg, fråga, mr, mannen
Neut (4700; 30% of non-empty Gender): sätt, år, fält, data, ögon, ansikte, barn, exempel, fältet, huvudet
EMPTY (205): går, Language, års, Web, närvarande, vänster, Components, Server, Engine, Stylesheet
Paradigm man | Neut | Com |
---|---|---|
Case=Gen\|Definite=Def\|Number=Sing | mannens | |
Case=Gen\|Definite=Ind\|Number=Sing | mans | |
Case=Nom\|Definite=Def\|Number=Sing | mannen | |
Case=Nom\|Definite=Def\|Number=Plur | männen | männen |
Case=Nom\|Definite=Ind\|Number=Sing | man | |
Case=Nom\|Definite=Ind\|Number=Plur | män | män, man |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (4890) occur with only one value of Gender.
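The "lexical feature" observation rests on counting how many lemmas of a given part of speech are ever seen with more than one Gender value. The sketch below illustrates that count under stated assumptions: the denominator is taken to be the lemmas that occur at least once with a non-empty Gender, and the input is a pre-parsed list of `(lemma, upos, feats)` tuples. The function name `single_value_share` and the toy rows are hypothetical and not drawn from the treebank.

```python
from collections import defaultdict


def single_value_share(rows, upos, feature="Gender"):
    """Count lemmas of one UPOS tag that occur with exactly one feature value.

    rows: iterable of (lemma, upos, feats_dict) tuples.
    Returns (lemmas_with_one_value, lemmas_with_any_value).
    """
    values = defaultdict(set)
    for lemma, tag, feats in rows:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy rows invented for illustration only.
ROWS = [
    ("man", "NOUN", {"Gender": "Com"}),
    ("man", "NOUN", {"Gender": "Neut"}),
    ("barn", "NOUN", {"Gender": "Neut"}),
    ("far", "NOUN", {"Gender": "Com"}),
    ("far", "NOUN", {"Gender": "Com"}),
    ("se", "VERB", {}),
]

single, total = single_value_share(ROWS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A share close to 100% suggests the value is a property of the lemma rather than of the individual word form.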
PRON
7596 PRON tokens (70% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (7199; 95%), Poss=EMPTY (6909; 91%), Definite=Def (6717; 88%), PronType=Prs (6665; 88%).
PRON tokens may have the following values of Gender:
Com (5536; 73% of non-empty Gender): han, jag, du, vi, hon, honom, mig, den, sin, man
Neut (2060; 27% of non-empty Gender): det, vad, detta, sitt, allt, något, ingenting, mitt, vilket, annat
EMPTY (3206): som, sig, de, hans, dem, sina, hennes, deras, alla, dom
Paradigm den | Neut | Com |
---|---|---|
| | det | den |
DET
4030 DET tokens (85% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4030; 100%), PronType=Art (3536; 88%), Definite=Ind (2822; 70%).
DET tokens may have the following values of Gender:
Com (2731; 68% of non-empty Gender): en, den, någon, denna, ingen, all, varenda, vilken, denne, nån
Neut (1299; 32% of non-empty Gender): ett, det, något, detta, inget, allt, vilket, nåt, intet, vartenda
EMPTY (709): de, alla, några, varje, dessa, inga, båda, vilka, dom, the
Paradigm en | Neut | Com |
---|---|---|
| | ett | en |
ADJ
2477 ADJ tokens (39% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (2477; 100%), Definite=Ind (2476; 100%), Number=Sing (2475; 100%), Degree=Pos (2470; 100%), Tense=EMPTY (2320; 94%), VerbForm=EMPTY (2319; 94%).
ADJ tokens may have the following values of Gender:
Com (1652; 67% of non-empty Gender): själv, stor, egen, annan, liten, vit, gammal, lång, sådan, ny
Neut (825; 33% of non-empty Gender): annat, stort, nytt, eget, litet, möjligt, svårt, rött, taget, klart
EMPTY (3828): andra, hela, samma, första, flera, många, enda, nya, vita, egna
Paradigm annan | Neut | Com |
---|---|---|
| | annat | annan |
PROPN
25 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (23; 92%), Number=Sing (23; 92%).
PROPN tokens may have the following values of Gender:
Com (23; 92% of non-empty Gender): Stella, Athena, Alice, Jove, Dior, Hefaistos, Lutyens, Psaltaren, Ringen, Rover
Neut (2; 8% of non-empty Gender): Cunards, Jung
EMPTY (2835): Harry, Quinn, Stillman, Bray, Auster, Access, Microsoft, Weasley, Ron, Dobby
Gender seems to be a lexical feature of PROPN. 100% of lemmas (15) occur with only one value of Gender.
NUM
8 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Com (4; 50% of non-empty Gender): en
Neut (4; 50% of non-empty Gender): ett
EMPTY (497): två, tre, en, fem, sex, fyra, tio, 1, 2, 2000
Paradigm en | Neut | Com |
---|---|---|
Definite=Ind | ett | en |
NumType=Card | Ett |
X
3 X tokens (11% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Case=Nom (3; 100%), Definite=Ind (3; 100%), Number=Sing (3; 100%).
X tokens may have the following values of Gender:
Neut (3; 100% of non-empty Gender): alium, coniunctis, internum
EMPTY (25): W3C, foie, TSQL, maris, stella, .adp, .lpk, .mdb, .odc, .udl
VERB
2 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=EMPTY (2; 100%), Voice=EMPTY (2; 100%).
VERB tokens may have the following values of Gender:
Com (1; 50% of non-empty Gender): krigsmålad
Neut (1; 50% of non-empty Gender): sitt
EMPTY (11353): sa, var, hade, gick, kom, har, såg, ta, göra, sade
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3715; 85%),
NOUN –[nmod]–> NOUN (1300; 59%),
NOUN –[conj]–> NOUN (741; 63%),
NOUN –[nmod:poss]–> PRON (605; 51%),
NOUN –[nmod:poss]–> NOUN (238; 54%),
ADJ –[nsubj]–> PRON (183; 71%),
ADJ –[conj]–> ADJ (132; 74%),
NOUN –[appos]–> NOUN (80; 70%),
NOUN –[nsubj]–> NOUN (79; 71%),
ADJ –[expl]–> PRON (74; 85%).
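The agreement figures above can be approximated by walking each dependency edge and comparing the Gender of the parent and child nodes. The sketch below is one plausible reading, not the generator's actual method: it assumes the percentage is the share of agreeing edges among edges of the same type where both nodes carry a non-empty Gender. The function name `agreement_by_relation`, the dictionary layout and the toy sentence are hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Gender"):
    """Count agreeing edges per (parent UPOS, deprel, child UPOS).

    sentence: list of dicts with keys id, upos, head, deprel, feats.
    Returns {key: (agreeing_edges, edges_where_both_nodes_have_the_feature)}.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent node
        p_val = parent["feats"].get(feature)
        c_val = child["feats"].get(feature)
        if p_val is None or c_val is None:
            continue  # only edges where both ends carry the feature are compared
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if p_val == c_val:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Toy dependency tree invented for illustration only.
SENTENCE = [
    {"id": 1, "upos": "DET", "head": 2, "deprel": "det", "feats": {"Gender": "Com"}},
    {"id": 2, "upos": "NOUN", "head": 3, "deprel": "nsubj", "feats": {"Gender": "Com"}},
    {"id": 3, "upos": "VERB", "head": 0, "deprel": "root", "feats": {}},
    {"id": 4, "upos": "DET", "head": 5, "deprel": "det", "feats": {"Gender": "Neut"}},
    {"id": 5, "upos": "NOUN", "head": 3, "deprel": "obj", "feats": {"Gender": "Neut"}},
    {"id": 6, "upos": "NOUN", "head": 2, "deprel": "conj", "feats": {"Gender": "Neut"}},
]

for key, (agreeing, compared) in agreement_by_relation(SENTENCE).items():
    print(key, agreeing, compared)
```

Summing these per-sentence counts over a whole treebank and sorting by the agreeing count would give a ranking of the same shape as the list above.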