Treebank Statistics: UD_Faroese-OFT: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
3898 tokens (39%) have a non-empty value of Gender.
2250 types (72%) occur at least once with a non-empty value of Gender.
1557 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (2434; 24% instances), ADJ (721; 7% instances), PROPN (329; 3% instances), DET (185; 2% instances), PRON (184; 2% instances), NUM (34; 0% instances), VERB (7; 0% instances), ADV (4; 0% instances).
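Counts like the ones above can be approximated with a short script. The following is a minimal sketch, assuming the treebank is available as standard ten-column CoNLL-U text; the names `parse_feats`, `gender_stats` and `SAMPLE` are hypothetical, and the sample sentence is invented for illustration rather than taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_stats(conllu_text, feature="Gender"):
    """Return (all tokens, tokens with the feature, per-UPOS counts)."""
    tokens = 0
    with_feature = 0
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword token ranges and empty nodes
        tokens += 1
        if feature in parse_feats(cols[5]):
            with_feature += 1
            per_upos[cols[3]] += 1
    return tokens, with_feature, per_upos


# Invented sample sentence (assumption, not treebank data).
SAMPLE = "\n".join([
    "# text = ein stórur býur .",
    "1\tein\tein\tDET\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tdet\t_\t_",
    "2\tstórur\tstórur\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tamod\t_\t_",
    "3\tbýur\tbýur\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])

total, marked, by_upos = gender_stats(SAMPLE)
print(total, marked, dict(by_upos))
```

Dividing the second number by the first gives the share of tokens with a non-empty value; the per-tag counter corresponds to the part-of-speech breakdown.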
NOUN
2434 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1875; 77%), Definite=Ind (1496; 61%), Case=Nom (1468; 60%).
NOUN tokens may have the following values of Gender:
Fem (691; 28% of non-empty Gender): kommuna, kommunur, kommunu, ár, oyggin, oynni, øld, bygdini, kommununi, ferðavinna
Masc (1174; 48% of non-empty Gender): býur, høvuðsstaður, býurin, høvuðsstaðurin, landslutinum, partur, týdning, Meginparturin, limur, landslutur
Neut (569; 23% of non-empty Gender): fólkinum, fólk, landinum, landi, landið, grundarlagið, mál, Endamálið, fólkatalið, lýðveldi
EMPTY (63): USA, ES, t.d., uml., mió, A, á.Kr., FLOT, FU, FUR
Paradigm ár | Fem | Neut |
---|---|---|
Case=Acc\|Definite=Def\|Number=Plur | árini | |
Case=Dat\|Definite=Ind\|Number=Sing | ár | |
Case=Gen\|Definite=Def\|Number=Sing | Ársins | |
Case=Gen\|Definite=Ind\|Number=Plur | ára | |
Case=Nom\|Definite=Def\|Number=Sing | árið | |
Case=Nom\|Definite=Def\|Number=Plur | Árarnar | |
Case=Nom\|Definite=Ind\|Number=Plur | ár |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (1154) occur with only one value of Gender.
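The check behind a statement like this can be sketched as follows: collect the set of Gender values seen for each lemma of a given part of speech and count the lemmas that have exactly one. This is only an illustrative sketch; the function name `single_value_share` and the input shape are hypothetical, and the threshold the statistics generator uses before calling a feature "lexical" is not stated on this page.

```python
from collections import defaultdict


def single_value_share(rows, upos="NOUN", feature="Gender"):
    """rows: iterable of (lemma, upos, feats_dict) triples.

    Returns (number of lemmas with exactly one value, their share among
    all lemmas of that part of speech that carry the feature at all).
    """
    values = defaultdict(set)
    for lemma, tag, feats in rows:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Invented rows for illustration (assumption, not treebank data).
rows = [
    ("býur", "NOUN", {"Gender": "Masc"}),
    ("býur", "NOUN", {"Gender": "Masc"}),
    ("land", "NOUN", {"Gender": "Neut"}),
    ("stórur", "ADJ", {"Gender": "Fem"}),
]
print(single_value_share(rows))
```

A lemma that appears with two different values lowers the share, which is what would distinguish an inflectional feature (as Gender is for ADJ) from a lexical one.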
ADJ
721 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (616; 85%), Case=Nom (528; 73%), Definite=Ind (485; 67%), Number=Sing (470; 65%).
ADJ tokens may have the following values of Gender:
Fem (153; 21% of non-empty Gender): størsta, fleiri, nógvar, stór, turr, aðrar, føroysku, nógv, onnur, somu
Masc (409; 57% of non-empty Gender): størsti, stórur, stóran, nógvur, stórir, føroyskur, aðrir, mangir, amerikanska, einasti
Neut (159; 22% of non-empty Gender): nógv, mong, stórt, Flestu, stór, sama, ymisk, annað, fleiri, føroyskt
EMPTY (39): 2., 1., 18., 19., 11., 12., 16., 17., 29., 3.
Paradigm stórur | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Definite=Def\|Degree=Sup\|Number=Plur | Størstu | | |
Case=Acc\|Definite=Ind\|Number=Sing | stóran | stóra | stórt |
Case=Acc\|Definite=Ind\|Number=Plur | stórar | stór | |
Case=Dat\|Definite=Def\|Degree=Sup\|Number=Sing | størsta | | |
Case=Dat\|Definite=Def\|Degree=Sup\|Number=Plur | størstu | | |
Case=Dat\|Definite=Def\|Number=Plur | stóru | | |
Case=Dat\|Definite=Ind\|Number=Sing | stórum | stórum | |
Case=Dat\|Definite=Ind\|Number=Plur | stórum | stórum | |
Case=Nom\|Definite=Def\|Degree=Sup\|Number=Sing | størsti | størsta | størsta |
Case=Nom\|Definite=Def\|Degree=Sup\|Number=Plur | Størstu | | |
Case=Nom\|Definite=Def\|Number=Sing | stóri | | |
Case=Nom\|Definite=Ind\|Number=Sing | stórur | stór | stórt, størri |
Case=Nom\|Definite=Ind\|Number=Plur | stórir | stórar | stór |
PROPN
329 PROPN tokens (41% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (251; 76%), Definite=Ind (236; 72%).
PROPN tokens may have the following values of Gender:
Fem (126; 38% of non-empty Gender): Føroyum, Føroya, Føroyar, Danmark, Kina, Keypmannahavn, Florida, Tórshavnar, Tórshavn, Bergtóra
Masc (90; 27% of non-empty Gender): Kalifornia, Tróndur, Jákupsson, Bergur, Dávid, Gásadali, Hanus, Jóannes, Jógvan, Magnus
Neut (113; 34% of non-empty Gender): Noregi, Fraklandi, Niðurlondum, Noregs, Grønlandi, Hordalandi, Island, Russlandi, Estlandi, Grønland
EMPTY (476): Kanada, Amerika, Italia, New, Nigeria, York, Asia, Jackson, Kastrup, Mississippi
Gender seems to be a lexical feature of PROPN. 100% of lemmas (131) occur with only one value of Gender.
DET
185 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (172; 93%), Case=Nom (145; 78%).
DET tokens may have the following values of Gender:
Fem (32; 17% of non-empty Gender): ein, eina, øll, Allar, eini, sína, Summi, allari, ei, einari
Masc (94; 51% of non-empty Gender): ein, einum, allir, Summir, allan, allur, sínum, mínir
Neut (59; 32% of non-empty Gender): eitt, einum, annað, síni, sínum, Øll
EMPTY (1): alt
Paradigm ein | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | ein | eina | eitt |
Case=Dat | einum | eini, einari | einum |
Case=Nom | ein | ein, ei | eitt |
PRON
184 PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (168; 91%), Case=Nom (146; 79%), Person=3 (143; 78%), PronType=Prs (143; 78%).
PRON tokens may have the following values of Gender:
Fem (68; 37% of non-empty Gender): hon, henni, hennara, hana, onga, tær
Masc (83; 45% of non-empty Gender): hann, teir, hansara, honum, nakrir, Allir, Báðir, Summir, hesir, nakar
Neut (33; 18% of non-empty Gender): hetta, Hatta, Hettar, hvat, okkurt
EMPTY (108): tað, sum, seg, tey, ið, sær, vit, eg, Hon, man
Paradigm hesin | Masc | Neut |
---|---|---|
Case=Acc\|Number=Sing | hetta | |
Case=Nom\|Number=Sing | hetta, Hettar | |
Case=Nom\|Number=Plur | hesir |
NUM
34 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (34; 100%), Case=Nom (27; 79%).
NUM tokens may have the following values of Gender:
Fem (23; 68% of non-empty Gender): ein, tvær, trimum, tríggjar
Masc (6; 18% of non-empty Gender): tveir
Neut (5; 15% of non-empty Gender): trý, tveimum, tvey
EMPTY (205): %, 2005, 2011, 10, 2010, 4, 18, 20, 2008, 26
Paradigm tvey | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | tveir | tvær | |
Case=Dat | tveimum | ||
Case=Nom | tveir | tvær | tvey |
VERB
7 VERB tokens (1% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (7; 100%), Person=EMPTY (7; 100%), Tense=Past (7; 100%), VerbForm=Part (7; 100%), Number=Plur (5; 71%).
VERB tokens may have the following values of Gender:
Fem (2; 29% of non-empty Gender): Sameindu, nevndar
Masc (4; 57% of non-empty Gender): flettir, kendastur, keyptir, prentaðir
Neut (1; 14% of non-empty Gender): samlaða
EMPTY (566): býr, hevur, kom, liggur, Sí, eru, fer, varð, fór, er
ADV
4 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Masc (1; 25% of non-empty Gender): vanliga
Neut (3; 75% of non-empty Gender): størsta, vanliga, veldiga
EMPTY (422): eisini, ikki, har, nú, so, enn, tó, tá, fyrr, ofta
Paradigm vanligur | Masc | Neut |
---|---|---|
 | vanliga | vanliga |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (558; 92%),
NOUN –[det]–> DET (180; 98%),
NOUN –[conj]–> NOUN (86; 57%),
ADJ –[nsubj]–> NOUN (64; 85%),
NOUN –[nsubj]–> PRON (50; 56%),
NOUN –[parataxis]–> NOUN (23; 51%),
ADJ –[conj]–> ADJ (20; 91%),
ADJ –[nsubj]–> PRON (12; 100%),
ADJ –[nmod]–> NOUN (4; 80%),
ADJ –[nsubj]–> PROPN (4; 67%).
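Agreement figures of this kind can be approximated by walking the dependency tree. The sketch below assumes that the percentage is computed over parent-child pairs in which both nodes carry a Gender value; the page does not state the denominator, so this is an assumption. The function name `agreement_by_relation` and the token dictionaries are hypothetical, and the sample sentence is invented.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Gender"):
    """sentence: list of dicts with keys id, upos, feats, head, deprel.

    Returns {(parent_upos, deprel, child_upos): (agreeing pairs, share)},
    counting only pairs where both nodes have a value for the feature.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent node
        p_val = parent["feats"].get(feature)
        c_val = child["feats"].get(feature)
        if p_val is None or c_val is None:
            continue  # agreement is undefined unless both carry the feature
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if p_val == c_val:
            agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}


# Invented sentence for illustration (assumption, not treebank data).
sentence = [
    {"id": 1, "upos": "DET", "feats": {"Gender": "Masc"}, "head": 3, "deprel": "det"},
    {"id": 2, "upos": "ADJ", "feats": {"Gender": "Fem"}, "head": 3, "deprel": "amod"},
    {"id": 3, "upos": "NOUN", "feats": {"Gender": "Masc"}, "head": 0, "deprel": "root"},
]
print(agreement_by_relation(sentence))
```

Summing the per-sentence counters over a whole treebank and sorting by the number of agreeing pairs would give a ranking comparable to the list above.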