Treebank Statistics: UD_Irish-Cadhan: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1578 tokens (33%) have a non-empty value of Gender.
1082 types (58%) occur at least once with a non-empty value of Gender.
760 lemmas (69%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (985; 21% instances), PROPN (195; 4% instances), ADP (109; 2% instances), ADJ (102; 2% instances), DET (102; 2% instances), PRON (85; 2% instances).
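Per-tag counts like the ones above can be derived from a CoNLL-U file by checking the FEATS column of each token. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `gender_by_upos`) are hypothetical, and `SAMPLE` is a made-up fragment for illustration, not text from UD_Irish-Cadhan.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment for illustration only;
# it is NOT taken from UD_Irish-Cadhan.
SAMPLE = """\
1\tan\tan\tDET\t_\tDefinite=Def|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tfear\tfear\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_
3\tmór\tmór\tADJ\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tamod\t_\t_

1\tbhí\tbí\tVERB\t_\tTense=Past\t0\troot\t_\t_
2\tsí\tsí\tPRON\t_\tGender=Fem|Number=Sing|Person=3\t1\tnsubj\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (upos, feats) for every ordinary token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        yield cols[3], parse_feats(cols[5])


def gender_by_upos(tokens):
    """Map each UPOS tag to (tokens with Gender, all tokens)."""
    total = Counter()
    with_gender = Counter()
    for upos, feats in tokens:
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}


stats = gender_by_upos(read_tokens(SAMPLE))
for upos, (marked, total) in sorted(stats.items()):
    print(f"{upos}: {marked} of {total} tokens have a non-empty Gender")
```

Dividing the first number by the second gives the "% of all NOUN tokens" style of figure used in the per-tag sections; dividing by the total token count instead gives the "% instances" style of figure in the summary line.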
NOUN
985 NOUN tokens (86% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (985; 100%), Number=Sing (847; 86%), Case=Nom (718; 73%), Form=EMPTY (668; 68%), Definite=Def (533; 54%).
NOUN tokens may have the following values of Gender:
Fem (344; 35% of non-empty Gender): beatha, leith, bliadhna, cuid, oidhche, réir, thoil, laimh, linn, láimh
Masc (641; 65% of non-empty Gender): lá, duine, fhios, saoghal, bith, fear, la, ainm, creidimh, mac
EMPTY (161): bheith, chur, dhéanamh, tan, cur, tabhairt, teacht, éis, ais, déanamh
| Paradigm cor | Masc | Fem |
|---|---|---|
| Case=Gen\|Definite=Def\|Form=Ecl\|NounType=Weak\|Number=Plur | gcor | |
| Case=Nom\|Form=Len\|Number=Sing | chor | choir |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (550) occur with only one value of Gender.
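The "lexical feature" observation rests on a simple check: group the observed Gender values by lemma and count how many lemmas show exactly one value. A minimal sketch of that check follows; the function name `single_gender_share` and the sample observations are hypothetical and are not treebank counts.

```python
from collections import defaultdict


def single_gender_share(observations):
    """Return (lemmas with exactly one Gender value, all lemmas seen).

    `observations` is an iterable of (lemma, gender) pairs, one per token
    that carries a non-empty Gender value.
    """
    values_by_lemma = defaultdict(set)
    for lemma, gender in observations:
        values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Hypothetical observations for illustration only (not treebank counts).
observations = [
    ("fear", "Masc"),
    ("fear", "Masc"),
    ("lámh", "Fem"),
    ("cor", "Masc"),
    ("cor", "Fem"),
]

single, total = single_gender_share(observations)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that the value is fixed per lemma rather than chosen by inflection; a percentage rounded to 100% can still hide a few lemmas attested with both values.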
PROPN
195 PROPN tokens (86% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Definite=Def (192; 98%), Foreign=EMPTY (191; 98%), Number=Sing (181; 93%), Form=EMPTY (159; 82%), Case=Nom (102; 52%).
PROPN tokens may have the following values of Gender:
Fem (47; 24% of non-empty Gender): Éireann, Éirinn, Danann, Eireann, Uladh, n-Éirinn, Callain, Casga, Chill, Chnámhchoill
Masc (148; 76% of non-empty Gender): Iósa, Dia, Dé, Sacsaibh, Ursula, Beare, Bheannchair, Bhuck, Comhghall, Dhia
EMPTY (31): Buck, Hanmer, Oirdnidhe, Bangor, Chairbre, Cuidithe, Cúna, Dyea, Hibernia, Hiberus
| Paradigm Ulaidh | Masc | Fem |
|---|---|---|
| | Uladh | Uladh |
Gender seems to be a lexical feature of PROPN. 99% of lemmas (122) occur with only one value of Gender.
ADP
109 ADP tokens (15% of all ADP tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADP and Gender co-occurred: Number=Sing (109; 100%), Person=3 (109; 100%).
ADP tokens may have the following values of Gender:
Fem (9; 8% of non-empty Gender): aici, uirre, dhi, di, lei, léithi, ría
Masc (100; 92% of non-empty Gender): ann, aige, air, ‘na, na, d’á, dá, dó, as, da
EMPTY (623): ar, ag, i, do, le, ó, go, re, gan, dochum
| Paradigm ar | Masc | Fem |
|---|---|---|
| | air | uirre |
ADJ
102 ADJ tokens (49% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (102; 100%), Number=Sing (91; 89%), Case=Nom (88; 86%), Form=EMPTY (81; 79%).
ADJ tokens may have the following values of Gender:
Fem (25; 25% of non-empty Gender): ghloin, mhaith, shuthoin, aintréin, bheg, bhán, breagh, buidhe, direach, doirbhe
Masc (77; 75% of non-empty Gender): beag, mór, dil, maith, Caoimh, Caoin, Sasanach, aisteach, allta, amhain
EMPTY (108): maith, mó, amháin, iomdha, ionann, mór, buaine, cóir, geal, mhór
Paradigm mór | Masc | Fem |
---|---|---|
Form=Len | mhór | |
mór |
DET
102 DET tokens (23% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (101; 99%), Case=EMPTY (55; 54%), Definite=EMPTY (55; 54%), Person=3 (55; 54%), Poss=Yes (55; 54%), PronType=EMPTY (55; 54%).
DET tokens may have the following values of Gender:
Fem (26; 25% of non-empty Gender): na, a, n-a, ná
Masc (76; 75% of non-empty Gender): a, an, n-a, do, na
EMPTY (350): an, na, gach, mo, a, do, eile, so, aon, sin
Paradigm an | Masc | Fem |
---|---|---|
Number=Sing | an | na |
Number=Plur | na |
PRON
85 PRON tokens (41% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (85; 100%), Person=3 (85; 100%), PronType=EMPTY (80; 94%).
PRON tokens may have the following values of Gender:
Fem (13; 15% of non-empty Gender): sí, í, si, sise
Masc (72; 85% of non-empty Gender): sé, é, hé, se, e, seision, eisean, seisean
EMPTY (124): sin, féin, mé, siad, tú, iad, so, a, sibh, tusa
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (93; 89%),
NOUN –[conj]–> NOUN (49; 69%),
PROPN –[conj]–> PROPN (15; 100%),
PROPN –[appos]–> NOUN (14; 88%),
PROPN –[flat:name]–> PROPN (9; 75%),
NOUN –[appos]–> PROPN (8; 67%),
PROPN –[amod]–> ADJ (6; 100%),
PROPN –[nmod]–> NOUN (5; 56%),
NOUN –[parataxis]–> NOUN (4; 57%),
NOUN –[nsubj]–> PRON (3; 60%).
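
Agreement figures of this kind can be approximated by walking each dependency edge and comparing the Gender value of the parent with that of the child. The sketch below is one possible reading, not the page's actual computation: the page does not state the denominator of its percentages, so the code assumes it is the number of edges of that type where both nodes carry a Gender value. The function name `agreement_counts`, the token dictionaries, and the sample sentence are all hypothetical.

```python
from collections import Counter


def agreement_counts(sentences):
    """Map (parent UPOS, relation, child UPOS) to (agreeing edges, comparable edges).

    Each sentence is a list of token dicts with the keys
    'id', 'head', 'upos', 'deprel' and 'gender' (None when Gender is empty).
    An edge is comparable only when parent and child both have a Gender value
    (an assumption; the source page does not define the denominator).
    """
    agree = Counter()
    comparable = Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # the root has head 0 and therefore no parent token
            if parent["gender"] is None or child["gender"] is None:
                continue  # nothing to compare
            key = (parent["upos"], child["deprel"], child["upos"])
            comparable[key] += 1
            if parent["gender"] == child["gender"]:
                agree[key] += 1
    return {key: (agree[key], comparable[key]) for key in comparable}


# Hypothetical sentence for illustration only (not from the treebank).
sentence = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Masc"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
    {"id": 3, "head": 1, "upos": "NOUN", "deprel": "conj", "gender": "Fem"},
    {"id": 4, "head": 3, "upos": "DET", "deprel": "det", "gender": None},
]

for (parent, rel, child), (same, total) in agreement_counts([sentence]).items():
    print(f"{parent} -[{rel}]-> {child}: {same} of {total} edges agree in Gender")
```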