Treebank Statistics: UD_Polish-LFG: Features: SubGender
This feature is language-specific.
It occurs with 3 different values: Masc1
, Masc2
, Masc3
.
30073 tokens (23%) have a non-empty value of SubGender
.
13875 types (47%) occur at least once with a non-empty value of SubGender
.
8286 lemmas (53%) occur at least once with a non-empty value of SubGender
.
The feature is used with 8 part-of-speech tags: NOUN (11867; 9% instances), VERB (6036; 5% instances), ADJ (3894; 3% instances), PROPN (3028; 2% instances), PRON (2760; 2% instances), DET (1576; 1% instances), NUM (565; 0% instances), AUX (347; 0% instances).
NOUN
11867 NOUN tokens (47% of all NOUN
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which NOUN
and SubGender
co-occurred: Gender=Masc (11867; 100%), Number=Sing (8150; 69%).
NOUN
tokens may have the following values of SubGender
:
Masc1
(3779; 32% of non-emptySubGender
): pan, pana, panie, ludzi, ludzie, poseł, mężczyzna, panu, policjanci, człowiekMasc2
(476; 4% of non-emptySubGender
): złotych, zł, ptaki, koty, konie, kot, papierosy, psy, papierosa, psaMasc3
(7612; 64% of non-emptySubGender
): lat, domu, roku, raz, czas, dni, szpitala, dzień, pokoju, czasem
Paradigm przewodnik | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Case=Acc|Number=Sing | przewodnika | ||
Case=Acc|Number=Plur | przewodniki | ||
Case=Gen|Number=Plur | przewodników | ||
Case=Ins|Number=Sing | przewodnikiem | ||
Case=Ins|Number=Plur | przewodnikami |
SubGender
seems to be lexical feature of NOUN
. 99% lemmas (2769) occur only with one value of SubGender
.
VERB
6036 VERB tokens (28% of all VERB
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which VERB
and SubGender
co-occurred: Gender=Masc (6036; 100%), Mood=Ind (6036; 100%), Person=EMPTY (6036; 100%), VerbForm=Fin (6036; 100%), Voice=Act (6036; 100%), Tense=Past (6009; 100%), Number=Sing (4543; 75%), Aspect=Perf (3612; 60%).
VERB
tokens may have the following values of SubGender
:
Masc1
(5233; 87% of non-emptySubGender
): miał, chciał, mógł, widział, musiał, mieli, powiedział, zaczął, spojrzał, wiedziałMasc2
(147; 2% of non-emptySubGender
): kantowały, miał, był, stał, chciał, grał, krążyły, miały, pojawiły, rozumiałMasc3
(656; 11% of non-emptySubGender
): mógł, zaczął, był, stał, były, miał, minął, nadszedł, panował, powstał
Paradigm mieć | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Number=Sing | miał | miał | miał |
Number=Plur | mieli | miały | miały |
ADJ
3894 ADJ tokens (45% of all ADJ
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which ADJ
and SubGender
co-occurred: Gender=Masc (3894; 100%), Aspect=EMPTY (3296; 85%), Polarity=EMPTY (3296; 85%), VerbForm=EMPTY (3296; 85%), Voice=EMPTY (3296; 85%), Degree=Pos (3123; 80%), Number=Sing (2616; 67%).
ADJ
tokens may have the following values of SubGender
:
Masc1
(1267; 33% of non-emptySubGender
): sam, sami, inni, jeden, pierwszy, innych, stary, dobry, starszy, jednegoMasc2
(139; 4% of non-emptySubGender
): jeden, małe, dzikie, jednego, pokrojone, Białego, Biały, Biedny, Inne, LuksusowegoMasc3
(2488; 64% of non-emptySubGender
): cały, pierwszy, kolejny, jeden, drugi, inny, inne, wielki, nowy, ostatnie
Paradigm sam | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Case=Acc|Number=Sing | sam | ||
Case=Acc|Number=Plur | samych | same | |
Case=Gen|Number=Sing | samego | samego | |
Case=Gen|Number=Plur | samych | ||
Case=Ins|Number=Sing | samym | ||
Case=Loc|Number=Sing | samym | ||
Case=Nom|Number=Sing | sam | sam | |
Case=Nom|Number=Plur | sami | same | same |
PROPN
3028 PROPN tokens (66% of all PROPN
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which PROPN
and SubGender
co-occurred: Gender=Masc (3028; 100%), Number=Sing (2840; 94%), Case=Nom (1729; 57%).
PROPN
tokens may have the following values of SubGender
:
Masc1
(2451; 81% of non-emptySubGender
): Polacy, Jerzy, Andrzej, Adam, Bóg, Michał, Krzysztof, Kwaśniewski, Niemcy, AleksanderMasc2
(49; 2% of non-emptySubGender
): Dior, Duduś, Puzon, stara, Bosmana, Bronek, Czerwonym, DigiPath, Dunaja, DusiołkuMasc3
(528; 17% of non-emptySubGender
): SLD, Izrael, Krakowa, Paryżu, Poznaniu, Wrocławia, Afganistanie, Gdańska, Izraelu, Kraków
Paradigm Lech | Masc1 | Masc2 |
---|---|---|
Case=Acc | Lecha | |
Case=Gen | Lecha | Lecha |
Case=Ins | Lechem | |
Case=Nom | Lech |
SubGender
seems to be lexical feature of PROPN
. 100% lemmas (1842) occur only with one value of SubGender
.
PRON
2760 PRON tokens (30% of all PRON
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which PRON
and SubGender
co-occurred: Gender=Masc (2760; 100%), Reflex=EMPTY (2760; 100%), PronType=Prs (2376; 86%), Number=Sing (2021; 73%), PrepCase=EMPTY (1404; 51%).
PRON
tokens may have the following values of SubGender
:
Masc1
(2505; 91% of non-emptySubGender
): go, mnie, jego, mu, ja, mi, ich, nas, on, ktoMasc2
(55; 2% of non-emptySubGender
): go, ich, mu, jego, nim, je, on, ci, mnie, niegoMasc3
(200; 7% of non-emptySubGender
): go, je, nim, on, niego, one, ich, jego, nich, mu
Paradigm on | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Long | jego | ||
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Short | go | go | go |
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Long | niego | niego | niego |
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Short | ń | ń | |
Case=Acc|Number=Plur|PrepCase=Npr|Variant=Long | ich | je, ich | je |
Case=Acc|Number=Plur|PrepCase=Pre|Variant=Long | nich | ||
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Long | jemu | jemu | |
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Short | mu | mu | mu |
Case=Dat|Number=Sing|PrepCase=Pre|Variant=Long | niemu | ||
Case=Dat|Number=Plur|PrepCase=Npr|Variant=Long | im | ||
Case=Dat|Number=Plur|PrepCase=Pre|Variant=Long | nim | ||
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Long | jego, iego | jego | jego |
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Short | go | go | go |
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Long | niego | niego | |
Case=Gen|Number=Plur|PrepCase=Npr|Variant=Long | ich | ich | ich |
Case=Gen|Number=Plur|PrepCase=Pre|Variant=Long | nich | nich | nich |
Case=Ins|Number=Sing|PrepCase=Npr|Variant=Long | nim | nim | nim |
Case=Ins|Number=Sing|PrepCase=Pre|Variant=Long | nim | nim | nim |
Case=Ins|Number=Plur|PrepCase=Pre|Variant=Long | nimi | nimi | |
Case=Loc|Number=Sing|PrepCase=Pre|Variant=Long | nim | nim | nim |
Case=Loc|Number=Plur|PrepCase=Pre|Variant=Long | nich | nich | |
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Long | on | on | on |
Case=Nom|Number=Plur|PrepCase=Npr|Variant=Long | oni | one |
DET
1576 DET tokens (49% of all DET
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which DET
and SubGender
co-occurred: Gender=Masc (1576; 100%), Number[psor]=EMPTY (1396; 89%), Person=EMPTY (1396; 89%), NumType=EMPTY (1353; 86%), Poss=EMPTY (1261; 80%), Number=Sing (906; 57%).
DET
tokens may have the following values of SubGender
:
Masc1
(491; 31% of non-emptySubGender
): ten, wielu, wszyscy, każdy, mój, ci, wszystkich, tego, który, niektórzyMasc2
(55; 3% of non-emptySubGender
): ten, te, który, nasze, takiego, tego, Twój, Wszystkie, ile, któryśMasc3
(1030; 65% of non-emptySubGender
): ten, tym, tego, kilka, te, swój, jakiś, taki, takie, tych
Paradigm ten | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Case=Acc|Number=Sing | tego | tego | ten, tyn |
Case=Acc|Number=Plur | tych | te | te |
Case=Dat|Number=Plur | tym | ||
Case=Gen|Number=Sing | tego | tego | tego |
Case=Gen|Number=Plur | tych | tych | |
Case=Ins|Number=Sing | tym | tym | tym |
Case=Ins|Number=Plur | tymi | ||
Case=Loc|Number=Sing | tym | tym | |
Case=Loc|Number=Plur | tych | ||
Case=Nom|Number=Sing | ten | ten | ten |
Case=Nom|Number=Plur | ci | te | te |
NUM
565 NUM tokens (68% of all NUM
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which NUM
and SubGender
co-occurred: Gender=Masc (565; 100%), Number=Plur (565; 100%), NumType=Card (563; 100%), Case=Acc (399; 71%).
NUM
tokens may have the following values of SubGender
:
Masc1
(176; 31% of non-emptySubGender
): dwóch, trzech, czterech, dwaj, obaj, pięciu, trzej, obu, sześciu, stuMasc2
(49; 9% of non-emptySubGender
): 1500, 20, 4, 600, dwa, pięć, siedem, sto, trzy, 1.000Masc3
(340; 60% of non-emptySubGender
): dwa, trzy, cztery, dwóch, dwadzieścia, sto, trzech, 10, 4, 80
Paradigm dwa | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Case=Acc | dwóch | dwa | |
Case=Gen | dwóch | dwóch | dwóch, dwu |
Case=Ins | dwoma | dwoma | |
Case=Loc | dwóch | ||
Case=Nom | dwaj | dwa | dwa |
AUX
347 AUX tokens (8% of all AUX
tokens) have a non-empty value of SubGender
.
The most frequent other feature values with which AUX
and SubGender
co-occurred: Gender=Masc (347; 100%), Mood=Ind (347; 100%), Person=EMPTY (347; 100%), Tense=Past (347; 100%), Variant=EMPTY (347; 100%), VerbForm=Fin (347; 100%), Voice=Act (291; 84%), Aspect=Imp (283; 82%), Number=Sing (266; 77%).
AUX
tokens may have the following values of SubGender
:
Masc1
(194; 56% of non-emptySubGender
): był, byli, został, zostali, bywałMasc2
(7; 2% of non-emptySubGender
): był, został, byłyMasc3
(146; 42% of non-emptySubGender
): był, został, były, zostały
Paradigm być | Masc1 | Masc2 | Masc3 |
---|---|---|---|
Number=Sing | był | był | |
Number=Sing|Voice=Act | był | był | był |
Number=Plur | byli | były | |
Number=Plur|Voice=Act | byli | były | były |
Relations with Agreement in SubGender
The 10 most frequent relations where parent and child node agree in SubGender
:
NOUN –[amod]–> ADJ (2435; 100%),
VERB –[nsubj]–> NOUN (1562; 51%),
NOUN –[det]–> DET (1216; 100%),
VERB –[nsubj]–> PROPN (798; 75%),
VERB –[conj]–> VERB (487; 62%),
NOUN –[nummod]–> NUM (476; 100%),
NOUN –[flat]–> PROPN (466; 96%),
NOUN –[acl]–> ADJ (357; 100%),
PROPN –[flat]–> PROPN (247; 97%),
NOUN –[flat]–> NOUN (131; 93%).