This page pertains to UD version 2.

Treebank Statistics: UD_Polish-LFG: Features: SubGender

This feature is language-specific. It occurs with 3 different values: Masc1, Masc2, Masc3.

30073 tokens (23%) have a non-empty value of SubGender. 13875 types (47%) occur at least once with a non-empty value of SubGender. 8286 lemmas (53%) occur at least once with a non-empty value of SubGender. The feature is used with 8 part-of-speech tags: NOUN (11867; 9% of instances), VERB (6036; 5% of instances), ADJ (3894; 3% of instances), PROPN (3028; 2% of instances), PRON (2760; 2% of instances), DET (1576; 1% of instances), NUM (565; 0% of instances), AUX (347; 0% of instances).
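Per-tag counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that SubGender is stored in the FEATS column (column 6) and the tag in the UPOS column (column 4). The `SAMPLE` fragment and the helper names `parse_feats` and `subgender_by_upos` are hypothetical and are not part of the treebank or of any UD tool.

```python
from collections import Counter

# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
1\tLech\tLech\tPROPN\t_\tCase=Nom|Gender=Masc|Number=Sing|SubGender=Masc1\t2\tnsubj\t_\t_
2\tmiał\tmieć\tVERB\t_\tGender=Masc|Number=Sing|SubGender=Masc1|Tense=Past\t0\troot\t_\t_
3\tksiążkę\tksiążka\tNOUN\t_\tCase=Acc|Gender=Fem|Number=Sing\t2\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def subgender_by_upos(conllu_text):
    """Count, per UPOS tag, the tokens that carry a SubGender value."""
    with_feature, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "SubGender" in parse_feats(cols[5]):
            with_feature[upos] += 1
    return with_feature, total


with_feature, total = subgender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_feature[upos], total[upos])
```

Dividing `with_feature[upos]` by `total[upos]` gives the share of tokens of one tag that carry the feature, which is the kind of figure quoted at the start of each per-tag section.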

NOUN

11867 NOUN tokens (47% of all NOUN tokens) have a non-empty value of SubGender.

The most frequent other feature values with which NOUN and SubGender co-occurred: Gender=Masc (11867; 100%), Number=Sing (8150; 69%).

NOUN tokens may have the following values of SubGender:

Paradigm przewodnik: Masc1; Masc2; Masc3
Case=Acc|Number=Sing: przewodnika
Case=Acc|Number=Plur: przewodniki
Case=Gen|Number=Plur: przewodników
Case=Ins|Number=Sing: przewodnikiem
Case=Ins|Number=Plur: przewodnikami

SubGender seems to be a lexical feature of NOUN. 99% of lemmas (2769) occur with only one value of SubGender.
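The "lexical feature" observation rests on how many lemmas are seen with a single value. A minimal sketch of that check follows. The function name `single_value_share` and the sample observations are hypothetical and serve only to illustrate the calculation.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (count, share) of lemmas seen with exactly one feature value.

    `pairs` is an iterable of (lemma, value) tuples, one per token.
    """
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)


# Hypothetical observations, not treebank data.
observations = [
    ("przewodnik", "Masc1"), ("przewodnik", "Masc3"),
    ("pies", "Masc2"), ("pies", "Masc2"),
    ("stół", "Masc3"), ("Lech", "Masc1"),
]
count, share = single_value_share(observations)
print(count, f"{share:.0%}")
```

A lemma such as przewodnik, which appears with more than one value in this sample, lowers the share. A share close to 100% suggests the value belongs to the lemma rather than to the individual word form.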

VERB

6036 VERB tokens (28% of all VERB tokens) have a non-empty value of SubGender.

The most frequent other feature values with which VERB and SubGender co-occurred: Gender=Masc (6036; 100%), Mood=Ind (6036; 100%), Person=EMPTY (6036; 100%), VerbForm=Fin (6036; 100%), Voice=Act (6036; 100%), Tense=Past (6009; 100%), Number=Sing (4543; 75%), Aspect=Perf (3612; 60%).

VERB tokens may have the following values of SubGender:

Paradigm mieć: Masc1; Masc2; Masc3
Number=Sing: miał; miał; miał
Number=Plur: mieli; miały; miały

ADJ

3894 ADJ tokens (45% of all ADJ tokens) have a non-empty value of SubGender.

The most frequent other feature values with which ADJ and SubGender co-occurred: Gender=Masc (3894; 100%), Aspect=EMPTY (3296; 85%), Polarity=EMPTY (3296; 85%), VerbForm=EMPTY (3296; 85%), Voice=EMPTY (3296; 85%), Degree=Pos (3123; 80%), Number=Sing (2616; 67%).

ADJ tokens may have the following values of SubGender:

Paradigm sam: Masc1; Masc2; Masc3
Case=Acc|Number=Sing: sam
Case=Acc|Number=Plur: samych; same
Case=Gen|Number=Sing: samego; samego
Case=Gen|Number=Plur: samych
Case=Ins|Number=Sing: samym
Case=Loc|Number=Sing: samym
Case=Nom|Number=Sing: sam; sam
Case=Nom|Number=Plur: sami; same; same

PROPN

3028 PROPN tokens (66% of all PROPN tokens) have a non-empty value of SubGender.

The most frequent other feature values with which PROPN and SubGender co-occurred: Gender=Masc (3028; 100%), Number=Sing (2840; 94%), Case=Nom (1729; 57%).

PROPN tokens may have the following values of SubGender:

Paradigm Lech: Masc1; Masc2
Case=Acc: Lecha
Case=Gen: Lecha; Lecha
Case=Ins: Lechem
Case=Nom: Lech

SubGender seems to be a lexical feature of PROPN. 100% of lemmas (1842) occur with only one value of SubGender.

PRON

2760 PRON tokens (30% of all PRON tokens) have a non-empty value of SubGender.

The most frequent other feature values with which PRON and SubGender co-occurred: Gender=Masc (2760; 100%), Reflex=EMPTY (2760; 100%), PronType=Prs (2376; 86%), Number=Sing (2021; 73%), PrepCase=EMPTY (1404; 51%).

PRON tokens may have the following values of SubGender:

Paradigm on: Masc1; Masc2; Masc3
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Long: jego
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Short: go; go; go
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Long: niego; niego; niego
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Short: ń; ń
Case=Acc|Number=Plur|PrepCase=Npr|Variant=Long: ich; je, ich; je
Case=Acc|Number=Plur|PrepCase=Pre|Variant=Long: nich
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Long: jemu; jemu
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Short: mu; mu; mu
Case=Dat|Number=Sing|PrepCase=Pre|Variant=Long: niemu
Case=Dat|Number=Plur|PrepCase=Npr|Variant=Long: im
Case=Dat|Number=Plur|PrepCase=Pre|Variant=Long: nim
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Long: jego, iego; jego; jego
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Short: go; go; go
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Long: niego; niego
Case=Gen|Number=Plur|PrepCase=Npr|Variant=Long: ich; ich; ich
Case=Gen|Number=Plur|PrepCase=Pre|Variant=Long: nich; nich; nich
Case=Ins|Number=Sing|PrepCase=Npr|Variant=Long: nim; nim; nim
Case=Ins|Number=Sing|PrepCase=Pre|Variant=Long: nim; nim; nim
Case=Ins|Number=Plur|PrepCase=Pre|Variant=Long: nimi; nimi
Case=Loc|Number=Sing|PrepCase=Pre|Variant=Long: nim; nim; nim
Case=Loc|Number=Plur|PrepCase=Pre|Variant=Long: nich; nich
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Long: on; on; on
Case=Nom|Number=Plur|PrepCase=Npr|Variant=Long: oni; one

DET

1576 DET tokens (49% of all DET tokens) have a non-empty value of SubGender.

The most frequent other feature values with which DET and SubGender co-occurred: Gender=Masc (1576; 100%), Number[psor]=EMPTY (1396; 89%), Person=EMPTY (1396; 89%), NumType=EMPTY (1353; 86%), Poss=EMPTY (1261; 80%), Number=Sing (906; 57%).

DET tokens may have the following values of SubGender:

Paradigm ten: Masc1; Masc2; Masc3
Case=Acc|Number=Sing: tego; tego; ten, tyn
Case=Acc|Number=Plur: tych; te; te
Case=Dat|Number=Plur: tym
Case=Gen|Number=Sing: tego; tego; tego
Case=Gen|Number=Plur: tych; tych
Case=Ins|Number=Sing: tym; tym; tym
Case=Ins|Number=Plur: tymi
Case=Loc|Number=Sing: tym; tym
Case=Loc|Number=Plur: tych
Case=Nom|Number=Sing: ten; ten; ten
Case=Nom|Number=Plur: ci; te; te

NUM

565 NUM tokens (68% of all NUM tokens) have a non-empty value of SubGender.

The most frequent other feature values with which NUM and SubGender co-occurred: Gender=Masc (565; 100%), Number=Plur (565; 100%), NumType=Card (563; 100%), Case=Acc (399; 71%).

NUM tokens may have the following values of SubGender:

Paradigm dwa: Masc1; Masc2; Masc3
Case=Acc: dwóch; dwa
Case=Gen: dwóch; dwóch; dwóch, dwu
Case=Ins: dwoma; dwoma
Case=Loc: dwóch
Case=Nom: dwaj; dwa; dwa

AUX

347 AUX tokens (8% of all AUX tokens) have a non-empty value of SubGender.

The most frequent other feature values with which AUX and SubGender co-occurred: Gender=Masc (347; 100%), Mood=Ind (347; 100%), Person=EMPTY (347; 100%), Tense=Past (347; 100%), Variant=EMPTY (347; 100%), VerbForm=Fin (347; 100%), Voice=Act (291; 84%), Aspect=Imp (283; 82%), Number=Sing (266; 77%).

AUX tokens may have the following values of SubGender:

Paradigm być: Masc1; Masc2; Masc3
Number=Sing: był; był
Number=Sing|Voice=Act: był; był; był
Number=Plur: byli; były
Number=Plur|Voice=Act: byli; były; były

Relations with Agreement in SubGender

The 10 most frequent relations where parent and child node agree in SubGender: NOUN –[amod]–> ADJ (2435; 100%), VERB –[nsubj]–> NOUN (1562; 51%), NOUN –[det]–> DET (1216; 100%), VERB –[nsubj]–> PROPN (798; 75%), VERB –[conj]–> VERB (487; 62%), NOUN –[nummod]–> NUM (476; 100%), NOUN –[flat]–> PROPN (466; 96%), NOUN –[acl]–> ADJ (357; 100%), PROPN –[flat]–> PROPN (247; 97%), NOUN –[flat]–> NOUN (131; 93%).
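Agreement counts of this kind can be sketched as a pass over the dependency edges of each sentence. The code below is an illustration only: the token dictionaries, the function name `agreement_stats`, and the sample sentence are hypothetical. The page does not say how its percentages are computed, so the denominator used here (all edges of the same parent tag, relation, and child tag) is an assumption.

```python
from collections import Counter


def agreement_stats(sentence, feature="SubGender"):
    """Count parent-child pairs that share a value of `feature`.

    `sentence` is a list of dicts with keys id, head, upos, deprel, feats.
    Returns {(parent_upos, deprel, child_upos): (agreeing, all_edges)}.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 is the artificial root, which has no features
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        # Agreement requires the feature on both nodes with the same value.
        if parent_value is not None and parent_value == child_value:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Hypothetical three-token sentence, not taken from the treebank.
sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det",
     "feats": {"SubGender": "Masc1"}},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj",
     "feats": {"SubGender": "Masc1"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root",
     "feats": {"SubGender": "Masc1"}},
]
for key, (agreeing, edges) in agreement_stats(sentence).items():
    print(key, agreeing, edges)
```

Summing the per-sentence results over a whole treebank and sorting by the agreeing count would give a ranking comparable in shape to the list above.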