home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: Features: Case

This feature is universal. It occurs with 4 different values: Acc, Dat, Gen, Nom.

1120241 tokens (32%) have a non-empty value of Case. 35564 types (19%) occur at least once with a non-empty value of Case. 13775 lemmas (20%) occur at least once with a non-empty value of Case. The feature is used with 8 part-of-speech tags: DET (490512; 14% instances), ADP (345545; 10% instances), PRON (93583; 3% instances), NOUN (78735; 2% instances), PROPN (61547; 2% instances), ADJ (50236; 1% instances), X (61; 0% instances), NUM (22; 0% instances).

DET

490512 DET tokens (99% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: PronType=Art (428894; 87%), NumType=EMPTY (420585; 86%), Number=Sing (393634; 80%), Definite=Def (359941; 73%).

DET tokens may have the following values of Case:

Paradigm derNomAccDatGen
Gender=Masc,Neut|Number=Singdem
Gender=Masc|Number=Singderden, derdem, desdes, der
Gender=Fem|Number=Singdie, derdieder, dieder
Gender=Neut|Number=Singdasdas, 'sdem, das, desdes
Number=Singdie, derdie, dender
Number=Plurdie, derdie, denden, der, dieder

ADP

345545 ADP tokens (90% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (344347; 100%).

ADP tokens may have the following values of Case:

Paradigm inAccDatGen
_in
AdpType=Prepininin

PRON

93583 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: Reflex=EMPTY (72463; 77%), PronType=Prs (54176; 58%), Number=Sing (53228; 57%), Gender=EMPTY (50701; 54%), Person=3 (48743; 52%).

PRON tokens may have the following values of Case:

Paradigm derNomAccDatGen
Abbr=Yes|Gender=Neut|Number=Singd.
Gender=Masc|Number=Singderdendemdessen
Gender=Fem|Number=Singdiediederderer, Deren
Gender=Neut|Number=Singdasdasdemdessen
Gender=Neut|Number=Sing|Typo=Yesda
Number=Singdasdessen
Number=Plurdiediedenenderer, der
deren

NOUN

78735 NOUN tokens (11% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (43268; 55%).

NOUN tokens may have the following values of Case:

Paradigm JahrNomAccDatGen
Number=SingBoomjahrJahrJahre, Jahr, FinanzjahrJahres, Jahrs, Finanzjahres, Startup-Jahres, Verkaufsjahres
Number=PlurJahreJahreJahren, 70er-Jahren, 50-er-Jahren, 50er-Jahren, 80er-Jahren, 90er-Jahren, Achtzigerjahren, Anfangsjahren, Boom-Jahren, Folgejahren, Internet-Jahren, Neunzigerjahren, Startjahren, WachstumsjahrenJahre, 80er-Jahre

PROPN

61547 PROPN tokens (32% of all PROPN tokens) have a non-empty value of Case.

The most frequent other feature values with which PROPN and Case co-occurred: Gender=EMPTY (58875; 96%), Number=Sing (58722; 95%).

PROPN tokens may have the following values of Case:

Paradigm TelekomNomAccDatGen
TelekomTelekomTelekomTelekom

ADJ

50236 ADJ tokens (19% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Variant=EMPTY (50232; 100%), Degree=Pos (42441; 84%), Number=Sing (27572; 55%).

ADJ tokens may have the following values of Case:

Paradigm neuNomAccDatGen
Degree=Pos|Gender=Masc|Number=Singneue, neuerneuenneuen
Degree=Pos|Gender=Masc|Number=Plurneuenneuenneuenneuen, neuer
Degree=Pos|Gender=Fem|Number=Singneuen, neuer
Degree=Pos|Gender=Fem|Number=Plurneuenneuenneuenneuer, neuen
Degree=Pos|Gender=Neut|Number=Singneuesneuen
Degree=Pos|Gender=Neut|Number=Plurneuenneuenneuenneuer, neuen
Degree=Pos|Number=Singneuem, neuenneuen
Degree=Pos|Number=Plurneuen, neueneuenneuenneuer, neuen
Degree=Cmp|Gender=Masc|Number=Singneuere, neuererneueren
Degree=Cmp|Gender=Masc|Number=Plurneuerenneueren
Degree=Cmp|Gender=Fem|Number=Plurneueren
Degree=Cmp|Number=Singneueren
Degree=Cmp|Number=Plurneueren
Degree=Sup|Gender=Masc|Number=Singneueste, neuester, neusteneuesten
Degree=Sup|Gender=Masc|Number=Plurneuesten
Degree=Sup|Gender=Fem|Number=Singneuesten
Degree=Sup|Gender=Fem|Number=Plurneuesten, neustenneuesten
Degree=Sup|Gender=Neut|Number=Plurneuestenneuesten, neusten
Degree=Sup|Number=Singneuestem, neuesten
Degree=Sup|Number=Plurneuestenneuester

X

61 X tokens (0% of all X tokens) have a non-empty value of Case.

The most frequent other feature values with which X and Case co-occurred: Foreign=Yes (61; 100%).

X tokens may have the following values of Case:

Paradigm DigitalNomDat
DigitalDigital

Case seems to be lexical feature of X. 98% lemmas (49) occur only with one value of Case.

NUM

22 NUM tokens (0% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (22; 100%), Number=Plur (21; 95%).

NUM tokens may have the following values of Case:

Case seems to be lexical feature of NUM. 100% lemmas (14) occur only with one value of Case.

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: PRON –[case]–> ADP (5703; 96%), DET –[case]–> ADP (4618; 95%), ADJ –[conj]–> ADJ (450; 75%), DET –[det]–> DET (360; 56%), DET –[conj]–> DET (56; 57%), PRON –[appos]–> DET (45; 92%), PRON –[conj]–> PRON (43; 93%), PRON –[nsubj]–> PRON (31; 79%), NOUN –[amod]–> NOUN (22; 65%), ADJ –[amod]–> ADJ (14; 64%).