
This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

This is a layered feature with the following layers: Number, Number[psor].
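As an illustration of what a layered feature name looks like in practice, the sketch below splits a name such as Number[psor] into its base feature and its layer. The helper name split_layer and the regular expression are assumptions made for this example; they are not part of any UD tool.

```python
import re

# Hypothetical helper: a feature name is a base name optionally followed
# by a layer in square brackets, e.g. "Number" or "Number[psor]".
LAYERED = re.compile(r"^([A-Za-z0-9]+)(?:\[([a-z0-9]+)\])?$")

def split_layer(feature_name):
    """Return (base, layer); layer is None for the default layer."""
    match = LAYERED.match(feature_name)
    if match is None:
        raise ValueError(f"not a feature name: {feature_name!r}")
    return match.group(1), match.group(2)

print(split_layer("Number"))        # base feature, default layer
print(split_layer("Number[psor]"))  # base feature with the psor layer
```

Under this reading, Number and Number[psor] are counted as separate features that share the base name Number.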

1903567 tokens (55%) have a non-empty value of Number. 143462 types (76%) occur at least once with a non-empty value of Number. 34954 lemmas (50%) occur at least once with a non-empty value of Number. The feature is used with 10 part-of-speech tags: NOUN (694170; 20% instances), DET (493647; 14% instances), ADJ (172255; 5% instances), VERB (134321; 4% instances), AUX (134054; 4% instances), PROPN (128855; 4% instances), PRON (73108; 2% instances), NUM (71302; 2% instances), ADV (1676; 0% instances), X (179; 0% instances).
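Counts of this kind can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names and the toy SAMPLE sentence are invented for illustration, and the only assumption about the data is the standard ten-column, tab-separated CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Acc|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def number_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and those with a non-empty Number."""
    with_number, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a token line
        if "-" in cols[0] or "." in cols[0]:
            continue  # multiword-token range or empty node
        upos = cols[3]
        total[upos] += 1
        if "Number" in parse_feats(cols[5]):
            with_number[upos] += 1
    return with_number, total

# Toy data, invented for this example (not taken from UD_German-HDT).
SAMPLE = "\n".join([
    "# text = Die Jahre vergehen schnell",
    "1\tDie\tder\tDET\tART\tCase=Nom|Number=Plur|PronType=Art\t2\tdet\t_\t_",
    "2\tJahre\tJahr\tNOUN\tNN\tCase=Nom|Number=Plur\t3\tnsubj\t_\t_",
    "3\tvergehen\tvergehen\tVERB\tVVFIN\tNumber=Plur|Person=3|VerbForm=Fin\t0\troot\t_\t_",
    "4\tschnell\tschnell\tADV\tADV\t_\t3\tadvmod\t_\t_",
])

with_number, total = number_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_number[upos], "of", total[upos])
```

Dividing each per-tag count by the tag's total would give a per-tag coverage figure comparable to the ones reported in the sections below.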

NOUN

694170 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Case=EMPTY (616712; 89%).

NOUN tokens may have the following values of Number:

Paradigm Jahr (Sing, Plur)
Case=Acc: [Sing] Jahr; [Plur] Jahre
Case=Dat: [Sing] Jahre, Jahr, Finanzjahr; [Plur] Jahren, 70er-Jahren, 50-er-Jahren, 50er-Jahren, 80er-Jahren, 90er-Jahren, Achtzigerjahren, Anfangsjahren, Boom-Jahren, Folgejahren, Internet-Jahren, Neunzigerjahren, Startjahren, Wachstumsjahren
Case=Gen: [Sing] Jahres, Jahrs, Finanzjahres, Startup-Jahres, Verkaufsjahres; [Plur] Jahre, 80er-Jahre
Case=Nom: [Sing] Boomjahr; [Plur] Jahre
_: [Sing] Jahr, Geburtsjahr, Gruner+Jahr, Fiskaljahr, Geschäftjahr, Steuerjahr, Studienjahr, Wachstumsjahr, Wahljahr, Abschreibungsjahr, Aktionsjahr, Ausbildungsjahr, Boom-Jahr, Boomjahr, Expo-Jahr, Finanzjahr, Früjahr, Gründungsjahr, Jubiläumsjahr, Krisenjahr, Registrierungsjahr, Spitzenjahr, UMTS-Jahr, Verlustjahr, Vertragsjahr, Windows-95-Jahr, dotcom-Jahr; [Plur] Jahre, 90er-Jahre, 80er-Jahre, 60er-Jahre, 90erJahre, Arbeitsjahre, Folgejahre, Modelljahre

DET

493647 DET tokens (100% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: PronType=Art (428896; 87%), NumType=EMPTY (423717; 86%), Definite=Def (359941; 73%).

DET tokens may have the following values of Number:

Paradigm der (Sing, Plur)
Case=Acc|Gender=Masc: den, der
Case=Acc|Gender=Fem: die
Case=Acc|Gender=Neut: das, 's
Case=Acc: [Sing] die, den; [Plur] die, den
Case=Dat|Gender=Masc,Neut: dem
Case=Dat|Gender=Masc: dem, des
Case=Dat|Gender=Fem: der, die
Case=Dat|Gender=Neut: dem, das, des
Case=Dat: den, der, die
Case=Gen|Gender=Masc: des, der
Case=Gen|Gender=Fem: der
Case=Gen|Gender=Neut: des
Case=Gen: [Sing] der; [Plur] der
Case=Nom|Gender=Masc: der
Case=Nom|Gender=Fem: die, der
Case=Nom|Gender=Neut: das
Case=Nom: [Sing] die, der; [Plur] die, der
_: den

ADJ

172255 ADJ tokens (66% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Variant=EMPTY (172254; 100%), Degree=Pos (152929; 89%), Case=EMPTY (122544; 71%), Gender=EMPTY (88117; 51%).

ADJ tokens may have the following values of Number:

Paradigm neu (Sing, Plur)
Case=Acc|Degree=Pos: neuen
Case=Acc|Degree=Pos|Gender=Masc: [Sing] neuen; [Plur] neuen
Case=Acc|Degree=Pos|Gender=Fem: neuen
Case=Acc|Degree=Pos|Gender=Neut: neuen
Case=Acc|Degree=Cmp|Gender=Masc: neueren
Case=Acc|Degree=Sup|Gender=Masc: [Sing] neuesten; [Plur] neuesten
Case=Acc|Degree=Sup|Gender=Fem: neuesten, neusten
Case=Acc|Degree=Sup|Gender=Neut: neuesten, neusten
Case=Dat|Degree=Pos: [Sing] neuem, neuen; [Plur] neuen
Case=Dat|Degree=Pos|Gender=Masc: [Sing] neuen; [Plur] neuen
Case=Dat|Degree=Pos|Gender=Fem: [Sing] neuen, neuer; [Plur] neuen
Case=Dat|Degree=Pos|Gender=Neut: [Sing] neuen; [Plur] neuen
Case=Dat|Degree=Cmp: neueren
Case=Dat|Degree=Cmp|Gender=Masc: neueren
Case=Dat|Degree=Cmp|Gender=Fem: neueren
Case=Dat|Degree=Sup: [Sing] neuestem, neuesten; [Plur] neuesten
Case=Dat|Degree=Sup|Gender=Fem: [Sing] neuesten; [Plur] neuesten
Case=Gen|Degree=Pos: [Sing] neuen; [Plur] neuer, neuen
Case=Gen|Degree=Pos|Gender=Masc: neuen, neuer
Case=Gen|Degree=Pos|Gender=Fem: neuer, neuen
Case=Gen|Degree=Pos|Gender=Neut: neuer, neuen
Case=Gen|Degree=Cmp: neueren
Case=Gen|Degree=Sup: neuester
Case=Nom|Degree=Pos: neuen, neue
Case=Nom|Degree=Pos|Gender=Masc: [Sing] neue, neuer; [Plur] neuen
Case=Nom|Degree=Pos|Gender=Fem: neuen
Case=Nom|Degree=Pos|Gender=Neut: [Sing] neues; [Plur] neuen
Case=Nom|Degree=Cmp|Gender=Masc: [Sing] neuere, neuerer; [Plur] neueren
Case=Nom|Degree=Sup|Gender=Masc: neueste, neuester, neuste
Case=Nom|Degree=Sup|Gender=Neut: neuesten
Degree=Pos: [Sing] neuen; [Plur] neue, neuen, Internet/Neue
Degree=Pos|Gender=Masc: [Sing] neuen; [Plur] neue
Degree=Pos|Gender=Fem: [Sing] neue, neuen, neuer; [Plur] neue, neuen
Degree=Pos|Gender=Neut: [Sing] neues, neue, neuen; [Plur] neue, neuen
Degree=Cmp: [Sing] neueren; [Plur] neuere, neueren
Degree=Cmp|Gender=Masc: neueren
Degree=Cmp|Gender=Fem: [Sing] neuere, neueren, neuerer; [Plur] Neuere
Degree=Cmp|Gender=Neut: [Sing] neuere, neueres; [Plur] Neuere
Degree=Sup: [Sing] neuesten, neusten; [Plur] neuesten, neueste
Degree=Sup|Gender=Masc: neuesten
Degree=Sup|Gender=Fem: [Sing] neueste, neuester, neuesten, neusten; [Plur] neueste
Degree=Sup|Gender=Neut: neueste, neuestes, neuesten

VERB

134321 VERB tokens (51% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Aspect=EMPTY (134321; 100%), VerbForm=Fin (134317; 100%), Mood=Ind (133577; 99%), Person=3 (131667; 98%), Tense=Pres (102877; 77%).

VERB tokens may have the following values of Number:

Paradigm geben (Sing, Plur)
Mood=Imp|Person=2: [Sing] gib; [Plur] Gebt
Mood=Ind|Person=1|Tense=Past: gaben
Mood=Ind|Person=1|Tense=Pres: [Sing] geb, gebe; [Plur] geben
Mood=Ind|Person=3|Tense=Past: [Sing] gab; [Plur] gaben
Mood=Ind|Person=3|Tense=Pres: [Sing] gibt, gebe, gäbe; [Plur] geben, gäben
Mood=Ind|Person=3: [Sing] gibt, gab, gebe, Gäbe; [Plur] gaben, geben

AUX

134054 AUX tokens (88% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (134052; 100%), Mood=Ind (134051; 100%), Person=3 (131333; 98%), Tense=Pres (113723; 85%), VerbType=EMPTY (88433; 66%).

AUX tokens may have the following values of Number:

Paradigm sein (Sing, Plur)
Mood=Imp|Person=2: seid
Mood=Ind|Person=1|Tense=Past: [Sing] war; [Plur] waren
Mood=Ind|Person=1|Tense=Pres: [Sing] bin, sei, wäre; [Plur] sind, wären, seien
Mood=Ind|Person=1: bin
Mood=Ind|Person=2|Tense=Pres: [Sing] bist; [Plur] seid
Mood=Ind|Person=3|Tense=Past: [Sing] war; [Plur] waren
Mood=Ind|Person=3|Tense=Pres: [Sing] ist, sei, wäre, wär; [Plur] sind, seien, wären
Mood=Ind|Person=3: [Sing] ist, sei, war, wäre; [Plur] sind, seien, waren, wären

PROPN

128855 PROPN tokens (66% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Gender=EMPTY (101121; 78%), Case=EMPTY (70010; 54%).

PROPN tokens may have the following values of Number:

Paradigm Siemens (Sing, Plur)
_: Siemens
Case=Acc: Siemens
Case=Dat: Siemens
Case=Gen: Siemens
Case=Nom: Siemens

Number seems to be a lexical feature of PROPN: 99% of lemmas (8376) occur with only one value of Number.
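One plausible way to arrive at such a figure is to collect, for each lemma, the set of Number values it occurs with and then measure how many lemmas have exactly one. The sketch below assumes this reading; the function name single_value_lemma_share and the toy observations are hypothetical, and the exact procedure used to generate this page is not stated here.

```python
from collections import defaultdict

def single_value_lemma_share(observations):
    """observations: iterable of (lemma, number_value) pairs for one UPOS tag.

    Returns (count of lemmas seen with exactly one value, that count as a
    fraction of all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, value in observations:
        values[lemma].add(value)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)

# Toy observations, invented for this example.
toy = [
    ("Siemens", "Sing"), ("Siemens", "Sing"),
    ("USA", "Plur"),
    ("Alpen", "Plur"),
    ("Medien", "Sing"), ("Medien", "Plur"),  # occurs with both values
]
count, share = single_value_lemma_share(toy)
print(count, share)
```

A share close to 1 would suggest that Number is fixed per lemma for that tag rather than varying by inflection.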

PRON

73108 PRON tokens (77% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (72732; 99%), Case=Nom (57883; 79%), Person=EMPTY (39739; 54%).

PRON tokens may have the following values of Number:

Paradigm der (Sing, Plur)
Abbr=Yes|Case=Nom|Gender=Neut: d.
Case=Acc|Gender=Masc: den
Case=Acc|Gender=Fem: die
Case=Acc|Gender=Neut: das
Case=Acc: die
Case=Dat|Gender=Masc: dem
Case=Dat|Gender=Fem: der
Case=Dat|Gender=Neut: dem
Case=Dat: denen
Case=Gen|Gender=Masc: dessen
Case=Gen|Gender=Fem: derer, Deren
Case=Gen|Gender=Neut: dessen
Case=Gen: [Sing] dessen; [Plur] derer, der
Case=Nom|Gender=Masc: der
Case=Nom|Gender=Fem: die
Case=Nom|Gender=Neut: das
Case=Nom|Gender=Neut|Typo=Yes: da
Case=Nom: [Sing] das; [Plur] die

Number seems to be a lexical feature of PRON: 92% of lemmas (24) occur with only one value of Number.

NUM

71302 NUM tokens (100% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (71301; 100%).

NUM tokens may have the following values of Number:

Paradigm viertel (Sing, Plur)
_: [Sing] viertel; [Plur] viertel

Number seems to be a lexical feature of NUM: 100% of lemmas (6530) occur with only one value of Number.

ADV

1676 ADV tokens (1% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: PronType=Ind (1675; 100%).

ADV tokens may have the following values of Number:

Paradigm mehr (Sing, Plur)
Case=Acc: mehrere
Case=Dat: mehreren
Case=Gen: mehrerer
Case=Nom: mehrere
Gender=Neut: [Sing] mehr; [Plur] mehr
_: mehr

X

179 X tokens (0% of all X tokens) have a non-empty value of Number.

The most frequent other feature values with which X and Number co-occurred: Foreign=Yes (179; 100%).

X tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[det]–> DET (430617; 96%), NOUN –[amod]–> ADJ (156711; 90%), NOUN –[nmod]–> NOUN (84329; 53%), VERB –[nsubj]–> NOUN (61922; 62%), NOUN –[nummod]–> NUM (46550; 97%), VERB –[nsubj]–> PRON (28090; 65%), NOUN –[conj]–> NOUN (22775; 58%), NOUN –[flat:name]–> PROPN (20480; 58%), NOUN –[nmod]–> PROPN (17162; 52%), PROPN –[det]–> DET (13435; 61%).
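Agreement counts of this shape can be sketched as follows. This is an illustrative approximation only: it assumes that the percentage is taken over parent-child pairs in which both nodes have a Number value, which this page does not state, and the function name number_agreement, the token dictionaries, and the toy sentence are all invented for the example.

```python
from collections import Counter

def number_agreement(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, number.

    number is None when the token has no Number value. Returns two Counters
    keyed by (parent UPOS, relation, child UPOS): pairs that agree, and all
    pairs where both nodes have a Number value (assumed denominator).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has no parent token
        if child["number"] is None or parent["number"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if child["number"] == parent["number"]:
            agree[key] += 1
    return agree, both

# Toy sentence, invented for this example.
toy_sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "number": "Plur"},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "number": "Plur"},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "number": "Plur"},
    {"id": 4, "head": 3, "upos": "ADV", "deprel": "advmod", "number": None},
]
agree, both = number_agreement(toy_sentence)
for key in both:
    print(key, agree[key], "of", both[key])
```

Summing these counters over all sentences of a treebank and sorting by the agreeing count would produce a ranking similar in form to the list above.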