home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Number

This feature is universal. It occurs with 4 different values: Dual, Plur, Ptan, Sing.

This is a layered feature with the following layers: Number, Number[psor].

5903 tokens (53%) have a non-empty value of Number. 3744 types (86%) occur at least once with a non-empty value of Number. 2386 lemmas (78%) occur at least once with a non-empty value of Number. The feature is used with 9 part-of-speech tags: NOUN (2528; 23% instances), ADJ (1409; 13% instances), VERB (692; 6% instances), PROPN (545; 5% instances), AUX (287; 3% instances), DET (275; 2% instances), PRON (134; 1% instances), NUM (32; 0% instances), ADV (1; 0% instances).

NOUN

2528 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Animacy=EMPTY (1389; 55%).

NOUN tokens may have the following values of Number:

Paradigm lětoSingDualPlur
Case=Acclěto
Case=Genlětalět, lětow
Case=Loclěće, lětulětomajlětach
Case=Nomlěto

ADJ

1409 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Animacy=EMPTY (1249; 89%), Voice=EMPTY (1220; 87%), VerbForm=EMPTY (1219; 87%), Degree=EMPTY (897; 64%).

ADJ tokens may have the following values of Number:

Paradigm serbskiSingDualPlur
Animacy=Inan|Case=Acc|Degree=Pos|Gender=Mascserbskej
Case=Acc|Degree=Pos|Gender=Mascserbski
Case=Acc|Degree=Pos|Gender=Neutserbske
Case=Acc|Gender=Femserbsku
Case=Dat|Gender=Mascserbskemu
Case=Dat|Gender=Femserbskim
Case=Gen|Degree=Pos|Gender=Femserbskeje
Case=Gen|Gender=MascSerbskehoserbskich
Case=Gen|Gender=Femserbskeje
Case=Ins|Gender=Femserbskej, serbsku
Case=Loc|Degree=Pos|Gender=MascSerbskim
Case=Loc|Gender=MascSerbskim
Case=Loc|Gender=Femserbskej
Case=Nom|Degree=Pos|Gender=MascSerbski, SERBSKI
Case=Nom|Degree=Pos|Gender=Femserbska
Case=Nom|Gender=Femserbska
Case=Nom|Gender=Neutserbske

VERB

692 VERB tokens (84% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (642; 93%), Mood=Ind (629; 91%), Person=3 (594; 86%), Tense=Pres (433; 63%).

VERB tokens may have the following values of Number:

Paradigm měćSingDualPlur
Animacy=Inan|Gender=Masc|Tense=Past|VerbForm=Part|Voice=Actmał
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Actměł
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Finnjeměješe
Mood=Ind|Person=3|Tense=Past|VerbForm=Finměješemějachu
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finma, nimamatejmaja, nimaja
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|VerbType=Modma, nimamaja
Mood=Ind|Tense=Pres|VerbForm=Finmaja

PROPN

545 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (281; 52%).

PROPN tokens may have the following values of Number:

Paradigm WikipedijaSingPlur
Case=AccWikipediju
Case=GenWikipedijeWikipedijow
Case=LocWikipediji
Case=NomWikipedija

Number seems to be lexical feature of PROPN. 99% lemmas (324) occur only with one value of Number.

AUX

287 AUX tokens (99% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (285; 99%), Person=3 (283; 99%), Mood=Ind (275; 96%), Voice=EMPTY (242; 84%), Tense=Pres (194; 68%).

AUX tokens may have the following values of Number:

Paradigm byćSingDualPlur
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Actbył
Gender=Fem|Tense=Past|VerbForm=Part|Voice=Actbyła
Mood=Cnd|Person=3|VerbForm=Finbybychu
Mood=Ind|Person=2|Tense=Pres|VerbForm=Finsy
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Finnjebuchu
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Finnjejenjejsu, njesu
Mood=Ind|Person=3|Tense=Fut|VerbForm=Finbudu, budźe
Mood=Ind|Person=3|Tense=Past|VerbForm=Finbě, buběštejběchu, buchu
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Passbubuštejbuchu
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finjestej, stajsu
Mood=Ind|Person=3|VerbForm=Fin|Voice=Passbuchu

DET

275 DET tokens (84% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Abbr=EMPTY (239; 87%), Number[psor]=EMPTY (230; 84%), Person=EMPTY (230; 84%), Poss=EMPTY (201; 73%), Animacy=EMPTY (185; 67%).

DET tokens may have the following values of Number:

Paradigm kotryžSingDualPlur
Animacy=Anim|Case=Dat|Gender=Masckotrymž
Animacy=Anim|Case=Nom|Gender=Masckotryžkotřiž
Animacy=Inan|Case=Acc|Gender=Masckotryž
Animacy=Inan|Case=Gen|Gender=Masckotrychž
Animacy=Inan|Case=Loc|Gender=Masckotrychž
Animacy=Inan|Case=Nom|Gender=Masckotryž, kotrežkotrež
Case=Gen|Gender=Masckotrehož
Case=Gen|Gender=Femkotrejež
Case=Ins|Gender=Femkotrymiž
Case=Loc|Gender=Masckotrymž
Case=Loc|Gender=Femkotrejžkotrychž
Case=Loc|Gender=Neutkotrychž
Case=Nom|Gender=Masckotryžkotrež
Case=Nom|Gender=Femkotražkotrež
Case=Nom|Gender=Neutkotrežkotrejžkotrež

PRON

134 PRON tokens (40% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (134; 100%), Gender=Neut (83; 62%), Person=EMPTY (76; 57%).

PRON tokens may have the following values of Number:

Paradigm wónSingDualPlur
Animacy=Anim|Case=Nom|Gender=MascWoni
Animacy=Inan|Case=Acc|Gender=Mascje
Animacy=Nhum|Case=Acc|Gender=Mascjeho
Case=Acc|Gender=Mascjón, jeho
Case=Acc|Gender=Femju, njuje
Case=Accje
Case=Dat|Gender=FemJej, jeje, njej
Case=Gen|Gender=Mascnich
Case=Gen|Gender=Femnjeje
Case=Ins|Gender=Neutnimi
Case=Loc|Gender=Mascnim
Case=Loc|Gender=Neutnim
Case=Nom|Gender=Mascwón
Case=Nom|Gender=Femwonawone
Case=Nom|Gender=Neutwono, wonewone
Case=NomWonej
Gender=Mascjón

NUM

32 NUM tokens (8% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (31; 97%).

NUM tokens may have the following values of Number:

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: Degree=Pos (1; 100%), PronType=EMPTY (1; 100%).

ADV tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (1083; 99%), VERB –[nsubj]–> NOUN (348; 91%), NOUN –[nmod]–> NOUN (309; 58%), VERB –[obl]–> NOUN (235; 56%), NOUN –[conj]–> NOUN (211; 89%), NOUN –[det]–> DET (179; 83%), ADJ –[cop]–> AUX (137; 96%), NOUN –[nmod]–> PROPN (108; 64%), PROPN –[conj]–> PROPN (82; 93%), NOUN –[cop]–> AUX (79; 91%).