Treebank Statistics: UD_Bulgarian-BTB: Features: Number
This feature is universal but the values Count
are language-specific.
It occurs with 4 different values: Count
, Plur
, Ptan
, Sing
.
87764 tokens (56%) have a non-empty value of Number
.
27713 types (105%) occur at least once with a non-empty value of Number
.
14050 lemmas (94%) occur at least once with a non-empty value of Number
.
The feature is used with 10 part-of-speech tags: NOUN (33927; 22% instances), VERB (16828; 11% instances), ADJ (13504; 9% instances), PROPN (8363; 5% instances), PRON (5368; 3% instances), AUX (4739; 3% instances), DET (2433; 2% instances), NUM (2102; 1% instances), ADV (499; 0% instances), ADP (1; 0% instances).
NOUN
33927 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Definite=Ind (20766; 61%).
NOUN
tokens may have the following values of Number
:
Count
(888; 3% of non-emptyNumber
): %, лв., млн., $, месеца, дни, лева, млрд., пъти, долараPlur
(8792; 26% of non-emptyNumber
): г., години, пари, страни, проблеми, представители, сили, промени, фирми, паритеPtan
(325; 1% of non-emptyNumber
): хората, хора, души, преговори, преговорите, финансите, боеприпаси, книжа, книжата, белезнициSing
(23922; 71% of non-emptyNumber
): г., време, година, част, президентът, страната, събрание, път, страна, краяEMPTY
(225): глава, собственост, партия, президент, интерес, въпрос, съюз, училище, въстание, изказване
Paradigm лев | Sing | Plur | Count |
---|---|---|---|
Definite=Def | лева, Левът | ||
Definite=Ind | лев | левове | |
лв., лева |
VERB
16828 VERB tokens (100% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Voice=Act (15495; 92%), Gender=EMPTY (15006; 89%), Definite=EMPTY (14164; 84%), VerbForm=Fin (14164; 84%), Mood=Ind (13916; 83%), Person=3 (11480; 68%), Tense=Pres (9528; 57%), Aspect=Imp (8679; 52%).
VERB
tokens may have the following values of Number
:
Plur
(5102; 30% of non-emptyNumber
): могат, имат, съобщиха, можем, имаме, работят, искат, правят, вземат, искамеSing
(11726; 70% of non-emptyNumber
): има, няма, може, трябва, каза, съобщи, заяви, стана, обяви, направи
Paradigm мога | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Tense=Imp|VerbForm=Part | можел | |
Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Part | могъл | |
Definite=Ind|Gender=Fem|Tense=Imp|VerbForm=Part | можела | |
Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Part | могла | |
Definite=Ind|Gender=Neut|Tense=Imp|VerbForm=Part | можело | |
Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Part | могло | |
Definite=Ind|Tense=Imp|VerbForm=Part | можели | |
Definite=Ind|Tense=Past|VerbForm=Part | могли | |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | можех | Можехме |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | можах | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | мога | можем |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | можеш | можете |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | можеше | можеха |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | можа | можаха |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | може | могат |
ADJ
13504 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (12793; 95%), Aspect=EMPTY (12018; 89%), VerbForm=EMPTY (12018; 89%), Voice=EMPTY (12018; 89%), Definite=Ind (7442; 55%).
ADJ
tokens may have the following values of Number
:
Plur
(3947; 29% of non-emptyNumber
): други, другите, последните, нови, новите, първите, различни, българските, големи, въоръженитеSing
(9557; 71% of non-emptyNumber
): народното, българската, нова, европейската, 2001, друг, цялата, 2000, голяма, новияEMPTY
(87): т.нар., US, жп, държавен, народна, политически, важен, военноморско, електронна, избирателна
Paradigm нов | Sing | Plur |
---|---|---|
Case=Voc|Degree=Pos|Gender=Masc | Нови | |
Definite=Def|Degree=Pos | новите | |
Definite=Def|Degree=Pos|Gender=Masc | новия, новият | |
Definite=Def|Degree=Pos|Gender=Fem | новата | |
Definite=Def|Degree=Pos|Gender=Neut | новото | |
Definite=Def|Degree=Sup|Gender=Masc | най-новият | |
Definite=Def|Degree=Sup|Gender=Fem | най-новата | |
Definite=Def|Degree=Sup|Gender=Neut | Най-новото | |
Definite=Ind|Degree=Pos | нови | |
Definite=Ind|Degree=Pos|Gender=Masc | нов | |
Definite=Ind|Degree=Pos|Gender=Fem | нова | |
Definite=Ind|Degree=Pos|Gender=Neut | ново | |
Definite=Ind|Degree=Sup | най-нови |
PROPN
8363 PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Definite=Ind (8078; 97%), Gender=Masc (5216; 62%).
PROPN
tokens may have the following values of Number
:
Plur
(142; 2% of non-emptyNumber
): САЩ, Балканите, БДЖ, ОДС, DM, Балкани, Гласове, Полимери, РМД-та, АлпиPtan
(8; 0% of non-emptyNumber
): Кремиковци, ОАЕ, Брадвари, ДрагалевциSing
(8213; 98% of non-emptyNumber
): България, София, Иван, ЕС, Европа, СДС, Петър, Стоянов, Костов, ГеоргиEMPTY
(72): де, Р-300, ван, -, 2000, ал, ди, дела, дьо, 173
Paradigm сдс | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc | СДС | |
Definite=Def|Gender=Neut | СДС-та | |
Definite=Ind|Gender=Masc | СДС |
Number
seems to be lexical feature of PROPN
. 100% lemmas (2907) occur only with one value of Number
.
PRON
5368 PRON tokens (53% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Poss=EMPTY (5368; 100%), Reflex=EMPTY (5368; 100%), PronType=Prs (3398; 63%), Case=Nom (3311; 62%).
PRON
tokens may have the following values of Number
:
Plur
(1421; 26% of non-emptyNumber
): които, те, ги, тях, нас, ни, ние, им, всички, виSing
(3947; 74% of non-emptyNumber
): това, той, го, който, тя, която, му, което, него, азEMPTY
(4726): се, си, му, ни, й, им, ми, себе, ви, ти
Paradigm аз | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc|Person=3 | го, него | |
Case=Acc|Gender=Fem|Person=3 | я, нея | |
Case=Acc|Gender=Neut|Person=3 | го, него | |
Case=Acc|Person=1 | ме, мен, мене | нас, ни |
Case=Acc|Person=2 | те, тебе, ви, вас, теб | вас, ви |
Case=Acc|Person=3 | ги, тях | |
Case=Dat|Gender=Masc|Person=3 | му, нему | |
Case=Dat|Gender=Fem|Person=3 | й | |
Case=Dat|Gender=Neut|Person=3 | му | |
Case=Dat|Person=1 | ми, мен, мене | ни |
Case=Dat|Person=2 | ти, ви | ви |
Case=Dat|Person=3 | им, тям | |
Case=Nom|Gender=Masc|Person=3 | той | |
Case=Nom|Gender=Fem|Person=3 | тя | |
Case=Nom|Gender=Neut|Person=3 | то | |
Case=Nom|Person=1 | аз | ние, ний |
Case=Nom|Person=2 | ти, вие | вие |
Case=Nom|Person=3 | те | |
Gender=Fem|Person=3 | й | |
Person=1 | ми | ни |
Person=2 | ти, ви | ви |
Person=3 | им |
AUX
4739 AUX tokens (52% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Mood=Ind (4600; 97%), Voice=Act (4600; 97%), VerbForm=Fin (4493; 95%), Aspect=Imp (4353; 92%), Person=3 (4059; 86%), Tense=Pres (3679; 78%).
AUX
tokens may have the following values of Number
:
Plur
(1309; 28% of non-emptyNumber
): са, бяха, бъдат, сме, били, сте, бъдем, биха, бихте, бяхтеSing
(3430; 72% of non-emptyNumber
): е, бе, бъде, беше, съм, би, бил, си, била, бихEMPTY
(4395): да, ще, е, са, бъдат, беше, би, бъде, съм
Paradigm съм | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Act | била | |
Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Act | било | |
Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Act | били | |
Mood=Cnd|Person=1|VerbForm=Fin | бих | бихме |
Mood=Cnd|Person=2|VerbForm=Fin | Би | бихте |
Mood=Cnd|Person=3|Tense=Past|VerbForm=Fin | би | |
Mood=Cnd|Person=3|VerbForm=Fin | биха | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Act | бях | бяхме |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | съм | сме |
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Act | беше | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act | си | сте |
Mood=Ind|Person=2|VerbForm=Fin|Voice=Act | бяхте | |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Act | бе, беше | бяха |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | е | са |
DET
2433 DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Person=EMPTY (2044; 84%), Poss=EMPTY (1921; 79%), Definite=EMPTY (1640; 67%), Case=EMPTY (1534; 63%).
DET
tokens may have the following values of Number
:
Plur
(713; 29% of non-emptyNumber
): тези, всички, нашите, някои, какви, своите, такива, техните, наши, тияSing
(1720; 71% of non-emptyNumber
): тази, този, това, един, какво, една, всеки, всяка, едно, своя
Paradigm този | Sing | Plur |
---|---|---|
Case=Nom|Gender=Fem | тази, тая, онази, тeзи | |
Case=Nom|Gender=Neut | това, онова, туй | |
Gender=Masc | този, тоя, оня, онзи | |
тези, тия, онези, ония |
NUM
2102 NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (2102; 100%), Definite=Ind (1957; 93%), Gender=EMPTY (1587; 75%).
NUM
tokens may have the following values of Number
:
Plur
(1863; 89% of non-emptyNumber
): две, два, 2, 3, три, 10, двамата, 20, двете, 000Sing
(239; 11% of non-emptyNumber
): един, една, 1, едно, половин, 0, Единият, едното, 0,1, 0.00EMPTY
(3): 02, 08, 2000
Number
seems to be lexical feature of NUM
. 100% lemmas (414) occur only with one value of Number
.
ADV
499 ADV tokens (8% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: PronType=EMPTY (499; 100%), Degree=Pos (463; 93%).
ADV
tokens may have the following values of Number
:
Plur
(499; 100% of non-emptyNumber
): много, повече, малко, повечето, по-малко, най-много, най-малко, малкото, Многая, Най-малкотоEMPTY
(6059): още, вчера, само, вече, когато, защото, обаче, сега, как, така
ADP
1 ADP tokens (0% of all ADP
tokens) have a non-empty value of Number
.
ADP
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): сравнениеEMPTY
(22095): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (11160; 97%),
NOUN –[nmod]–> NOUN (5930; 61%),
VERB –[nsubj]–> NOUN (4234; 93%),
VERB –[obj]–> NOUN (2782; 58%),
NOUN –[nmod]–> PROPN (2663; 82%),
VERB –[obl]–> NOUN (2632; 58%),
NOUN –[det]–> DET (1948; 98%),
VERB –[nsubj]–> PRON (1889; 98%),
NOUN –[conj]–> NOUN (1720; 78%),
PROPN –[flat]–> PROPN (1539; 96%).