Treebank Statistics: UD_Bulgarian-BTB: Features: Definite
This feature is universal.
It occurs with 2 different values: Def
, Ind
.
61280 tokens (39%) have a non-empty value of Definite
.
22259 types (84%) occur at least once with a non-empty value of Definite
.
12399 lemmas (83%) occur at least once with a non-empty value of Definite
.
The feature is used with 10 part-of-speech tags: NOUN (33023; 21% instances), ADJ (13480; 9% instances), PROPN (8357; 5% instances), VERB (2664; 2% instances), NUM (2102; 1% instances), DET (793; 1% instances), ADV (499; 0% instances), AUX (246; 0% instances), PRON (115; 0% instances), ADP (1; 0% instances).
NOUN
33023 NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NOUN
and Definite
co-occurred: Number=Sing (23906; 72%).
NOUN
tokens may have the following values of Definite
:
Def
(12257; 37% of non-emptyDefinite
): хората, президентът, страната, края, държавата, правителството, закона, отбраната, шефът, началотоInd
(20766; 63% of non-emptyDefinite
): г., време, година, години, част, събрание, хора, път, страна, съветEMPTY
(1129): %, лв., млн., $, месеца, дни, лева, млрд., пъти, долара
Paradigm година | Ind | Def |
---|---|---|
Number=Sing | г., година | годината |
Number=Plur | г., години, г | годините |
ADJ
13480 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADJ
and Definite
co-occurred: Degree=Pos (12769; 95%), Aspect=EMPTY (11994; 89%), VerbForm=EMPTY (11994; 89%), Voice=EMPTY (11994; 89%), Number=Sing (9533; 71%).
ADJ
tokens may have the following values of Definite
:
Def
(6038; 45% of non-emptyDefinite
): народното, българската, другите, европейската, последните, цялата, новия, миналата, националната, новатаInd
(7442; 55% of non-emptyDefinite
): други, нова, 2001, друг, нови, 2000, голяма, български, различни, 1EMPTY
(111): т.нар., US, Нови, уважаеми, жп, държавен, народна, политически, важен, военноморско
Paradigm нов | Ind | Def |
---|---|---|
Degree=Pos|Gender=Masc|Number=Sing | нов | новия, новият |
Degree=Pos|Gender=Fem|Number=Sing | нова | новата |
Degree=Pos|Gender=Neut|Number=Sing | ново | новото |
Degree=Pos|Number=Plur | нови | новите |
Degree=Sup|Gender=Masc|Number=Sing | най-новият | |
Degree=Sup|Gender=Fem|Number=Sing | най-новата | |
Degree=Sup|Gender=Neut|Number=Sing | Най-новото | |
Degree=Sup|Number=Plur | най-нови |
PROPN
8357 PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PROPN
and Definite
co-occurred: Number=Sing (8207; 98%), Gender=Masc (5211; 62%).
PROPN
tokens may have the following values of Definite
:
Def
(279; 3% of non-emptyDefinite
): Балканите, ЕС, СДС, Конституцията, НИС, Дяволът, Евросъюза, ЦСКА, Евролевицата, САЩInd
(8078; 97% of non-emptyDefinite
): България, София, Иван, Европа, Петър, ЕС, Стоянов, СДС, Костов, ГеоргиEMPTY
(78): де, Р-300, ван, -, 2000, ал, ди, дела, дьо, 173
Paradigm европа | Ind | Def |
---|---|---|
Европа | Европата |
Definite
seems to be lexical feature of PROPN
. 98% lemmas (2863) occur only with one value of Definite
.
VERB
2664 VERB tokens (16% of all VERB
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which VERB
and Definite
co-occurred: Mood=EMPTY (2664; 100%), Person=EMPTY (2664; 100%), VerbForm=Part (2664; 100%), Aspect=Perf (2037; 76%), Number=Sing (1822; 68%), Voice=Act (1579; 59%), Tense=Past (1350; 51%).
VERB
tokens may have the following values of Definite
:
Def
(5; 0% of non-emptyDefinite
): останалите, Останалото, Уволнените, отличенитеInd
(2659; 100% of non-emptyDefinite
): имало, направил, дал, трябвало, заминал, направено, искал, казал, направили, дошълEMPTY
(14164): има, няма, може, трябва, каза, могат, съобщи, заяви, стана, имат
Paradigm остана | Ind | Def |
---|---|---|
Degree=Pos|Number=Plur | останалите | |
Gender=Masc|Number=Sing | останал | |
Gender=Fem|Number=Sing | останала | |
Gender=Neut|Number=Sing | Останалото | |
Number=Plur | останали | ОСТАНАЛИТЕ |
Definite
seems to be lexical feature of VERB
. 100% lemmas (994) occur only with one value of Definite
.
NUM
2102 NUM tokens (100% of all NUM
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NUM
and Definite
co-occurred: NumType=Card (2102; 100%), Number=Plur (1863; 89%), Gender=EMPTY (1587; 75%).
NUM
tokens may have the following values of Definite
:
Def
(145; 7% of non-emptyDefinite
): двамата, двете, двата, тримата, трите, четиримата, Единият, четирите, десетте, еднотоInd
(1957; 93% of non-emptyDefinite
): две, един, два, 2, една, 1, 3, три, 10, едноEMPTY
(3): 02, 08, 2000
Paradigm два | Ind | Def |
---|---|---|
Gender=Masc | два, 2 | двата |
Gender=Fem | две, 2 | двете |
Gender=Neut | две | двете |
2, две | двете |
Definite
seems to be lexical feature of NUM
. 97% lemmas (403) occur only with one value of Definite
.
DET
793 DET tokens (33% of all DET
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which DET
and Definite
co-occurred: Case=EMPTY (615; 78%), Number=Sing (599; 76%), Poss=Yes (512; 65%), PronType=Prs (512; 65%), Person=EMPTY (404; 51%).
DET
tokens may have the following values of Definite
:
Def
(394; 50% of non-emptyDefinite
): нашите, своите, нашата, своя, техните, неговата, своята, своето, тяхното, нейнитеInd
(399; 50% of non-emptyDefinite
): един, една, едно, наши, негово, своя, нищо, нещо, свой, своиEMPTY
(1640): тази, този, тези, това, всички, какво, всеки, всяка, някои, какви
Paradigm един | Ind | Def |
---|---|---|
Case=Nom|Gender=Masc|Number=Sing | един | единият |
Case=Nom|Gender=Fem|Number=Sing | едната | |
Case=Nom|Gender=Neut|Number=Sing | едно | |
Case=Nom|Number=Plur | едните | |
Gender=Masc|Number=Sing | единия | |
Gender=Fem|Number=Sing | една, един | |
Gender=Neut|Number=Sing | едното | |
Number=Plur | едни |
ADV
499 ADV tokens (8% of all ADV
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADV
and Definite
co-occurred: PronType=EMPTY (499; 100%), Degree=Pos (463; 93%).
ADV
tokens may have the following values of Definite
:
Def
(47; 9% of non-emptyDefinite
): повечето, малкото, Най-малкото, многото, по-малкотоInd
(452; 91% of non-emptyDefinite
): много, повече, малко, по-малко, най-много, най-малко, Многая, немалко, ПовечкоEMPTY
(6059): още, вчера, само, вече, когато, защото, обаче, сега, как, така
Paradigm много | Ind | Def |
---|---|---|
Degree=Pos | много, Многая | многото |
Degree=Sup | най-много |
AUX
246 AUX tokens (3% of all AUX
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which AUX
and Definite
co-occurred: Aspect=Imp (246; 100%), Mood=Ind (246; 100%), Person=EMPTY (246; 100%), Tense=EMPTY (246; 100%), VerbForm=Part (246; 100%), Voice=Act (246; 100%), Number=Sing (179; 73%).
AUX
tokens may have the following values of Definite
:
Ind
(246; 100% of non-emptyDefinite
): бил, били, била, билоEMPTY
(8888): да, е, ще, са, бе, бъде, беше, бяха, съм, бъдат
PRON
115 PRON tokens (1% of all PRON
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PRON
and Definite
co-occurred: Person=EMPTY (115; 100%), Poss=EMPTY (115; 100%), Reflex=EMPTY (115; 100%), Case=Nom (106; 92%), Number=Sing (87; 76%), Gender=Neut (83; 72%), PronType=Ind (71; 62%).
PRON
tokens may have the following values of Definite
:
Def
(26; 23% of non-emptyDefinite
): нещата, всичкото, Единият, Едната, нищотоInd
(89; 77% of non-emptyDefinite
): нищо, нещо, неколцина, няколко, един, едноEMPTY
(9979): се, си, това, той, му, които, го, ни, те, който
Paradigm никой | Ind | Def |
---|---|---|
нищо | нищото |
ADP
1 ADP tokens (0% of all ADP
tokens) have a non-empty value of Definite
.
ADP
tokens may have the following values of Definite
:
Ind
(1; 100% of non-emptyDefinite
): сравнениеEMPTY
(22095): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite
:
NOUN –[amod]–> ADJ (5775; 50%),
NOUN –[nmod]–> NOUN (4928; 51%),
NOUN –[conj]–> NOUN (1822; 83%),
NOUN –[nmod]–> PROPN (1744; 54%),
PROPN –[flat]–> PROPN (1538; 96%),
PROPN –[conj]–> PROPN (576; 99%),
ADJ –[obl]–> NOUN (377; 54%),
PROPN –[nmod]–> PROPN (333; 97%),
ADJ –[conj]–> ADJ (330; 91%),
PROPN –[nmod]–> NOUN (301; 89%).