Treebank Statistics: UD_Bhojpuri-BHTB: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
4276 tokens (64%) have a non-empty value of Number
.
1395 types (83%) occur at least once with a non-empty value of Number
.
1341 lemmas (82%) occur at least once with a non-empty value of Number
.
The feature is used with 14 part-of-speech tags: NOUN (1628; 24% instances), ADP (579; 9% instances), VERB (546; 8% instances), PROPN (395; 6% instances), AUX (277; 4% instances), PRON (276; 4% instances), DET (222; 3% instances), ADJ (111; 2% instances), PART (99; 1% instances), NUM (93; 1% instances), CCONJ (41; 1% instances), ADV (4; 0% instances), INTJ (4; 0% instances), SCONJ (1; 0% instances).
NOUN
1628 NOUN tokens (88% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Person=3 (1568; 96%), Gender=Masc (1227; 75%), Case=Nom (944; 58%).
NOUN
tokens may have the following values of Number
:
Plur
(115; 7% of non-emptyNumber
): लोग, गायक, जानकारी, दिसाईं, पहिले, आदमी, आनंद, आर्थिक, ओने, कबीरSing
(1513; 93% of non-emptyNumber
): जी, रंग, देश, बिआह, भाषा, आजु, साल, बात, लोगन, साहित्यEMPTY
(226): जब, बिआह, तब, अब, पहिले, उहाँ, कथा, गवनई, चीफ, जहाँ
Paradigm बिआह | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | बिआह | |
Case=Acc|Gender=Fem | बिआह | |
Case=Nom|Gender=Masc | बिआह | बिआह |
Case=Nom|Gender=Fem | बिआह | |
Mood=Sub|VerbForm=Fin|Voice=Act | बिआहे |
Number
seems to be lexical feature of NOUN
. 97% lemmas (757) occur only with one value of Number
.
ADP
579 ADP tokens (59% of all ADP
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADP
and Number
co-occurred: Gender=Masc (566; 98%), AdpType=Post (556; 96%), Case=Acc (344; 59%).
ADP
tokens may have the following values of Number
:
Plur
(97; 17% of non-emptyNumber
): के, हमनीके, उठाके, लेके, करेके, पढ़िके, लगावेके, लगे, ले, लोगSing
(482; 83% of non-emptyNumber
): के, का, ले, वाला, खातिर, ओके, जाके, साथे, ओकराके, खड़ेEMPTY
(410): में, से, पर, के, खातिर, तबे, अतने, का, अपने, उहाँका
Paradigm का | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | के | के |
Case=Nom | के | |
Case=Nom|Gender=Masc | का, के | के |
Case=Nom|Gender=Masc|Person=3|Polite=Form | के | |
Case=Nom|Gender=Fem | के | |
Gender=Masc|Person=3 | के |
VERB
546 VERB tokens (71% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Person=3 (412; 75%), Aspect=EMPTY (368; 67%), Gender=Masc (367; 67%), Voice=EMPTY (340; 62%), VerbForm=EMPTY (332; 61%).
VERB
tokens may have the following values of Number
:
Plur
(106; 19% of non-emptyNumber
): चाहीं, होखे, होई, आवे, जाए, जाता, देबे, बा, मए, लागेSing
(440; 81% of non-emptyNumber
): बा, भइल, आइल, करे, क, कहल, ह, होखे, कइल, कतहींEMPTY
(221): हो, कर, ना, लागल, ह, करत, चलि, धीरे, पड़ल, भइला
Paradigm हो | Sing | Plur |
---|---|---|
Aspect=Perf|Gender=Masc|VerbForm=Part|Voice=Act | होखे | |
Aspect=Perf|Gender=Fem|VerbForm=Part | होखी | होई |
Aspect=Perf|Gender=Fem|VerbForm=Part|Voice=Act | होखी | |
Case=Acc|Gender=Masc|Person=3 | होखे | |
Case=Acc|Gender=Fem | होखी | |
Gender=Masc|Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act | होई | |
Gender=Masc|Person=3|VerbForm=Inf|Voice=Act | होखे | |
Gender=Masc|Person=3|Voice=Act | हो | हो |
Gender=Masc|Voice=Act | हो | |
Person=3|Voice=Act | हो |
Number
seems to be lexical feature of VERB
. 95% lemmas (215) occur only with one value of Number
.
PROPN
395 PROPN tokens (94% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Person=3 (392; 99%), Case=Nom (271; 69%), Gender=Masc (225; 57%).
PROPN
tokens may have the following values of Number
:
Plur
(3; 1% of non-emptyNumber
): हिन्दुस्तान, 25Sing
(392; 99% of non-emptyNumber
): भोजपुरी, सिंह, प्रियंका, राय, जी, डॉ., पाण्डेय, तिवारी, दिल्ली, द्विवेदीEMPTY
(26): पाती, डा॰, अंचल, टिप्पणिए, प्रो॰, 2012, 24, इटलिए, चोपड़ा, पूर्वांचल
Paradigm हिन्दुस्तान | Sing | Plur |
---|---|---|
Case=Acc | हिन्दुस्तान | |
Case=Nom | हिन्दुस्तान |
Number
seems to be lexical feature of PROPN
. 99% lemmas (181) occur only with one value of Number
.
AUX
277 AUX tokens (78% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Voice=EMPTY (235; 85%), Polite=EMPTY (234; 84%), Person=3 (218; 79%), Aspect=EMPTY (195; 70%), VerbForm=EMPTY (168; 61%), Case=Nom (156; 56%).
AUX
tokens may have the following values of Number
:
Plur
(15; 5% of non-emptyNumber
): बाड़न, रहलीं, गइल, चाहीं, जाए, जात, दीहें, देले, बाड़े, मारींSing
(262; 95% of non-emptyNumber
): बा, रहे, गइल, रहल, जाई, जाव, जा, रहीं, बानी, सकेलाEMPTY
(78): जा, गइल, बा, हो, बाड़न, दिहलसि, लागल, आइल, कइले, करी
Paradigm बा | Sing | Plur |
---|---|---|
Aspect=Perf|Gender=Masc|VerbForm=Part | बाड़े | |
Case=Nom|Gender=Masc|Person=3 | बा, बाड़न, बाड़ | |
Case=Nom|Gender=Fem|Person=3 | बा, बाड़ी | |
Case=Nom|Person=3 | बा, बानी | |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | बा | बाड़न |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | बा, बाटे | |
Mood=Sub|Person=3|VerbForm=Fin|Voice=Act | बाड़ें | |
Person=3|Polite=Form | बाड़ |
PRON
276 PRON tokens (82% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Aspect=EMPTY (230; 83%), VerbForm=EMPTY (228; 83%), Case=Nom (184; 67%), PronType=EMPTY (178; 64%), Person=3 (142; 51%).
PRON
tokens may have the following values of Number
:
Plur
(77; 28% of non-emptyNumber
): ओकरा, हम, आम, एकरा, एकदम, कलम, तमाम, दरपन, बनाम, हमनीSing
(199; 72% of non-emptyNumber
): अपना, हमरा, ऊ, आपन, हमनी, रउरा, हमार, बिना, उनुकर, उनुकाEMPTY
(59): ऊ, केहूँ, काहे, कहीं, एकर, कहाँ, ईहे, उहाँ, एकहूँ, कबो
Paradigm हम | Sing | Plur |
---|---|---|
Case=Acc,Erg | हम | |
Case=Nom | हम | हम, हमनी |
DET
222 DET tokens (63% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: NumType=EMPTY (222; 100%), Person=3 (207; 93%), PronType=EMPTY (191; 86%), Case=Nom (183; 82%), Gender=Masc (130; 59%).
DET
tokens may have the following values of Number
:
Plur
(21; 9% of non-emptyNumber
): कुछु, आजु, सब, सबले, असहिष्णु, ओकरा, ओहु, जासु, जेकरा, पढ़सुSing
(201; 91% of non-emptyNumber
): ई, कवनो, एह, अइसन, जवन, जवना, एही, ओह, सभे, कतनाEMPTY
(131): एह, कुछ, ओकर, ओह, हर, अब, आजु, ई, फेरु, ईहो
Paradigm आजु | Sing | Plur |
---|---|---|
आजुओ | आजु |
Number
seems to be lexical feature of DET
. 94% lemmas (49) occur only with one value of Number
.
ADJ
111 ADJ tokens (45% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Person=3 (98; 88%), Case=Nom (96; 86%), Gender=Masc (95; 86%).
ADJ
tokens may have the following values of Number
:
Plur
(3; 3% of non-emptyNumber
): छोट, चोटSing
(108; 97% of non-emptyNumber
): पूरा, बड़, छोट, तरह, नवका, बड़हन, भोजपुरिया, अढ़ाई, अश्लील, आधाEMPTY
(138): सांस्कृतिक, तथाकथित, प, खास, चुपचाप, जरूरी, आखिरी, आसान, काव्य, सहज
Paradigm छोट | Sing | Plur |
---|---|---|
Case=Acc | छोट | |
Case=Nom | छोट | छोट |
Number
seems to be lexical feature of ADJ
. 99% lemmas (69) occur only with one value of Number
.
PART
99 PART tokens (52% of all PART
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PART
and Number
co-occurred: Person=3 (94; 95%), Case=Nom (89; 90%), Gender=Masc (79; 80%).
PART
tokens may have the following values of Number
:
Plur
(17; 17% of non-emptyNumber
): नइखे, बहुते, त, तिकवते, नाहींSing
(82; 83% of non-emptyNumber
): त, ना, बस, गमगमावे, घटना, नइखे, अतना, अलावे, केहू, खालीEMPTY
(93): ना, त, नइखे, भर, ढेर, तनिको, बनवले, बिना, भी, सँ
Paradigm त | Sing | Plur |
---|---|---|
Gender=Masc | त | त |
Gender=Fem | त |
NUM
93 NUM tokens (62% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=EMPTY (93; 100%), Person=3 (88; 95%), Case=Nom (87; 94%), Gender=Masc (75; 81%).
NUM
tokens may have the following values of Number
:
Plur
(21; 23% of non-emptyNumber
): लोग, कलिग, उमंग, सन, सभSing
(72; 77% of non-emptyNumber
): एगो, गो, दू, 5, छठवां, दोसर, दोसरा, दोसरो, सिलसिला, २०१२EMPTY
(56): एक, कुछ, अनकस, बाकि, 12, 120, 2011, 75, आठ, एगो
Paradigm सभ | Sing | Plur |
---|---|---|
Gender=Masc | सभ | |
PronType=Prs | सभ |
Number
seems to be lexical feature of NUM
. 96% lemmas (27) occur only with one value of Number
.
CCONJ
41 CCONJ tokens (27% of all CCONJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which CCONJ
and Number
co-occurred: Case=Nom (30; 73%), Gender=Masc (30; 73%), Person=3 (30; 73%).
CCONJ
tokens may have the following values of Number
:
Plur
(4; 10% of non-emptyNumber
): आ, फगुआ, रउँआSing
(37; 90% of non-emptyNumber
): बाकिर, अउर, फगुआ, भा, राउर, आखिर, खम्भा, आ, आउरEMPTY
(110): आ, बाकिर, अउर, आउर, खैर, बलुक, सचहूं
Paradigm आ | Sing | Plur |
---|---|---|
Gender=Fem | आ | |
आ | आ |
ADV
4 ADV tokens (13% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: Gender=Masc (3; 75%).
ADV
tokens may have the following values of Number
:
Plur
(1; 25% of non-emptyNumber
): नाहिंएSing
(3; 75% of non-emptyNumber
): आजुओ, जल्दी, शुरूEMPTY
(27): जइसे, हिन्दी, गद्य, ललित, सभ्य, आनन्द, आसानी, जरूर, जल्दी, जसहीं
INTJ
4 INTJ tokens (80% of all INTJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which INTJ
and Number
co-occurred: Case=Acc (4; 100%), Gender=Masc (4; 100%).
INTJ
tokens may have the following values of Number
:
Sing
(4; 100% of non-emptyNumber
): गहरे, अरे, दोसरेEMPTY
(1): अजी
SCONJ
1 SCONJ tokens (1% of all SCONJ
tokens) have a non-empty value of Number
.
SCONJ
tokens may have the following values of Number
:
Plur
(1; 100% of non-emptyNumber
): तकलेEMPTY
(117): कि, त, काहेंकि, निकलि, बाकि, लपकि, आँखि, कोच्चि, प्रवृत्ति
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[nmod]–> NOUN (216; 73%),
NOUN –[compound]–> NOUN (188; 77%),
VERB –[compound]–> NOUN (182; 54%),
PROPN –[compound]–> PROPN (137; 93%),
VERB –[aux]–> AUX (117; 51%),
VERB –[nmod]–> NOUN (96; 54%),
NOUN –[compound]–> DET (74; 76%),
PROPN –[case]–> ADP (47; 55%),
NOUN –[nmod]–> PROPN (46; 73%),
NOUN –[compound]–> PROPN (41; 89%).