UD Sanskrit Vedic
Language: Sanskrit (code: sa)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.6 release.
The following people have contributed to making this treebank part of UD: Salvatore Scarlata, Elia Ackermann, Oliver Hellwig, Erica Biagetti, Paul Widmer, Sven Sellmer.
Repository: UD_Sanskrit-Vedic
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: nonfiction
Questions, comments? General annotation questions (either Sanskrit-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [hellwig7 (æt) gmx • de]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually in non-UD style, automatically converted to UD |
UPOS | assigned by a program, with some manual corrections, but not a full manual verification |
XPOS | annotated manually in non-UD style, automatically converted to UD |
Features | annotated manually in non-UD style, automatically converted to UD |
Relations | annotated manually, natively in UD style |
Description
The Treebank of Vedic Sanskrit contains 4,000 sentences with 27,000 words chosen from metrical and prose passages of the Ṛgveda (RV), the Śaunaka recension of the Atharvaveda (ŚS), the Maitrāyaṇīsaṃhitā (MS), and the Aitareya- (AB) and Śatapatha-Brāhmaṇas (ŚB).
Lexical and morpho-syntactic information was generated with tagging software and manually validated. POS tags were induced automatically from the morpho-syntactic information of each word.
Vedic Sanskrit is an ancient Indo-Aryan language, one of the oldest transmitted Indo-European languages and the precursor of Classical Sanskrit. The relatively large corpus of Vedic poetry and prose is critical for the reconstruction of the early linguistic history of Indo-European and important as a source for socio-cultural developments in South Asia during the second and first millennia BCE.
The composition of the Vedic treebank is motivated by the need for a resource that can be used for data-driven, quantitatively robust diachronic and synchronic investigations of linguistic phenomena in, and starting with, the oldest layers of Vedic Sanskrit.
References
Annotation and composition of this treebank are described in detail in the following paper:
@inproceedings{hellwig-vtb-lrec-2020,
author = {Hellwig, Oliver and Scarlata, Salvatore and Ackermann, Elia and Widmer, Paul},
title = {The Treebank of {Vedic Sanskrit}},
booktitle = {Proceedings of the LREC},
year = {2020}
}
An updated overview of the annotation procedure and coverage can be found here:
@article{vedic-parser,
title = {Data-driven Dependency Parsing of {V}edic {S}anskrit},
author = {Hellwig, Oliver and Nehrdich, Sebastian and Sellmer, Sven},
journal = {Language Resources \& Evaluation},
volume = {57},
pages = {1173--1206},
year = {2023}
}
Train-test split
Following the UD recommendations, documents are kept together when generating the train, dev and test splits. For the Vedic data, a “document” is a hymn (metrical texts) or a chapter (prose texts). Some of these documents are incomplete, meaning that only part of the chapter or hymn was annotated. This happens quite often with the Rigveda.
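The idea of keeping documents together can be illustrated with a minimal sketch. It is not the procedure actually used for this treebank: the `doc_id` keys, the 10% dev and test ratios, and the random seed are all assumptions made for the example.

```python
import random
from collections import defaultdict


def split_by_document(sentences, dev_ratio=0.1, test_ratio=0.1, seed=0):
    """Assign whole documents to train/dev/test so that no document is split.

    `sentences` is an iterable of (doc_id, sentence) pairs, where doc_id is a
    hypothetical identifier of the hymn or chapter the sentence belongs to.
    The ratios apply to the number of documents, not to the number of sentences.
    """
    by_doc = defaultdict(list)
    for doc_id, sentence in sentences:
        by_doc[doc_id].append(sentence)

    doc_ids = sorted(by_doc)  # deterministic order before shuffling
    random.Random(seed).shuffle(doc_ids)

    n_dev = int(len(doc_ids) * dev_ratio)
    n_test = int(len(doc_ids) * test_ratio)

    # Each document receives exactly one split label.
    assignment = {}
    for i, doc_id in enumerate(doc_ids):
        if i < n_dev:
            assignment[doc_id] = "dev"
        elif i < n_dev + n_test:
            assignment[doc_id] = "test"
        else:
            assignment[doc_id] = "train"

    splits = {"train": [], "dev": [], "test": []}
    for doc_id in doc_ids:
        splits[assignment[doc_id]].extend(by_doc[doc_id])
    return splits, assignment


# Toy input: 20 invented documents with 3 placeholder sentences each.
toy = [(f"doc{d:02d}", f"doc{d:02d}-sent{s}") for d in range(20) for s in range(3)]
splits, assignment = split_by_document(toy)
```

Because the ratios are applied to documents, the sentence counts of the resulting splits depend on document length; a partially annotated hymn simply contributes fewer sentences to whichever split it lands in.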
Acknowledgments
The annotation has been performed by Salvatore Scarlata, Oliver Hellwig, Elia Ackermann, Erica Biagetti, and Sven Sellmer.
Statistics of UD Sanskrit Vedic
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – SCONJ – VERB
Features
Case – Compound – Gender – Mood – Number – Person – Tense – VerbForm – Voice
Relations
acl – acl:attr – acl:cont – acl:crel – acl:dpct – acl:pred – acl:ptcp – acl:relcl – advcl – advcl:caus – advcl:ccomp – advcl:concess – advcl:cond – advcl:consec – advcl:dpct – advcl:fin – advcl:lcl – advcl:manner – advcl:tcl – advmod – amod – appos – aux – case – case:sim – cc – ccomp – ccomp:rel – compound – compound:coord – compound:name – conj – cop – csubj – det – discourse – dislocated – fixed – flat – iobj – mark – mark:sim – nmod – nmod:appos – nmod:pred – nsubj – nummod – obj – obl – obl:agent – obl:benef – obl:goal – obl:grad – obl:instr – obl:lmod – obl:manner – obl:path – obl:soc – obl:source – obl:tmod – orphan – parataxis – root – vocative – xcomp – xcomp:result
Tokenization and Word Segmentation
- This corpus contains 27182 sentences and 206440 tokens.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus does not contain words that contain both letters and punctuation.
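Counts of this kind can in principle be recomputed from the CoNLL-U files. The following is a rough sketch, assuming the standard 10-column CoNLL-U layout; the function name `conllu_counts` and the two toy sentences are invented for illustration and are not taken from the treebank. It counts syntactic words (integer IDs), which coincide with tokens only when a corpus has no multiword tokens.

```python
def conllu_counts(text):
    """Count sentences, syntactic words, and SpaceAfter=No marks in CoNLL-U text."""
    sentences = 0
    words = 0
    space_after_no = 0
    in_sentence = False
    for line in text.splitlines():
        if not line.strip():
            in_sentence = False  # a blank line ends the current sentence
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        if not in_sentence:
            sentences += 1
            in_sentence = True
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        words += 1
        if "SpaceAfter=No" in cols[9].split("|"):
            space_after_no += 1
    return {"sentences": sentences, "words": words, "space_after_no": space_after_no}


def row(idx, form, lemma, upos, head, deprel):
    """Build one tab-separated token line with unused columns set to '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Two invented toy sentences, for illustration only.
sample = "\n".join([
    "# sent_id = toy-1",
    row(1, "agniḥ", "agni", "NOUN", 2, "nsubj"),
    row(2, "bhavati", "bhū", "VERB", 0, "root"),
    "",
    "# sent_id = toy-2",
    row(1, "saḥ", "tad", "PRON", 2, "nsubj"),
    row(2, "veda", "vid", "VERB", 0, "root"),
    "",
])
counts = conllu_counts(sample)
```

A `space_after_no` count of zero over the whole corpus would correspond to the statement that every token is followed by a space.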
Morphology
Tags
- This corpus uses 13 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, SCONJ, VERB
- This corpus does not use the following tags: PROPN, SYM, PUNCT, X
- This corpus contains 46 word types tagged as particles (PART): a, aha, an, api, aṅga, cana, cit, eva, evā, gha, ghā, ha, hi, id, iti, iva, ivā, kam, khalu, kila, kilā, kimu, kuvid, mā, na, nahi, nakir, nanu, no, nu, nvai, nvāva, nā, nū, samaha, sma, smā, svid, tu, u, vai, vāva, yatha, yathā, ū, ṇa
- This corpus contains 53 lemmas tagged as pronouns (PRON): adas, ama, aneka, anya, anyatara, anyonya, apara, atas, bahu, cana, eka, ekaika, ekaścana, enad, etad, eva, idam, itara, ka, katama, katara, kati, kaya, kaścana, kaścit, kiṃca, kutas, loga, mad, nakir, nima, para, pramara, pūrva, sa, sama, sarva, sima, tad, taka, tatas, tena, tva, tvad, tya, ubh, upa, viśva, yad, yaka, yat, yatama, yatara
- This corpus contains 27 lemmas tagged as determiners (DET): anyaka, anyatama, bahu, bhūri, bhūyiṣṭha, etāvat, ititha, iyat, jīva, kṛtsna, nikhila, nitya, pratyakṣa, puru, sakala, samāna, sva, svaka, tatitha, tāvaka, tāvat, ubhaya, viśva, yatitha, yādṛś, yāvat, āmuṣyāyaṇa
- Out of the above, 2 lemmas occurred sometimes as PRON and sometimes as DET: bahu, viśva
- This corpus contains 7 lemmas tagged as auxiliaries (AUX): as, bhū, car, e, i, sthā, ās
- Out of the above, 7 lemmas occurred sometimes as AUX and sometimes as VERB: as, bhū, car, e, i, sthā, ās
- There are 4 (de)verbal forms:
- Conv
- VERB: kṛtvā, hutvā, ādāya, bhūtvā, nidhāya, gṛhītvā, iṣṭvā, gatvā, ādhāya, upasamādhāya
- Gdv
- VERB: ādheyaḥ, deyam, hotavyam, ādṛtyam, anūcyaḥ, etavyam, upajīvanīyaḥ, kāryam, īḍyam, havyaḥ
- Inf
- VERB: avase, jīvase, dṛśe, pītaye, grahītum, vītaye, jīvitum, bhuje, dāvane, janayitavai
- Part
- AUX: san, santam, sat, sataḥ, satī, satām, satīm, bhavantaḥ, sadbhyaḥ, santaḥ
- VERB: vidvān, yajamānaḥ, _, yajamānam, yajamānasya, jātaḥ, uktam, samṛddham, yajamānāya, hutam
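The tag inventory figures above can be cross-checked with a few lines of code. This is only a sketch: the list of used tags is copied from this page, the set of 17 universal tags is the standard UD inventory, and the helper `lemmas_with_both` with its small list of (lemma, UPOS) pairs is a hypothetical illustration rather than treebank data.

```python
from collections import defaultdict

# The 17 universal POS tags defined by UD.
UPOS_ALL = {
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
}

# Tags reported as used on this page.
used = "ADJ ADP ADV AUX CCONJ DET INTJ NOUN NUM PART PRON SCONJ VERB".split()
unused = sorted(UPOS_ALL - set(used))


def lemmas_with_both(pairs, tag_a, tag_b):
    """Return the lemmas that occur with both tag_a and tag_b."""
    tags_by_lemma = defaultdict(set)
    for lemma, upos in pairs:
        tags_by_lemma[lemma].add(upos)
    return sorted(lemma for lemma, tags in tags_by_lemma.items()
                  if {tag_a, tag_b} <= tags)


# Invented (lemma, UPOS) observations, for illustration only.
pairs = [("as", "AUX"), ("as", "VERB"), ("bhū", "AUX"), ("bhū", "VERB"), ("kṛ", "VERB")]
ambiguous = lemmas_with_both(pairs, "AUX", "VERB")
```

Run over the real (lemma, UPOS) pairs of the corpus, the same helper would be one way to obtain lists such as the lemmas that occur as both PRON and DET.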
Nominal Features
- Fem
- ADJ: uttarām, uttarayā, prācīm, uttamām, prathamām, udīcīm, dhruvā, mahī, prathamā, revatīḥ
- AUX-Part: satī, satīm, satīḥ
- DET: viśvāḥ, svayā, pūrvīḥ, svā, svām, svāyai, svāyām, svāḥ, bahūḥ, tāvatīm
- NOUN: āpaḥ, apaḥ, vāc, vācam, prajāḥ, prāyaścittiḥ, devatāḥ, vācā, apām, pṛthivī
- NUM: tisraḥ, dvābhyām, catasraḥ, dve, tisṛbhiḥ, pañca, ekā, ekām, catasṛbhiḥ, daśa
- PRON: sā, tām, tāḥ, eṣā, yā, iyam, yāḥ, etāḥ, etām, kā
- VERB-Gdv: kāryā, dīkṣaṇīyāyāḥ, paridhānīyā, hotavye, dīkṣaṇīyāyām, havyā, kartavyāḥ, anugeyāḥ, anvārambhaṇīyām, anvārambhaṇīyāyām
- VERB-Part: kṛtā, upahūtā, anvārabdhāyām, hutāyām, uśatīḥ, anvāyattā, anūktāyām, hutāḥ, jātāḥ, pratiṣṭhitā
- Masc
- ADJ: prathamaḥ, dakṣiṇam, viśve, mayaḥ, uttaram, vid, pratyaṅ, ūrdhvaḥ, dakṣiṇena, priyaḥ
- ADV: sa, saḥ, tad, pūrvam, tasmāt, tam
- AUX-Part: san, santam, sataḥ, satām, bhavantaḥ, sadbhyaḥ, santaḥ, tiṣṭhan
- DET: viśve, sve, ubhayoḥ, bahavaḥ, svena, svāt, viśvāḥ, tāvantam, tāvataḥ, viśvebhiḥ
- NOUN: agniḥ, devāḥ, agnim, agne, indraḥ, indra, prajāpatiḥ, lokam, yajñam, agneḥ
- NUM: eke, ekaḥ, trayaḥ, dvau, pañca, sapta, ṣaṭ, trīn, ekam, catvāraḥ
- PRON: yaḥ, saḥ, sa, tam, asya, enam, te, eṣa, ayam, tasya
- VERB-Gdv: ādheyaḥ, anūcyaḥ, upajīvanīyaḥ, īḍyam, havyaḥ, īḍyaḥ, cityaḥ, kāryaḥ, labhyaḥ, anumādyaḥ
- VERB-Part: vidvān, yajamānaḥ, yajamānam, yajamānasya, jātaḥ, yajamānāya, udite, yuktaḥ, jātam, pratiṣṭhitaḥ
- Neut
- ADJ: priyam, bṛhat, uru, bhūyaḥ, paramam, mahat, samānam, mahi, tṛtīyam, parame
- ADV: tat, etat, tad, idam, etad, adaḥ, kad, kim, aparam, pūrvam
- AUX-Part: sat, sataḥ, sati, satā
- DET: svena, viśvā, viśvāni, viśvam, svam, sve, viśvasmāt, bahu, purū, viśvasya
- NOUN: brahma, annam, manaḥ, namaḥ, manasā, rūpam, agnihotram, ahar, jyotiḥ, cakṣuḥ
- NUM: śatam, sapta, ekam, dvādaśa, trīṇi, sahasram, daśa, dve, nava, pañca
- PRON: tat, yat, etat, idam, sarvam, tena, kim, tāni, enat, etāni
- VERB-Gdv: deyam, hotavyam, ādṛtyam, etavyam, kāryam, bhogyam, pramaditavyam, anūcyam, kartavyam, bhavyam
- VERB-Part: samṛddham, uktam, hutam, kṛtam, bhūtam, sat, iṣṭam, pratiṣṭhitam, adhyūḍham, gṛhītam
- Dual
- ADJ: uttarābhyām, ubhayoḥ, goptārau, mahī, uttare, vārtraghnau, īśvarau, mayyau, mithunau, samanasau
- AUX: bhavataḥ, syātām, sthaḥ, āstām, asāva, svaḥ, abhūtām, babhūvatuḥ, bhavathaḥ, bhūtas
- AUX-Part: satī
- DET: ubhayoḥ, ubhaye, purū, sve
- NOUN: aśvinā, dyāvāpṛthivī, agnī, rodasī, apānau, bāhubhyām, hastābhyām, pavitre, ājyabhāgau, agnibhyām
- NUM: dve, dvābhyām, dvau, dvayoḥ, dvā, śate, dvādaśau, trayastriṃśatau
- PRON: vām, tau, yuvam, ete, ubhe, ubhau, te, tayoḥ, tābhyām, etau
- VERB: bhavataḥ, sthaḥ, yātam, gopāyetām, pibatam, gopāyataḥ, pātam, aśnīyātām, bhavatam, gatam
- VERB-Gdv: hotavye, caryau, havyā, kartavyau, kartavye, kāryau, pratiṣṭhāpye, saṃsṛjye, tyājyau, upasaṃgrāhyau
- VERB-Part: aktau, akte, alaṃkṛtau, pariṣkṛtau, pūte, saṃpṛñcānau, saṃvidānau, jātau, praṇīyamānābhyām, samṛddhe
- Plur
- ADJ: viśve, amṛtāḥ, sudānavaḥ, vidaḥ, viśvā, śucayaḥ, kṛtaḥ, revatīḥ, priyāḥ, viśvadevyāvatīḥ
- AUX: bhavanti, syuḥ, santu, syāma, stha, āsan, santi, bhavantu, yanti, āsuḥ
- AUX-Part: satām, bhavantaḥ, sadbhyaḥ, santaḥ, satīḥ
- DET: viśvāḥ, viśvā, viśve, viśvāni, bahavaḥ, pūrvīḥ, viśveṣu, purū, viśvebhiḥ, svāḥ
- NOUN: devāḥ, āpaḥ, apaḥ, devānām, paśavaḥ, devān, prajāḥ, devebhyaḥ, paśūn, devatāḥ
- NUM: eke, pañca, tisraḥ, trayaḥ, trīṇi, sapta, catasraḥ, ṣaṭ, tisṛbhiḥ, aṣṭau
- PRON: naḥ, te, ye, vaḥ, tāḥ, tān, vayam, yāḥ, ete, eṣām
- VERB: āhuḥ, bhavanti, abruvan, yanti, ācakṣate, abhavan, juhvati, āyan, viduḥ, stha
- VERB-Gdv: kāryāḥ, anūcyāni, vaktavyāḥ, kartavyāḥ, pacanīyaiḥ, abhijayyāḥ, abhivādyāḥ, abhiyaṣṭavyāḥ, anugeyāḥ, anumeyāḥ
- VERB-Part: vadantaḥ, vidvāṃsaḥ, devayantaḥ, jātāḥ, yuktāḥ, uśatīḥ, vājayantaḥ, śuddhāḥ, hutāḥ, sutāsaḥ
- Sing
- ADJ: dakṣiṇam, priyam, prathamaḥ, uttaram, mayaḥ, bṛhat, uttarām, uttamam, samānam, paramam
- ADV: tat, etat, tad, idam, etad, sa, adaḥ, pūrvam, saḥ, kad
- AUX: bhavati, asi, syāt, āsīt, astu, asmi, san, asti, santam, asat
- AUX-Part: san, santam, sat, sataḥ, satī, satīm, sati, satā, tiṣṭhan
- DET: svena, svayā, sve, viśvam, svam, viśvasmāt, bahu, svā, viśvasya, svām
- NOUN: agniḥ, agnim, agne, brahma, indraḥ, indra, prajāpatiḥ, lokam, annam, yajñam
- NUM: śatam, ekaḥ, ekam, sapta, dvādaśa, sahasram, daśa, nava, ekā, ekām
- PRON: yaḥ, tat, saḥ, yat, sa, te, tvā, tam, asya, enam
- VERB: bhavati, veda, asi, uvāca, āha, juhoti, juhuyāt, karoti, vidvān, yajamānaḥ
- VERB-Gdv: ādheyaḥ, deyam, hotavyam, ādṛtyam, anūcyaḥ, etavyam, upajīvanīyaḥ, kāryam, īḍyam, havyaḥ
- VERB-Part: vidvān, yajamānaḥ, yajamānam, yajamānasya, jātaḥ, uktam, samṛddham, yajamānāya, hutam, jātam
- Abl
- ADJ: caturviṃśāt, ṣoḍaśāt, asataḥ, bhadrāt, uttarāt, bṛhataḥ, dvāviṃśāt, savyāt, uttarasmāt, ṣaṣṭhāt
- ADV: tasmāt
- DET: viśvasmāt, svāt, viśvebhyaḥ, viśvāt
- NOUN: divaḥ, agneḥ, lokāt, aṃhasaḥ, pṛthivyāḥ, tvāt, madhyāt, devebhyaḥ, pāśāt, yoneḥ
- NUM: sahasrāt, dvāviṃśateḥ, pañcabhyaḥ, saptabhyaḥ, tisṛbhyaḥ, trayastriṃśataḥ, tribhyaḥ, śatāt
- PRON: asmāt, tasmāt, asmat, mat, tvat, etasmāt, ebhyaḥ, tasyāḥ, sarvasmāt, yasmāt
- VERB-Gdv: dīkṣaṇīyāyāḥ, saṃdeśyāt, bībhatseyāt, prasneyāt
- VERB-Part: ālabdhāt, yajamānāt, abhitaptebhyaḥ, chinnāt, hutāt, jvalataḥ, nikhātāt, sataḥ, sthitāt, taptāt
- Acc
- ADJ: dakṣiṇam, uttaram, priyam, uttarām, prāñcam, uru, bṛhat, udañcam, uttamam, tṛtīyam
- ADV: tat, etat, tad, idam, etad, adaḥ, kad, pūrvam, aparam, kim
- AUX-Part: santam, sat, satīm, satīḥ
- DET: viśvā, viśvāḥ, viśvāni, viśvam, svam, purū, svām, tāvantam, tāvataḥ, bahu
- NOUN: agnim, lokam, yajñam, apaḥ, devān, vācam, indram, ātmānam, svargam, paśūn
- NUM: sapta, śatam, tisraḥ, pañca, ekam, trīṇi, dve, sahasram, trīn, daśa
- PRON: tvā, tam, enam, tat, naḥ, tām, yat, mā, etat, yam
- VERB-Gdv: īḍyam, bhogyam, hotavyam, praṇayanīyam, saṃbhāryam, vedyam, abhivādanīyam, darśanīyam, havyam, kāryam
- VERB-Part: yajamānam, jātam, juṣṭam, sutam, bhūtam, saṃmitam, kriyamāṇam, gṛhītam, sahamānam, yuktam
- Dat
- ADJ: mahe, mahate, sviṣṭakṛte, niyutvate, catuṣpade, kṛte, viśvebhyaḥ, bṛhate, dātre, ghne
- AUX-Part: sadbhyaḥ
- DET: svāyai, viśvasmai, bahave, svāya, ubhayāya, viśvebhyaḥ, yāvadbhyaḥ
- NOUN: agnaye, devebhyaḥ, indrāya, tvāya, brahmaṇe, pitṛbhyaḥ, ūtaye, somāya, samṛddhyai, ātmane
- NUM: śatāya, sahasrāya, caturbhyaḥ, dvābhyām, ekaviṃśataye, saptabhyaḥ, tribhyaḥ
- PRON: naḥ, asmai, te, me, tasmai, vaḥ, tebhyaḥ, asyai, mahyam, asmabhyam
- VERB-Gdv: madyāya, upasadyāya, vedyāya
- VERB-Part: yajamānāya, dāśuṣe, mathyamānāya, dviṣate, jātāya, ajyamānāya, sunvate, ucchrīyamāṇāya, praṇīyamānāya, āsīnāya
- Gen
- ADJ: viśvasya, mahaḥ, aryaḥ, bṛhataḥ, sviṣṭakṛtaḥ, uttarasya, svānām, samānānām, mahīnām, ubhayoḥ
- AUX-Part: sataḥ, satām
- DET: viśvasya, svasya, viśveṣām, bahūnām, ubhayoḥ, etāvataḥ, ubhayeṣām, viśvasyāḥ, viśvāsām, yāvatīnām
- NOUN: yajñasya, devānām, agneḥ, apām, divaḥ, indrasya, pṛthivyāḥ, sūryasya, devasya, paśūnām
- NUM: ekeṣām, dvayoḥ, sahasrasya, caturṇām, śatasya, ekasya, daśānām, ekasyāḥ, saptānām, trayāṇām
- PRON: asya, te, tasya, me, naḥ, yasya, vaḥ, mama, eṣām, teṣām
- VERB-Gdv: dīkṣaṇīyāyāḥ, geyānām, japyasya, prasthānīyāyāḥ, pratigṛhyasya, vadhyānām, īḍyasya
- VERB-Part: yajamānasya, kṛtasya, dīkṣitasya, sutasya, hutasya, uktasya, carataḥ, jātasya, bhūtasya, gṛhītasya
- Ins
- ADJ: uttarayā, dakṣiṇena, savyena, uttareṇa, gāyatreṇa, uttaraiḥ, uttarābhiḥ, uttarābhyām, jāgatena, parokṣeṇa
- ADV: tad, tena
- AUX-Part: satā
- DET: svena, svayā, viśvebhiḥ, svebhiḥ, viśvena, viśvābhiḥ, bahunā, svaiḥ, viśvaiḥ
- NOUN: manasā, vācā, adbhiḥ, paśubhiḥ, brahmaṇā, ṛcā, bhāgadheyena, chandasā, prajayā, devatayā
- NUM: dvābhyām, tisṛbhiḥ, catasṛbhiḥ, tribhiḥ, ekena, pañcabhiḥ, ṣaḍbhiḥ, caturbhiḥ, ekayā, saptabhiḥ
- PRON: tena, etena, yena, tayā, tvayā, etayā, anena, sarvaiḥ, tābhiḥ, kena
- VERB-Gdv: dveṣyeṇa, pacanīyaiḥ, bhavyena, kartvena, nirmanthyena, saṃsarjanīyaiḥ, vadhyena, ācamanīyābhiḥ
- VERB-Part: viduṣā, gṛhītena, yuktena, yajamānena, uditena, bhrājatā, bhūtayā, davidyutatyā, dīkṣitena, gatena
- Loc
- ADJ: dakṣiṇe, parame, uttame, tṛtīye, daghne, astamite, madhyame, mahati, puṇye, caturthe
- AUX-Part: sati
- DET: sve, ubhayoḥ, viśveṣu, svāyām, kṛtsne, nitye, samāne, tāvakeṣu, viśvāsu
- NOUN: agnau, loke, agre, madhye, āhavanīye, apsu, divi, yajñe, ante, deveṣu
- NUM: dvayoḥ, triṣu, ekasmin, pañcasu, tisṛṣu, catasṛṣu, daśasu, dvādaśasu, ekasyām, aṣṭāsu
- PRON: asmin, tasmin, mayi, etasmin, asyām, yasmin, tasyām, teṣu, sarveṣu, tvayi
- VERB-Gdv: dīkṣaṇīyāyām, anvārambhaṇīyāyām, bhavye, grāhye, jyeye
- VERB-Part: udite, yajamāne, hute, anvārabdhāyām, hutāyām, jāte, pariśrite, anūktāyām, saṃsthite, vaṣaṭkṛte
- Nom
- ADJ: prathamaḥ, mayaḥ, viśve, vid, pratyaṅ, ūrdhvaḥ, priyaḥ, dāḥ, śuciḥ, bhūyaḥ
- ADV: tat, sa, tad, idam, saḥ, etad, etat, adaḥ, kim, pūrvam
- AUX-Part: san, satī, sat, bhavantaḥ, santaḥ, tiṣṭhan
- DET: viśve, viśvāḥ, viśvam, viśvā, svam, bahavaḥ, svā, ubhaye, viśvāni, bahu
- NOUN: agniḥ, devāḥ, indraḥ, prajāpatiḥ, prāṇaḥ, āpaḥ, brahma, vāc, yajñaḥ, kāmaḥ
- NUM: eke, ekaḥ, sapta, trayaḥ, pañca, śatam, ekam, tisraḥ, dve, dvādaśa
- PRON: yaḥ, saḥ, sa, tat, yat, te, sā, eṣa, etat, tvam
- VERB-Gdv: ādheyaḥ, deyam, ādṛtyam, anūcyaḥ, etavyam, upajīvanīyaḥ, hotavyam, havyaḥ, kāryam, kāryā
- VERB-Part: vidvān, yajamānaḥ, jātaḥ, samṛddham, uktam, hutam, yuktaḥ, pratiṣṭhitaḥ, tiṣṭhan, uktaḥ
- Voc
- ADJ: bhagavan, vaso, bhagavaḥ, somyaiḥ, subhage, sudānavaḥ, adrivas, girvaṇaḥ, maghavan, revatīḥ
- NOUN: agne, indra, deva, aśvinā, soma, jātavedaḥ, devāḥ, marutaḥ, oṣadhe, āpaḥ
- NUM: sapta
- PRON: asau, pare, sarvāḥ
- VERB-Gdv: ardhya, kāmyāḥ, śaṃsya
- VERB-Part: yajamāna, jāta, pavamāna, khāte, bhṛta, cikitvaḥ, huta, hūta, iṣita, jñāte
Degree and Polarity
Verbal Features
- Imp
- AUX: astu, santu, bhava, edhi, bhavantu, bhavatu, bhūtu, asāni, bhūtana
- VERB: dhehi, bhava, anubrūhi, pāhi, ehi, etu, pātu, astu, kṛdhi, gaccha
- Ind
- AUX: bhavati, asi, bhavanti, āsīt, asmi, asti, stha, bhavataḥ, āsan, āsa
- VERB: bhavati, veda, asi, uvāca, āha, juhoti, āhuḥ, karoti, abravīt, abhavat
- Jus
- AUX: bhūt, bhuvam, bhūma, bhūḥ, bhūta, bhūvan
- VERB: hiṃsīḥ, dhāḥ, saṃvadiṣṭhāḥ, vidan, bhūt, vigāt, kaḥ, śaṃsiṣam, hāsiṣṭām, juṣanta
- Opt
- AUX: syāt, syuḥ, syāma, syām, syātām, bhavet, syāḥ, bhūyāsam, bhūyāt, bhūyāma
- VERB: juhuyāt, ālabheta, kuryāt, brūyāt, kāmayeta, kurvīta, dadyāt, japet, bhūyāsam, nirvapet
- Pot
- VERB: agrahaiṣyat, atrapsyat, abhaviṣyan, vyapatiṣyat, abhaviṣyat, abheṣyat, ajīviṣyam, aprākṣyaḥ, avakṣyam, avakṣyan
- Sub
- AUX: asat, asaḥ, asasi, asāni, asati, asan, bhavāti, asāma, asāva, bhuvat
- VERB: asat, karat, kṛṇavat, parṣi, dadhat, pṛcchāsi, bhavāti, ciketati, dadat, dhāḥ
- Fut
- AUX: bhaviṣyati, bhaviṣyanti, bhavitā, bhavitāraḥ, bhavitāsmi, bhaviṣyasi, bhaviṣyata, bhaviṣyāmi
- VERB: vyākhyāsyāmaḥ, bhaviṣyati, yakṣyamāṇaḥ, kariṣyati, vipatiṣyati, agrahaiṣyat, atrapsyat, atyeṣyanti, vakṣyāmaḥ, avitā
- VERB-Part: yakṣyamāṇaḥ, bhaviṣyat, hoṣyan, kariṣyataḥ, nirvapsyan, pravatsyan, snāsyan, kariṣyan, vetsyan, abhiṣṭoṣyan
- Past
- AUX: āsīt, āsan, āsa, āsuḥ, āstām, abhūt, bhūt, abhūvam, babhūvuḥ, āyan
- VERB: veda, uvāca, āha, āhuḥ, vidvān, abravīt, abhavat, _, asṛjata, anvāha
- VERB-Part: vidvān, _, jātaḥ, uktam, samṛddham, hutam, jātam, udite, kṛtam, yuktaḥ
- Pqp
- VERB: avāvaśanta, aśiśrayuḥ, cākan, acakriran, acucyavuḥ, adadhāvat, ajabhartana, ajagan, ajagmiran, ajuhuvuḥ
- Pres
- AUX: bhavati, asi, syāt, bhavanti, astu, asmi, san, asti, santam, syuḥ
- AUX-Part: san, santam, sat, sataḥ, satī, satām, satīm, bhavantaḥ, sadbhyaḥ, santaḥ
- VERB: bhavati, asi, juhoti, juhuyāt, karoti, yajamānaḥ, ālabheta, dadhāti, kuryāt, eti
- VERB-Part: yajamānaḥ, yajamānam, yajamānasya, yajamānāya, tiṣṭhan, sat, yajamāna, icchan, jāyamānaḥ, prajānan
- Pass
- VERB: kriyate, hūyate, gīyate, asṛjyata, mucyate, ucyate, kriyamāṇam, atimucyate, vidyate, vijñāyate
- VERB-Part: kriyamāṇam, mathyamānāya, ajyamānāya, ucchrīyamāṇāya, duhyamānā, praṇīyamānāya, ucyamāne, nidhīyamānam, parivīyamāṇāya, prahriyamāṇāya
Pronouns, Determiners, Quantifiers
- 1
- AUX: asmi, syāma, syām, asāni, smaḥ, abhūvam, asāma, asāva, bhuvam, bhūma
- VERB: bhūyāsam, havāmahe, veda, īmahe, upāse, vidma, vṛṇīmahe, prapadye, huve, kṛṇomi
- 2
- AUX: asi, stha, asaḥ, bhava, edhi, sthaḥ, asasi, sthana, abhūtana, syāḥ
- VERB: asi, dhehi, bhava, anubrūhi, pāhi, ehi, kṛdhi, stha, gaccha, jahi
- 3
- AUX: bhavati, syāt, bhavanti, āsīt, astu, asti, syuḥ, santu, asat, bhavataḥ
- VERB: bhavati, veda, uvāca, āha, juhoti, āhuḥ, juhuyāt, karoti, abravīt, abhavat
Other Features
- Compound
- Yes
- ADJ: _, prāc, viśva, udak, uttara, mahā, dakṣiṇa, kṛṣṇa, prathama, ardha
- ADV: pūrva, sarva, sama, tad
- DET: viśva, bahu, sva, _, puru, bhūri, ubhaya, jīva, samāna
- NOUN: _, deva, brahma, indra, yajña, soma, ājya, paśu, anna, aśva
- NUM: _, tri, eka, dvi, sahasra, śata, catur, dvādaśa, pañca, aṣṭa
- PRON: _, sarva, tad, pūrva, anya, yad, etad, apara, para, eka
- VERB-Gdv: _, abhivānya, anubandhya, anvārambhaṇīya, bhogya, citya, havya, upya, āhṛtya, ārabhya
- VERB-Part: _, yajamāna, huta, kṛta, saṃsthita, suta, chinna, dhṛta, jarat, pūta
Syntax
Auxiliary Verbs and Copula
- This corpus uses 2 lemmas as copulas (cop). Examples: as, bhū.
- This corpus uses 7 lemmas as auxiliaries (aux). Examples: bhū, as, i, ās, sthā, car, e.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (4)
- VERB--NOUN-Acc (7)
- VERB--NOUN-Nom (5193)
- VERB--PRON (1)
- VERB--PRON-Acc (3)
- VERB--PRON-Gen (2)
- VERB--PRON-Nom (3985)
- VERB-Conv--NOUN-Nom (45)
- VERB-Conv--PRON-Nom (17)
- VERB-Gdv--NOUN-Nom (76)
- VERB-Gdv--PRON-Nom (71)
- VERB-Inf--NOUN-Dat (4)
- VERB-Inf--NOUN-Gen (1)
- VERB-Inf--NOUN-Nom (12)
- VERB-Inf--PRON-Dat (2)
- VERB-Inf--PRON-Nom (1)
- VERB-Part--NOUN (1)
- VERB-Part--NOUN-Acc (1)
- VERB-Part--NOUN-Dat (1)
- VERB-Part--NOUN-Gen (7)
- VERB-Part--NOUN-Loc (181)
- VERB-Part--NOUN-Nom (614)
- VERB-Part--PRON-Acc (2)
- VERB-Part--PRON-Gen (11)
- VERB-Part--PRON-Loc (20)
- VERB-Part--PRON-Nom (350)
- obj
- VERB--NOUN (4)
- VERB--NOUN-Abl (3)
- VERB--NOUN-Acc (7177)
- VERB--NOUN-Acc-ADP(anu) (1)
- VERB--NOUN-Acc-ADP(apa) (1)
- VERB--NOUN-Acc-ADP(cit) (1)
- VERB--NOUN-Acc-ADP(upa) (1)
- VERB--NOUN-Dat (6)
- VERB--NOUN-Gen (118)
- VERB--NOUN-Ins (7)
- VERB--NOUN-Loc (5)
- VERB--NOUN-Nom (19)
- VERB--NOUN-Voc (1)
- VERB--PRON-Acc (3438)
- VERB--PRON-Acc-ADP(anu) (1)
- VERB--PRON-Acc-ADP(upa) (1)
- VERB--PRON-Dat (8)
- VERB--PRON-Gen (53)
- VERB--PRON-Loc (2)
- VERB--PRON-Nom (7)
- VERB-Conv--NOUN (2)
- VERB-Conv--NOUN-Abl (1)
- VERB-Conv--NOUN-Acc (1398)
- VERB-Conv--NOUN-Dat (1)
- VERB-Conv--NOUN-Gen (14)
- VERB-Conv--NOUN-Nom (1)
- VERB-Conv--PRON-Acc (190)
- VERB-Conv--PRON-Gen (3)
- VERB-Gdv--NOUN-Acc (2)
- VERB-Gdv--NOUN-Nom (1)
- VERB-Inf--NOUN-Abl (1)
- VERB-Inf--NOUN-Acc (28)
- VERB-Inf--NOUN-Dat (8)
- VERB-Inf--NOUN-Gen (8)
- VERB-Inf--NOUN-Nom (1)
- VERB-Inf--PRON-Acc (29)
- VERB-Part--NOUN (9)
- VERB-Part--NOUN-Acc (525)
- VERB-Part--NOUN-Acc-ADP(anu) (1)
- VERB-Part--NOUN-Dat (1)
- VERB-Part--NOUN-Gen (11)
- VERB-Part--NOUN-Ins (1)
- VERB-Part--NOUN-Loc (1)
- VERB-Part--PRON-Acc (84)
- VERB-Part--PRON-Dat (1)
- VERB-Part--PRON-Gen (2)
- iobj
- VERB--NOUN-Acc (238)
- VERB--NOUN-Acc-ADP(prati) (1)
- VERB--NOUN-Dat (530)
- VERB--NOUN-Dat-ADP(kaṃ) (1)
- VERB--NOUN-Gen (9)
- VERB--NOUN-Loc (3)
- VERB--PRON-Acc (393)
- VERB--PRON-Acc-ADP(abhi) (1)
- VERB--PRON-Dat (806)
- VERB--PRON-Dat-ADP(kaṃ) (1)
- VERB--PRON-Gen (15)
- VERB--PRON-Ins (4)
- VERB--PRON-Loc (3)
- VERB-Conv--NOUN-Acc (37)
- VERB-Conv--NOUN-Dat (28)
- VERB-Conv--NOUN-Ins (3)
- VERB-Conv--PRON-Acc (5)
- VERB-Conv--PRON-Dat (17)
- VERB-Gdv--NOUN-Dat (5)
- VERB-Gdv--NOUN-Gen (1)
- VERB-Gdv--PRON-Dat (2)
- VERB-Gdv--PRON-Gen (1)
- VERB-Inf--PRON-Acc (1)
- VERB-Inf--PRON-Dat (2)
- VERB-Part--NOUN (3)
- VERB-Part--NOUN-Dat (24)
- VERB-Part--NOUN-Gen (3)
- VERB-Part--NOUN-Ins (1)
- VERB-Part--PRON-Acc (3)
- VERB-Part--PRON-Dat (27)
- VERB-Part--PRON-Gen (13)
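The pattern labels in this list appear to combine the parent's UPOS and VerbForm with the child's UPOS and Case. The sketch below shows how such labels could be counted under that assumption; it is not the script that generated this page, it omits the `ADP(...)` suffix, and the token dictionaries and the toy sentence are invented for illustration.

```python
from collections import Counter

CORE_RELATIONS = {"nsubj", "obj", "iobj"}


def label(upos, feats, feature):
    """Return 'UPOS-Value' if the feature is present, otherwise just 'UPOS'."""
    value = feats.get(feature)
    return f"{upos}-{value}" if value else upos


def count_patterns(sentence):
    """Count (relation, 'PARENT--CHILD') patterns for verb parents with noun or pronoun children."""
    by_id = {tok["id"]: tok for tok in sentence}
    counts = Counter()
    for tok in sentence:
        if tok["deprel"] not in CORE_RELATIONS or tok["upos"] not in {"NOUN", "PRON"}:
            continue
        parent = by_id.get(tok["head"])
        if parent is None or parent["upos"] != "VERB":
            continue
        parent_label = label("VERB", parent["feats"], "VerbForm")
        child_label = label(tok["upos"], tok["feats"], "Case")
        counts[(tok["deprel"], f"{parent_label}--{child_label}")] += 1
    return counts


# Invented toy sentence, for illustration only.
toy_sentence = [
    {"id": 1, "upos": "NOUN", "feats": {"Case": "Acc"}, "head": 2, "deprel": "obj"},
    {"id": 2, "upos": "VERB", "feats": {"VerbForm": "Conv"}, "head": 4, "deprel": "advcl"},
    {"id": 3, "upos": "PRON", "feats": {"Case": "Nom"}, "head": 4, "deprel": "nsubj"},
    {"id": 4, "upos": "VERB", "feats": {}, "head": 0, "deprel": "root"},
]
pattern_counts = count_patterns(toy_sentence)
```

In this sketch a verb without a VerbForm feature is labelled plain `VERB`, which matches the way finite verbs are shown in the list above.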
Relations Overview
- This corpus uses 36 relation subtypes: acl:attr, acl:cont, acl:crel, acl:dpct, acl:pred, acl:ptcp, acl:relcl, advcl:caus, advcl:ccomp, advcl:concess, advcl:cond, advcl:consec, advcl:dpct, advcl:fin, advcl:lcl, advcl:manner, advcl:tcl, case:sim, ccomp:rel, compound:coord, compound:name, mark:sim, nmod:appos, nmod:pred, obl:agent, obl:benef, obl:goal, obl:grad, obl:instr, obl:lmod, obl:manner, obl:path, obl:soc, obl:source, obl:tmod, xcomp:result
- The following 7 relation types are not used in this corpus at all: expl, clf, list, goeswith, reparandum, punct, dep
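Both figures can be derived from the relation inventory listed in the Relations section of this page. The sketch below copies that inventory, treats every label containing a colon as a subtype, and compares the remaining base labels with the 37 universal relations of UD v2; the variable names are illustrative only.

```python
# The 37 universal dependency relations of UD v2.
UNIVERSAL_RELATIONS = set("""
acl advcl advmod amod appos aux case cc ccomp clf compound conj cop csubj dep
det discourse dislocated expl fixed flat goeswith iobj list mark nmod nsubj
nummod obj obl orphan parataxis punct reparandum root vocative xcomp
""".split())

# Relation labels as listed in the Relations section of this page.
corpus_relations = """
acl acl:attr acl:cont acl:crel acl:dpct acl:pred acl:ptcp acl:relcl advcl
advcl:caus advcl:ccomp advcl:concess advcl:cond advcl:consec advcl:dpct
advcl:fin advcl:lcl advcl:manner advcl:tcl advmod amod appos aux case case:sim
cc ccomp ccomp:rel compound compound:coord compound:name conj cop csubj det
discourse dislocated fixed flat iobj mark mark:sim nmod nmod:appos nmod:pred
nsubj nummod obj obl obl:agent obl:benef obl:goal obl:grad obl:instr obl:lmod
obl:manner obl:path obl:soc obl:source obl:tmod orphan parataxis root vocative
xcomp xcomp:result
""".split()

# A subtype is any label of the form base:subtype.
subtypes = [rel for rel in corpus_relations if ":" in rel]

# Base relations that occur either on their own or as the base of a subtype.
base_used = {rel.split(":")[0] for rel in corpus_relations}
unused = sorted(UNIVERSAL_RELATIONS - base_used)
```

With the inventory above, `subtypes` has 36 entries and `unused` contains the same seven relations as the list item, in alphabetical order.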