UD Akkadian RIAO
Language: Akkadian (code: akk
)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v2.7 release.
The following people have contributed to making this treebank part of UD: Mikko Luukko, Aleksi Sahala, Sam Hardwick, Krister Lindén.
Repository: UD_Akkadian-RIAO
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 3.0
Genre: nonfiction, government
Questions, comments? General annotation questions (either Akkadian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [sam • hardwick (æt) iki • fi]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | assigned by a program, with some manual corrections, but not a full manual verification |
UPOS | annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion |
XPOS | annotated manually |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
162 royal inscriptions of four early Neo-Assyrian kings.
The data source for the treebank is 162 royal inscriptions of four early Neo-Assyrian kings: Aššur-dan II (r. 934–912 BCE), Adad-nerari II (r. 911–891 BCE), Tukulti-Ninurta II (r. 890–884 BCE) and Ashurnasirpal II (r. 883–859 BCE). Their inscriptions were published by A. Kirk Grayson in his 1991 volume Assyrian Rulers of the Early First Millennium B.C. I (1114 – 859 B.C.) (Royal Inscriptions of Mesopotamia, Assyrian Periods, 2), University of Toronto Press, Toronto.
UD_Akkadian-RIAO is a small treebank which consists of 22 277 words and 1845 sentences. This represents an intact subset of a total of 2211 sentences from the early Neo-Assyrian royal inscriptions of the tenth and ninth centuries BCE. These royal inscriptions were extracted from Oracc (Open Richly Annotated Cuneiform Corpus; http://oracc.museum.upenn.edu/riao/), where all Neo-Assyrian royal inscriptions are lemmatized word-for-word. The language of the corpus is Standard Babylonian, with occasional Assyrianisms, whereas “Akkadian” is the umbrella term for both Assyrian and Babylonian. The treebank was manually annotated following the UD annotation guidelines.
Acknowledgments
This treebank was carried out as part of the Centre of Excellence in Ancient Near Eastern Empires (ANEE) in cooperation with FIN-CLARIN and the Language Bank of Finland, taking place in Helsinki, and funded by the Academy of Finland. We would like to thank Niek Veldhuis (Berkeley), the initiator of the project, and David Bamman (Berkeley), who helped in setting up the first attempt for annotation on the Brat rapid annotation tool. For making this project possible, we are also indebted to Karen Radner, Jamie Novotny and Nathan Morello (all three LMU, Munich) and to Grant Frame and Steve Tinney (both UPenn, Philadelphia).
References
Mikko Luukko, Aleksi Sahala, Sam Hardwick, Krister Lindén. 2020. Akkadian Treebank for Early Neo-Assyrian Royal Inscriptions. In Proceedings of the 19th International Workshop on Treebanks and Linguistic Theories (TLT), Düsseldorf 2020. (https://www.aclweb.org/anthology/2020.tlt-1.11.pdf)
Statistics of UD Akkadian RIAO
POS Tags
ADJ – ADP – ADV – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – SCONJ – VERB – X
Features
Case – Gender – Mood – NounBase – Number – Person – Subordinative – Tense – Ventive – VerbForm – VerbStem
Relations
acl – acl:relcl – advcl – advmod – advmod:emph – amod – appos – case – cc – ccomp – conj – cop – csubj – dep – det – det:poss – discourse – fixed – iobj – mark – nmod – nmod:poss – nsubj – nummod – obj – obl – parataxis – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 1874 sentences, 20517 tokens and 23165 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus contains 75 types of words with spaces. Examples: 1 mē, 2 mē, 1 līm, 1 mē 20, lā pādâ, lā pādû, 2 līm, 3 mē, 8 mē, 3 līm, 7 mē, lā ādiru, 1 šūši, 2 mē 50, 5 mē, lā mēni, lā pādûte, 1 mē 50, 2 līm 8 mē, 5 līm, lā šanān, šakin māt, 1 līm 2 mē, 1 līm 5 mē, 1 līm arbaʾu mē 1 šūši, 1 mē 72, 10 līm, 2 mē 1 šūši, 3 mē 20, Zaba šapliti, arbaʾu mē, arbaʾu mē 1 šūši, arbaʾu mē 70, lā māgirūt, ša rēši, 1 līm šeš mē, 1 mē 40, 1 mē 74, 1 mē x, 1 šūši 9 līm 5 mē 74, 16 līm, 2 līm 5 mē, 2 līm 7 mē 2, 2 mē 57, 3 mē 26, 3 mē 32, 3 mē 70, 3 mē 90, 40 līm 7 līm 74, 50 līm
- This corpus contains 200 types of words that contain both letters and punctuation. Examples: Aššur-naṣir-apli, Adad-nerari, Tukulti-Ninurta, Til-ša-Zabdani, Aššur-dan, Til-Bari, Zaba-šapliti, Bit-Adini, ma’atta, Tamti-rabite, Tukulti-apil-Ešarra, Til-ša-Abtani, mar-Zamani, Til-ša-Abatani, Nur-Adad, arba’i, Amme-baʾli, Aššur-reša-iši, Araš-tua, Bit-Halupe, Libbi-ali, Tamti-rabiti, UD.1.KAM₂, Zaba-eliti, mar-Bahiani, Šalmanu-ašared, Šarrat-niphi, Azi-ili, Salmanu-ašared, Til-uli, Tukulti-Aššur-aṣbat, mar-Adini, Ahi-iababa, Aššur-rabi, Dur-Balaṭi, Dur-aduklimmu, Imgur-Enlil, Mušezib-Ninurta, Patti-hegalli, mar-Halupe, ma’dūte, rā’im, {URU}x, še’ī, 5-te-šu₂, Abul-tabira, Adad-aha-iddina, Adad-itʾi, Ahi-ramu, Ahlame-Armaya
- This corpus contains 2565 multi-word tokens. On average, one multi-word token consists of 2.03 syntactic words.
- There are 618 types of multi-word tokens. Examples: Aššurma, šallassunu, mātiya, bēliya, elišunu, šarrūtiya, kakkīya, šēpīya, bēlūtiya, sihirtiša, dēktašunu, ūmešuma, gimriša, pāniya, qāssu, ēkallišu, šumī, ālānišunu, kalîšina, ittallakuma, alpīšunu, šēpīšu, ṣēnīšunu, gabadibbīšu, ēnahma, dannūtišu, tidūkišunu, šumiya, šumšu, limētuššu, qātiya, abbīya, mātišu, anhūssu, apēlušinani, bēlišu, libbiya, uššēšu, āliya, dannūtišunu, nakrīšu, būšašunu, būšīšunu, ummānātišunu, šāninšu, ayyābīšu, bēltiya, issiya, qerebšu, ašrišu.
Morphology
Tags
- This corpus uses 13 UPOS tags out of 17 possible: ADJ, ADP, ADV, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, SCONJ, VERB, X
- This corpus does not use the following tags: AUX, INTJ, SYM, PUNCT
- This corpus contains 6 word types tagged as particles (PART): lā, lū, ma, mā, ni, ul
- This corpus contains 11 lemmas tagged as pronouns (PRON): _, ammar, anāku, attunu, mamman, mammāna, mannu, mīnu, yâši, ša, šuāti
- This corpus contains 14 lemmas tagged as determiners (DET): _, ammiu, annû, gabbu, kalû, kalāma, mimma, šuāti, šuātunu, šâtu, šâtunu, šû, šī, šū
- Out of the above, 2 lemmas occurred sometimes as PRON and sometimes as DET: _, šuāti
- This corpus contains 0 lemmas tagged as auxiliaries (AUX):
- There are 3 (de)verbal forms:
- Fin
- VERB: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- Inf
- VERB: šūzub, epēš, arāk, šumʾud, uppušu, pêli, šuknuše, hulluq, napāhi, šapāri
- Stat
- ADJ: ašarēdāku, kabtāku, šurruhāku, ṣīrāku, nâdāku, gešrāku, šitmurāku, ašṭāku, dandannāku, dannāku
- NOUN: labbāku, qarrādāku, zikarāku, bēlāku, uršānāku, šarrāku
- VERB: bēde, bēdāk, šaknū, ṣali, usbakūni, šaknu, bašû, labi, dān, ṣabtat
Nominal Features
- Com
- ADJ-Stat: ašarēdāku, kabtāku, šurruhāku, ṣīrāku, nâdāku, gešrāku, šitmurāku, ašṭāku, dandannāku, dannāku
- NOUN-Stat: labbāku, qarrādāku, zikarāku, bēlāku, uršānāku, šarrāku
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Stat: bēdāk, usbakūni, usbāku, našâku, tarruṣū
- Fem
- ADJ: rapšāte, balṭāti, kabitta, ma’atta, maʾatta, maʾattu, mādāte, rabītu, uhhuzāte, ṣīrāti
- DET: ammâte, šī, annâte, annīte, šuātu, šuāti, šuʾātu
- NOUN: māt, kiššati, maddattu, ummānāti, māti, šallas, ēkal, mātāti, išātāti, šēpī
- NUM: erbetta, erbetti
- PRON: ša, šina, ši, sa, ki, sina, si, š
- PROPN: Idiqlat, Ištar, Subnat, Zaban, Puratte, Zaba-šapliti, Tamti-rabite, Gula, Tamti-rabiti, Zaba-eliti
- VERB-Fin: ūṣâ, ušabbā, tamgur, ihillā, iṣuddā, luṣṣī, tahdu, tamahhar, tappardâ, taqbi
- VERB-Stat: ṣabtat, šuklulat, šuqallulat
- Masc
- ADJ: dannu, rabû, rabûti, rabê, danni, dannūte, dannūti, arkû, qardu, šaṭra
- DET: šū, šuātu, annî, annê, šuātunu, annâ, šuāte, šuʾātu, šâtu, šâtunu
- NOUN: šar, mār, šarru, ālāni, libbi, ilāni, šadê, bēli, alpī, kakkī
- NUM: ištēn, arba’i, šinip
- PRON: šu, šunu, sunu, su, sunūti, šunūti, ka, šuāšu, attunu, šunū
- PROPN: Aššur, Aššur-naṣir-apli, Adad-nerari, Tukulti-Ninurta, Enlil, Adad, Ninurta, Šamaš, Aššur-dan, Sin
- VERB: ikšud, ittallaku, īšû, iṣbutū, ēnah, šūzub, šaknū, ipēlu, luddiš, ṣali
- VERB-Fin: ikšud, ittallaku, īšû, iṣbutū, ēnah, ipēlu, luddiš, ipparšidūni, lutēr, ušekniša
- VERB-Inf: šūzub, epēš, arāk, šumʾud, uppušu, pêli, šuknuše, hulluq, napāhi, šapāri
- VERB-Stat: šaknū, ṣali, šaknu, bašû, labi, dān, hadāt, marṣi, nādi, šaknūni
- Plur
- ADJ: rabûti, dannūte, dannūti, rapšāte, balṭāti, balṭūti, mādāte, uhhuzāte, ṣīrāti, naṭûte
- DET: ammâte, annâte, šuātunu, šuātu, šâtunu
- NOUN: ālāni, ummānāti, ilāni, alpī, kakkī, mātāti, išātāti, šēpī, narkabāti, nišī
- PRON: šunu, sunu, šina, sunūti, šunūti, sina, attunu, šunū
- PROPN: Anunnaki, Igigi, Igigu
- VERB-Fin: iṣbutū, ipparšidūni, iṣṣabtū, ūrdūni, igdurrū, iplahū, irammū, lukinnū, uttērū, iqbû
- VERB-Stat: šaknū, ṣaprūni, bašû, labšū, tadnūni, šaknūni, šitkunū, ṭubbī
- Sing
- ADJ: dannu, rabû, rabê, danni, arkû, qardu, šaṭra, kabitta, ma’atta, labēru
- ADJ-Stat: ašarēdāku, kabtāku, šurruhāku, ṣīrāku, nâdāku, gešrāku, šitmurāku, ašṭāku, dandannāku, dannāku
- DET: šū, šuātu, šī, annīte, annî, annê, šuʾātu, annâ, šuāte, šuāti
- NOUN: šar, māt, mār, kiššati, šarru, libbi, maddattu, māti, bēli, šallas
- NOUN-Stat: labbāku, qarrādāku, zikarāku, bēlāku, uršānāku, šarrāku
- NUM: arba’i, erbetti, šinip
- PRON: šu, ya, ša, su, ī, ni, ši, anāku, anni, sa
- PROPN: zumri
- VERB: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Inf: šūzub, epēš, arāk, šumʾud, uppušu, pêli, šuknuše, hulluq, napāhi, šapāri
- VERB-Stat: bēde, bēdāk, ṣali, usbakūni, šaknu, labi, bašû, dān, ṣabtat, hadāt
- Acc
- ADJ: šaṭra, kabitta, lā pādâ, maʾatta, mahrâ, kabta, šurbâ, arattâ, duššupa, marulta
- DET: annâ
- NOUN: āla, būša, dīkta, maddatta, dūra, pâ, ēkalla, māhira, kussâ, libba
- PRON: mīna, šunū
- PROPN: Azira, Baza, Diduala, Ialligu, Matiate, Mehri, Sarbalia, Suru, Arantu
- Gen
- ADJ: rabê, danni, šitmuri, mahrê, peṣê, marṣi, elli, maʾdi, huššê, rabīte
- DET: annīte, annî, annê, šâtunu
- NOUN: kiššati, libbi, māti, bēli, siparri, šarrūti, muhhi, qereb, sihirti, ūme
- NUM: arba’i, erbetti
- PROPN: Nairi, Suhi, Laqe, Habhi, Habruri, Gilzani, Kalhi, Hatte, Puratte, Babite
- VERB-Inf: pêli, šuknuše, napāhi, šapāri, namāri, ba’āri, du’āki, epāše, esīr, lāmāri
- Loc
- ADP: baluš
- NOUN: limētuš, biriquš, qātuš
- Nom
- ADJ: dannu, rabû, arkû, qardu, labēru, marṣu, maʾattu, naʾdu, lā pādû, rabītu
- DET: šuātu, šâtu
- NOUN: šarru, maddattu, rubû, šadû, bēlu, biltu, ālu, eṭlu, kaspu, pīthallu
- PROPN: Kalhu, Nirbu, Šamšu, Hindanu, Lullu, Purattu, Arumu, Abu, Arahsamnu, Habhu
- VERB-Inf: rapāš
Degree and Polarity
Verbal Features
- Imp
- VERB-Fin: uddiš, tēr, balliṭ, dūku, epuš, hulliq, taʾer, uddiši, šutur
- Ind
- VERB: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- Prec
- VERB-Fin: luddiš, lutēr, lukinnū, luhalliqū, liskipū, lipšuš, liqqi, lušēšibū, līkimū, libriq
- Proh
- VERB-Fin: ušabbā, tapaššiṭ, ušamsak, ekkim, erraba, essir, eṣṣidi, ihassasa, ikassir, imahhar
- Past
- VERB: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- Pres
- VERB-Fin: ipaššiṭu, irammū, išemme, inabbû, iqabbû, išakkanu, išemmû, ukallūni, eppašu, išubbū
Pronouns, Determiners, Quantifiers
- 1
- ADJ-Stat: ašarēdāku, kabtāku, šurruhāku, ṣīrāku, nâdāku, gešrāku, šitmurāku, ašṭāku, dandannāku, dannāku
- NOUN-Stat: labbāku, qarrādāku, zikarāku, bēlāku, uršānāku, šarrāku
- PRON: ya, ī, ni, anāku, anni, wa, a, nim, ā, ayyâši
- VERB: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Stat: bēdāk, usbakūni, usbāku, našâku, tarruṣū
- 2
- PRON: ka, ki, attunu
- VERB-Fin: uddiš, tapaššiṭ, tušaršidu, tēr, balliṭ, dūku, epuš, hulliq, tabarri, tapaššiṭi
- VERB-Stat: hadāt
- 3
- DET: šū, šuātu, šī, šuʾātu, šuāte, šuāti
- NOUN: abullu, liātu, mušerbû
- PRON: šu, šunu, ša, sunu, su, šina, ši, sa, sunūti, šunūti
- VERB: bēde, ikšud, ittallaku, īšû, ittumuš, iṣbutū, ēnah, šaknū, ipēlu, ittumša
- VERB-Fin: ikšud, ittallaku, īšû, ittumuš, iṣbutū, ēnah, ipēlu, ittumša, luddiš, ipparšidūni
- VERB-Stat: bēde, šaknū, ṣali, šaknu, bašû, labi, dān, ṣabtat, marṣi, nādi
Other Features
- NounBase
- Bound
- ADJ: mithar
- NOUN: šar, māt, mār, ēkal, bīt, iššak, pān, āl, šakin, ebertān
- NUM: šinip
- VERB-Inf: šūzub, epēš, arāk, šumʾud, hulluq, nanmar, nasāh, zakār, šušmur, ṣabāt
- Free
- ADJ: Aššuru, ella
- DET: annê, šuātunu, annâ, šuātu, šâtu
- NOUN: kiššati, šarru, maddattu, libbi, siparri, išātāti, mātāti, ummānāti, rubû, narkabāti
- NUM: arba’i
- VERB-Inf: pêli, šuknuše, napāhi, šapāri, namāri, aṣbate, ba’āri, du’āki, epāše, lāmāri
- Suffixal
- NOUN: šallas, māti, bēli, šēpī, šum, šarrūti, sihirti, ummānāti, kakkī, bēlūti
- PROPN: zumri
- VERB-Inf: šitapru
- Terminal
- NOUN: bilat, mana, qāta
- Bound
- Subordinative
- Yes
- PART: ni
- VERB-Fin: ittallaku, īšû, apēlu, ipēlu, ipparšidūni, ikšudu, ipaššiṭu, irammū, inabbû, imhuru
- VERB-Stat: šaknū, usbakūni, šaknu, bašû, šaknūni, ṣaprūni, labšū, paqdu, rūqu, tadnūni
- Yes
- Ventive
- Yes
- VERB-Fin: ašlula, uṣabbita, ubla, ušekniša, utēra, ittumša, ētarba, alqâ, attumša, ūrdūni
- VERB-Stat: šuqallula
- Yes
- VerbStem
- D
- VERB: utēr, uṣabbita, ušarrih, ukīn, ussim, utēra, luddiš, unakkir, lutēr, umessi
- VERB-Fin: utēr, uṣabbita, ušarrih, ukīn, ussim, utēra, luddiš, unakkir, lutēr, umessi
- VERB-Inf: uppušu, hulluq, turru, bulluṭ, turri, uppuš, šunnê
- VERB-Stat: tarruṣū, ṭubbī
- Dt
- VERB-Fin: tugdaššara, uttakkaru, uštarrih
- VERB-Stat: šutarruhu
- G
- VERB-Fin: attumuš, akšud, adūk, assakan, aqqur, amhur, appul, aqṭirib, aṣbat, arṣip
- VERB-Inf: epēš, arāk, pêli, napāhi, šapāri, enû, namāri, nasāh, zakār, ṣabāt
- VERB-Stat: bēde, bēdāk, šaknū, ṣali, usbakūni, šaknu, bašû, labi, dān, ṣabtat
- Gt
- VERB-Fin: amdahhiṣi, amdahhiṣ, itabbalu, nimdahhiṣi, ītallu
- VERB-Inf: šitapru
- VERB-Stat: šitkunū
- Gtn
- VERB-Fin: ittallaku, altakkan, attallak, littasqar, ētappašu, attallaku, aštamdih, aštaqqi, iltakkanu, ilteʾʾû
- N
- VERB-Fin: ipparšidūni, attabalkat, ittaklū, ittabalkat, ibbalkitū, ittikil, ipparšidū, ittabalkutū, i’’abit, likkalmū
- VERB-Inf: nanmar
- Ntn
- VERB-Fin: ittatabalkitu
- S
- VERB: ušamqit, ušeklil, ušekniša, šūzub, ušezziz, ušēšib, ušaṣbit, ušātir, ušaṣbis, ušatlimū
- VERB-Fin: ušamqit, ušeklil, ušekniša, ušezziz, ušēšib, ušaṣbit, ušātir, ušaṣbis, ušatlimū, ušaškin
- VERB-Inf: šūzub, šumʾud, šuknuše, šušmur, šuknuši, šūzubu
- VERB-Stat: šuklulat, šuqallulat, šazzūni, šuparruru, šuqallula
- St
- VERB-Fin: uštennû, uštennâ
- Stn
- VERB-Fin: ultanapšaqā, ussakkil, uštanapšaqū
- D
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: anāku.
- This corpus does not contain auxiliaries.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (2)
- VERB--NOUN-Nom (1)
- VERB--PRON (1)
- VERB-Fin--NOUN (117)
- VERB-Fin--NOUN-Acc (3)
- VERB-Fin--NOUN-Nom (42)
- VERB-Fin--PRON (116)
- VERB-Inf--PRON (2)
- VERB-Stat--NOUN (3)
- VERB-Stat--NOUN-Acc (1)
- VERB-Stat--NOUN-Nom (17)
- VERB-Stat--PRON (44)
- obj
- VERB--NOUN (5)
- VERB-Fin--NOUN (1049)
- VERB-Fin--NOUN-ADP(ana) (1)
- VERB-Fin--NOUN-ADP(ina) (2)
- VERB-Fin--NOUN-ADP(ištu) (1)
- VERB-Fin--NOUN-ADP(ša) (1)
- VERB-Fin--NOUN-Acc (122)
- VERB-Fin--NOUN-Gen (7)
- VERB-Fin--NOUN-Gen-ADP(ammar) (2)
- VERB-Fin--NOUN-Nom (177)
- VERB-Fin--PRON (219)
- VERB-Fin--PRON-ADP(ana) (2)
- VERB-Fin--PRON-Acc (2)
- VERB-Inf--NOUN (16)
- VERB-Inf--NOUN-Gen (1)
- VERB-Inf--NOUN-Nom (1)
- VERB-Inf--PRON (2)
- VERB-Stat--NOUN (5)
- VERB-Stat--PRON (6)
- iobj
- VERB-Fin--PRON (3)
Relations Overview
- This corpus uses 4 relation subtypes: acl:relcl, advmod:emph, det:poss, nmod:poss
- The following 11 relation types are not used in this corpus at all: expl, dislocated, aux, clf, flat, compound, list, orphan, goeswith, reparandum, punct