UD Uzbek UT
Language: Uzbek (code: uz
)
Family: Turkic
This treebank has been part of Universal Dependencies since the UD v2.15 release.
The following people have contributed to making this treebank part of UD: Arofat Akhundjanova.
Repository: UD_Uzbek-UT
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: news, fiction
Questions, comments? General annotation questions (either Uzbek-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [arak00001 (æt) stud • uni-saarland • de]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
Annotation | Source |
---|---|
Lemmas | assigned by a program, with some manual corrections, but not a full manual verification |
UPOS | assigned by a program, with some manual corrections, but not a full manual verification |
XPOS | not available |
Features | assigned by a program, with some manual corrections, but not a full manual verification |
Relations | assigned by a program, with some manual corrections, but not a full manual verification |
Description
This is the first Uzbek UD treebank.
Uzbek-UT consists of 500 sentences, 250 of which were sourced from news articles and 250 from fiction. These two genres can be told apart with sentence IDs, meaning that the first half of the treebank belongs to news articles and the second half to fiction. Tokenization and lemmatization were carried out automatically. POS tags and dependency relations were annotated semi-automatically with full manual corrections.
References
- (citation)
Acknowledgments
Statistics of UD Uzbek UT
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – SYM – VERB – X
Features
Abbr – Aspect – Case – Degree – Foreign – Mood – Number – NumType – Person – Polarity – Poss – PronType – Reflex – Tense – VerbForm – Voice
Relations
acl – advcl – advmod – amod – appos – aux – case – cc – ccomp – compound – compound:lvc – conj – cop – csubj – det – discourse – expl – fixed – flat – iobj – mark – nmod – nmod:poss – nsubj – nummod – obj – obl – parataxis – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 500 sentences and 5850 tokens.
- This corpus contains 848 tokens (14%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 504 types of words that contain both letters and punctuation. Examples: bo‘lib, yo‘q, o‘z, O‘zbekiston, bo‘lgan, bo‘yicha, o‘zi, bo‘ldi, o‘sha, yo‘l, ko‘p, so‘ng, bo‘lmaydi, bo‘lsa, e’lon, ko‘ra, ko‘rdi, ko‘z, ko‘zga, ma’lum, o‘tdi, o‘tgan, o‘tkazildi, qo‘ydi, so‘mlik, yomg‘ir, Erdo‘g‘an, Wi-Fi, bag‘rida, ba’zan, ba’zi, bo'lgan, bo'lsa, bo‘lar, bo‘ylab, g‘o‘za, ilg‘ab, ko‘chib, ko‘l, ko‘m-ko‘k, ko‘rgan, ko‘rib, ko‘rinadi, ko‘rsatgan, ko‘zlari, ma’naviyat, noto‘g‘ri, og‘ir, o‘g‘li, o‘n
Morphology
Tags
- This corpus uses 17 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, SYM, VERB, X
- This corpus contains 15 word types tagged as particles (PART): Axir, barpo, bunyod, duch, g‘oyib, hal, halok, ham, nahot, paydo, sodir, tashkil, voz, xuddi, zabt
- This corpus contains 35 lemmas tagged as pronouns (PRON): Nuh, barcha, bir-biri, biz, bola, bu, hamma, hammayoq, kim, kimdir, mana, men, menga, ne, necha, nechta, neki, nima, oʻz, oʻzi, o‘sha, o‘z, qanaqa, qancha, qanday, qay, qayer, qayoq, qaysi, sen, shu, shunday, siz, u, ular
- This corpus contains 24 lemmas tagged as determiners (DET): ayrim, barcha, bari, ba’zi, bir, biror, bu, bunday, butun, haligi, hamma, har, hech, jami, mana, marta, mazkur, o‘sha, shu, shunaqa, shuncha, shunday, u, ushbu
- Out of the above, 8 lemmas occurred sometimes as PRON and sometimes as DET: barcha, bu, hamma, mana, o‘sha, shu, shunday, u
- This corpus contains 8 lemmas tagged as auxiliaries (AUX): bo'l, edi, ekan, emas, kerak, lozim, mumkin, yoʻq
- There are 5 (de)verbal forms:
- Conv
- VERB: bo‘lib, olib, deb, qilib, borib, chiqib, kelib, kirib, ishlab, yetib
- Fin
- AUX: mumkin, edi, bo'ladi, bo'lgan, kerak
- VERB: berdi, qildi, ega, bo‘ldi, qiladi, etdi, ketdi, beradi, keldi, oldi
- Inf
- VERB: beradi, boshlasalar, bo‘lsa, olmasalar
- Part
- VERB: ketgan, bergan, bo‘lgan, kelayotgan, kelgan, qilgan, yurgan, keladigan, ketayotgan, ko‘rgan
- Vnoun
- VERB: qilish, berish, etish, olish, o‘tish, saqlash, etilishi, aniqlash, aniqlashi, ayirboshlash
Nominal Features
- Plur
- ADJ: boshqalarga, sovuqlarida
- ADV: Olis-olislardagi
- AUX: edilar, ekanligimizga, ekanmiz
- NOUN: ishlar, odamlar, bolalar, davlatlar, kunlari, muzokaralar, qushlar, rahbarlari, vakillari, voqealar
- PRON: ular, bizning, ularning, biz, bular, o‘zlari, ularga, Bizda, Bizlar, Bolalarni
- PROPN: Dekabrchilar, Oyshalarning
- VERB: kelishsa, Yig‘ilganlar, aytadilar, aytishlaricha, bermadilar, boramiz, borishdi, boshladilar, boshlasalar, boshlashadi
- VERB-Fin: aytadilar, bermadilar, boramiz, borishdi, boshladilar, boshlashadi, bo‘lamiz, bo‘larmikinlar, bo‘ldilar, bo‘lishadi
- VERB-Inf: boshlasalar, olmasalar
- VERB-Part: Yig‘ilganlar
- Sing
- AUX: edim, ekanman, bo'lsa, ekani, emasligi
- NOUN: kuni, nafar, odam, yil, davlat, prezidenti, davom, doirasida, qishloq, xabar
- PRON: men, o‘z, uning, o‘zi, u, unga, meni, seni, uni, menga
- PROPN: O‘zbekiston, Toshkent, Rossiya, Ukraina, Koreya, Samarqand, Toshkentda, AQSh, Amerika, Asqar
- VERB: ketdi, berdim, qildi, Shoshmang, anglamadi, aylantirsin, aytsam, beray, bersam, biladi
- VERB-Fin: ketdi, berdim, qildi, Shoshmang, anglamadi, aylantirsin, beray, biladi, bilarmish, bordi
- VERB-Part: bilganim, etayotgani, o‘qiyotganimni
- VERB-Vnoun: ochishim
- Abl
- ADV: Olislardan, oldindan
- NOUN: tomonidan, boshidan, oldidan, orasidan, Agentlikdan, Oradan, Toshlardan, Trubkadan, a’zolaridan, balandlikdan
- PRON: undan, Shundan, Ulardan, bir-biridan, mendan, shulardan
- PROPN: Qipchoqdan, Xudodan, Yaponiyadan
- VERB: boshlanganidan, bo‘ysunishdan, kelishdan, kelishini, o‘sganligidan, o‘tkazilishidan, shoshilmasdan, shoshmasdan, sug‘urtirishdan, tanishtirgandan
- VERB-Conv: shoshilmasdan
- VERB-Part: boshlanganidan, tanishtirgandan
- VERB-Vnoun: bo‘ysunishdan, kelishdan, kelishini, o‘sganligidan, o‘tkazilishidan, sug‘urtirishdan, yorishmasdan
- Acc
- AUX: ekanini, ekanligini
- NOUN: darajasini, faoliyatini, imkonini, joylarni, kitobini, shijoatni, tadbirni, tilini, Maktabni, Tilni
- PRON: seni, uni, meni, o‘zini, Bolalarni, Bularni, Buni, Hammasini, Ularni, nimalarnidir
- PROPN: Akramjonni, Arishevni, Hojini, Karpni, Lolani, O‘lmasboyni, Qur’on, Xersonni, Xizrni
- VERB: yo‘qligini, berayotganini, berganini, berishni, bermaganini, bo‘lishini, bo‘lishni, chiqarishni, cho‘kishni, kuyinishini
- VERB-Part: berayotganini, berganini, bermaganini, o‘qiyotganimni, qilganini, qilinganini
- VERB-Vnoun: berishni, bo‘lishini, bo‘lishni, chiqarishni, cho‘kishni, kuyinishini, oshirishni, o‘rnatishni, shovullashini, suyunishini
- Dat
- ADJ: kulrangga, boshqalarga
- ADV: avvaliga
- AUX: ekaniga, mumkinligiga, ekanligimizga
- NOUN: foizga, ishga, ko‘zga, oldiga, qarorga, amalga, boshiga, evaziga, hayotiga, odamga
- NUM: biriga
- PRON: unga, bunga, menga, o‘ziga, ularga, O‘zingga, barchalarimizga, kimga, nimalargadir, qayoqqa
- PROPN: O‘zbekistonga, Samarqandga, AQSHga, AQShga, Germaniyaga, G‘anijonga, Italiyaga, Koreyaga, Ma’rifatxonga, Oʻzbekistonga
- VERB: yurganga, borilishiga, bo‘lishiga, chaqirishga, eksportga, erishishga, etishga, foydalanishga, ishlashga, koronavirusga
- VERB-Vnoun: borilishiga, bo‘lishiga, chaqirishga, erishishga, etishga, foydalanishga, ishlashga, olishga, oshirishga, o’rganishga
- Gen
- NOUN: dunyoning, yilning, bozorining, haydovchilarning, mahsulotning, mamlakat, markazi, odamlarning, so‘fining, yil
- PRON: uning, o‘z, bizning, ularning, buning, mening, barchamizning, kimningdir, o‘zimning, o‘zining
- PROPN: O‘zbekiston, Abdulhakimning, Argentinaning, Davronning, Erdo‘g‘anning, E’zozaning, Germaniyaning, Hamidaning, Jalolovning, Jayxunning
- Loc
- ADJ: yuksakda, sovuqlarida
- ADV: Nariroqda, yuqorida, Olis-olislardagi
- NOUN: doirasida, yerda, kunda, oqibatida, tumanida, viloyatida, bag‘rida, bahsda, boshida, davomida
- NUM: 17.50da, mingdan
- PRON: unda, Bizda, Qayerlardadir, o‘shanda, o‘zida, ularda
- PROPN: Toshkentda, AQSHdagi, Buxoroda, Eronda, Hongkongda, Iroqda, Isroilda, Istanbuldagi, Ka’bada, Kolumbiyada
- VERB-Vnoun: qilishda, tushganida
- Nom
- AUX: ekani, emasligi
- NOUN: kuni, nafar, odam, davlat, davom, prezidenti, yil, qishloq, xabar, axborot
- PRON: men, o‘zi, oʻz, u, ular, kim, o‘z, o‘zlari, Bizlar, Bu
- PROPN: O‘zbekiston, Toshkent, Rossiya, Koreya, Samarqand, Ukraina, AQSh, Amerika, Asqar, Buxoro
- VERB: tejovchi
Degree and Polarity
- Cmp
- ADJ: erkaroq, halokatliroq, keksaroq, pastroq
- ADV: Avvalroq
- Neg
- AUX: emas, emasmi, yoʻq
- VERB: yo‘q, bo‘lmaydi, bilmagan, ketmas, anglamadi, bermaganini, bilmay, bilolmaydi, chiqavermaydi, chiqmay
- VERB-Conv: bilmay, chiqmay, kirmay, ochmasdan, o‘tmay, qilmasdan, qoqmay, shoshilmasdan, sig‘may
- VERB-Fin: bo‘lmaydi, anglamadi, bilolmaydi, chiqavermaydi, foydalanmagan, ketmayaptimi, kutilmaydi, olmaydi, ololmaydi, o‘tolmaymiz
- VERB-Part: bilmagan, bermaganini, kelmaydigan, ko‘rmagan, ko‘rsatilmagan, kutilmagan, otmagan
Verbal Features
- Prog
- VERB: bormoqda, etmoqda, oshirilmoqda, o‘tmoqda, qilmoqda, bormoqdaman, bo‘lmoqda, chiyillar, cho‘kmoqda, ettirmoqda
- VERB-Fin: bormoqda, etmoqda, oshirilmoqda, o‘tmoqda, qilmoqda, bormoqdaman, bo‘lmoqda, cho‘kmoqda, ettirmoqda, foydalanilmoqda
- Cnd
- VERB: kelishsa, kelsa, aytsam, bersam, borsangiz, boshlasalar, bo‘lsa, chiqsa, ekilsa, ko`rsatsa
- VERB-Fin: borsangiz, kelishsa, ko`rsatsa
- VERB-Inf: boshlasalar, bo‘lsa, olmasalar
- Imp
- VERB-Fin: Shoshmang, aylantirsin, berilsin, ketmasin, qilmang, qilsin, turaqol
- Ind
- VERB: berdi, qildi, bo‘ldi, qiladi, beradi, etdi, ketdi, keldi, oldi, boshladi
- VERB-Fin: berdi, qildi, bo‘ldi, qiladi, etdi, ketdi, beradi, keldi, oldi, boshladi
- VERB-Inf: beradi
- VERB-Part: berilgan, kelayotgandi, yozilgan
- Int
- VERB: borasanmi, bormi, bo‘ladimi, bo‘larmikinlar, egami, ketmayaptimi, ko‘rdi, ko‘rganmisiz, qilganmi, qilmadingiz
- VERB-Fin: bormi, bo‘ladimi, bo‘larmikinlar, ketmayaptimi, ko‘rdi, ko‘rganmisiz, qilganmi, qilmadingiz, qilyaptiykin
- Opt
- VERB-Fin: beray
- Pot
- VERB-Fin: bo‘lmaydi, ololmaydi
- Fut
- AUX-Fin: bo'ladi
- VERB: beradi, qiladi, oladi, aniqlaydi, aylantirsin, aytadilar, aytsam, beriladi, berilsin, bersam
- VERB-Fin: beradi, qiladi, oladi, aniqlaydi, aylantirsin, aytadilar, beriladi, berilsin, bilolmaydi, boramiz
- VERB-Inf: beradi, boshlasalar, olmasalar
- Past
- AUX: edi, edim, edilar, bo'lgan
- AUX-Fin: edi, bo'lgan
- VERB: berdi, qildi, bo‘ldi, etdi, ketdi, keldi, oldi, qilindi, boshladi, bo‘lgan
- VERB-Fin: berdi, qildi, bo‘ldi, etdi, ketdi, keldi, oldi, qilindi, boshladi, bo‘lgan
- VERB-Part: berilgan, keltirilgan, yozilgan
- Pres
- AUX: ekanligini, kerak
- AUX-Fin: kerak
- VERB: bor, ega, qiladi, mavjud, qilmoqda, qoladi, bildiradi, bormoqda, bo‘lmaydi, etadi
- VERB-Conv: qisqarib
- VERB-Fin: ega, qiladi, qilmoqda, qoladi, bildiradi, bormoqda, bo‘lmaydi, etadi, etmoqda, hisoblanadi
- VERB-Part: kelayotgan, etayotgani, kelayotgandi, yaqinlashayotgan
- Pass
- VERB: qilindi, berilgan, etildi, olingan, o‘tkazildi, qilingan, etiladi, kiritildi, oshirilmoqda, qilinadi
- VERB-Fin: qilindi, etildi, o‘tkazildi, etiladi, kiritildi, olingan, oshirilmoqda, qilinadi, qurilgan, taqdirlandi
- VERB-Part: berilgan, qilingan, qo‘yilgan, yaratilgan, yozilgan, belgilangan, bildirilgan, bitilgan, kelinadigan, keltirilgan
- VERB-Vnoun: borilishiga, etilishi, tasdiqlanishi
Pronouns, Determiners, Quantifiers
- Dem
- ADV: shundoq
- DET: bu, shu, ushbu, mazkur, o‘sha, shunday, u, Mana, bunday, shunaqa
- PRON: bu, mana, shu, u, bunga, Shundan, bular, buning, shunday, Bularni
- Ind
- DET: bir
- PRON: kimningdir
- Int
- ADV: Nega, qanday, naqadar, ne, nimagadir, qandoq
- PRON: nima, kim, kimdir, ne, necha, neki, qanaqa, qancha, qanday, qay
- Neg
- DET: hech
- Prs
- PRON: men, o‘z, uning, u, ular, o‘zi, unga, bizning, meni, seni
- Rcp
- PRON: bir-biridan
- Rel
- PRON: qanday
- Tot
- DET: barcha, har, hamma, Butun, ayrim, bari, jami
- PRON: hamma, Hammasini, barchalarimizga, barchamizning, hammayoqni
- Card
- ADV: birgina
- NUM: bir, biri, ikki, 12, 1, 10, 2020, 4, mln, 100
- Frac
- NUM: 10,1
- Ord
- NUM: birinchi, Ikkinchi
- Range
- NUM: 4-16, 8-10, ikki-uchta, milliardlab, yetti-sakkiz, yuzlab
- Sets
- NUM: ikkalasi, ikkovi
- Yes
- NOUN: davlat, dunyoning, kompaniyasi, yilning, Respublikasi, bozorining, gul, mahsulotning, mamlakat, markazi
- PRON: uning, o‘z, ularning, bizning, buning, mening, barchamizning, kimningdir, o‘zi, o‘zimning
- PROPN: O‘zbekiston, Abdulhakimning, Argentinaning, Davronning, Erdo‘g‘anning, E’zozaning, Germaniyaning, Hamidaning, Jalolovning, Jayxunning
- Yes
- PRON: o‘zi, oʻzi, o‘ziga
- 1
- ADJ: hayronman
- AUX: edim, ekanman, ekanligimizga
- PRON: men, bizning, meni, menga, mening, Biz, Bizda, Bizlar, O‘zim, mendan
- VERB: berdim, aytsam, beray, bersam, bilganim, boramiz, bordim, bormoqdaman, bo‘lamiz, bo‘ldim
- VERB-Fin: berdim, beray, boramiz, bordim, bormoqdaman, bo‘lamiz, bo‘ldim, chekdim, kelganimda, ko‘rganman
- VERB-Part: bilganim, o‘qiyotganimni
- VERB-Vnoun: ochishim
- 2
- PRON: seni, sizga, Sen, siz
- VERB: Shoshmang, borasanmi, borsangiz, eshitasiz, qolasiz, tayyorladingiz, turaqol
- VERB-Fin: Shoshmang, borsangiz, eshitasiz, qolasiz, tayyorladingiz, turaqol
- 3
- AUX: bo'lsa, edilar, bo'ladi, bo'lgan, ekanmiz
- AUX-Fin: bo'ladi, bo'lgan
- PRON: o‘z, uning, u, ular, o‘zi, unga, ularning, uni, oʻz, biz
- VERB: berdi, qildi, bo‘ldi, qiladi, beradi, etdi, ketdi, keldi, oldi, boshladi
- VERB-Conv: qisqarib
- VERB-Fin: berdi, qildi, bo‘ldi, qiladi, etdi, ketdi, beradi, keldi, oldi, boshladi
- VERB-Inf: beradi, boshlasalar, bo‘lsa, olmasalar
- VERB-Part: kelayotgan, etayotgani, kelayotgandi, yaqinlashayotgan, yoqqani
- VERB-Vnoun: berishi, topshirishiga
Other Features
- Abbr
- Yes
- NOUN: AJ, IIBB, IShID, JKning, OAV, OKMK, ZTE
- NUM: mln
- PROPN: A., BBC, AQSH, AQSHdagi, AQSHga, AQSh, AQShga, KXDR, MDH, Oʻzbekistonga
- Yes
- Foreign
- Yes
- ADJ: Geographic, National
- NOUN: Wi-Fi, Air, Bluetooth, Dream, France, House, Stroy
- PROPN: Mail.ru, Apple
- Yes
Syntax
Auxiliary Verbs and Copula
- This corpus uses 3 lemmas as copulas (cop). Examples: edi, ekan, emas.
- This corpus uses 8 lemmas as auxiliaries (aux). Examples: edi, mumkin, bo'l, emas, kerak, ekan, lozim, yoʻq.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN-Nom (30)
- VERB--PRON (1)
- VERB--PRON-Nom (2)
- VERB-Conv--NOUN (1)
- VERB-Conv--NOUN-Nom (63)
- VERB-Conv--PRON (4)
- VERB-Conv--PRON-Nom (3)
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-Nom (208)
- VERB-Fin--PRON (11)
- VERB-Fin--PRON-Nom (13)
- VERB-Part--NOUN (2)
- VERB-Part--NOUN-Nom (20)
- VERB-Part--PRON-Acc (1)
- VERB-Part--PRON-Nom (1)
- VERB-Vnoun--NOUN (1)
- VERB-Vnoun--NOUN-Gen (1)
- VERB-Vnoun--NOUN-Loc (1)
- VERB-Vnoun--NOUN-Nom (13)
- VERB-Vnoun--NOUN-Nom-ADP(bilan) (1)
- VERB-Vnoun--PRON (1)
- obj
- VERB--NOUN-Acc (6)
- VERB--NOUN-Dat (1)
- VERB--NOUN-Nom (7)
- VERB--PRON (1)
- VERB--PRON-Acc (2)
- VERB-Conv--NOUN-Abl (2)
- VERB-Conv--NOUN-Acc (35)
- VERB-Conv--NOUN-Nom (11)
- VERB-Conv--PRON-Acc (4)
- VERB-Conv--PRON-Dat (1)
- VERB-Conv--PRON-Nom (1)
- VERB-Fin--NOUN-Abl (3)
- VERB-Fin--NOUN-Acc (50)
- VERB-Fin--NOUN-Dat (12)
- VERB-Fin--NOUN-Dat-ADP(qarata) (1)
- VERB-Fin--NOUN-Nom (18)
- VERB-Fin--NOUN-Nom-ADP(bilan) (1)
- VERB-Fin--PRON (4)
- VERB-Fin--PRON-Acc (14)
- VERB-Fin--PRON-Dat (1)
- VERB-Fin--PRON-Nom (1)
- VERB-Part--NOUN-Acc (7)
- VERB-Part--NOUN-Nom (5)
- VERB-Part--PRON-Acc (1)
- VERB-Vnoun--NOUN-Acc (13)
- VERB-Vnoun--NOUN-Dat (2)
- VERB-Vnoun--NOUN-Nom (12)
- VERB-Vnoun--PRON-Acc (2)
- iobj
- VERB--NOUN-Dat (1)
- VERB--PRON-Dat (2)
- VERB--PRON-Nom (1)
- VERB-Conv--NOUN-Dat (7)
- VERB-Conv--NOUN-Nom (1)
- VERB-Conv--PRON-Dat (2)
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-Abl (2)
- VERB-Fin--NOUN-Dat (27)
- VERB-Fin--NOUN-Nom-ADP(bilan) (1)
- VERB-Fin--PRON-Dat (5)
- VERB-Part--NOUN-Dat (6)
- VERB-Vnoun--NOUN-Dat (3)
- VERB-Vnoun--NOUN-Dat-ADP(qarshi) (1)
- VERB-Vnoun--PRON-Dat (1)
Verbs with Reflexive Core Objects
- This corpus contains 1 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: qabul o‘ziga
Relations Overview
- This corpus uses 2 relation subtypes: compound:lvc, nmod:poss
- The following 7 relation types are not used in this corpus at all: dislocated, clf, list, orphan, goeswith, reparandum, dep