UD Nayini AHA
Language: Nayini (code: nyq)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.7 release.
The following people have contributed to making this treebank part of UD: AmirHossein Mojiri Foroushani, Hamid Aghaei, Amir Ahmadi.
Repository: UD_Nayini-AHA
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: grammar-examples, spoken
Questions, comments? General annotation questions (either Nayini-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [amojiry (æt) gmail • com]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually, natively in UD style |
XPOS | annotated manually |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
The AHA Nayini Treebank is a small treebank for contemporary Nayini. The corpus was collected and annotated manually, based on interviews with Nayini speakers.
The Nayini treebank consists of 10 sentences at this stage, and we are working to enlarge the corpus. AHA is a small group that analyzes Iranian languages to find their similarities and differences.
Acknowledgments
These sentences were prepared with the help of the Hoyod people, whom we thank on behalf of the AHA group. Ms. Hanieh Mashayekhi also kindly helped us translate the sentences. As a starting point, we used the sentences suggested by the APLL (Academy of Persian Language and Literature) for collecting data on Iranian languages. This is a research project by AmirHossein, Hamid and Amir (AHA).
To cite this project, use the following reference:
- Mojiri Foroushani, AmirHossein; Aghaei, Hamid; Ahmadi, Amir (2020): “AHA Nayini dependency treebank”, Universal Dependencies (universaldependencies.org)
Statistics of UD Nayini AHA
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – NOUN – NUM – PRON – PUNCT – SCONJ – VERB
Features
Case – Degree – Mood – Number – NumType – Person – Polarity – PronType – Tense – VerbForm
Relations
advcl – advmod – amod – aux – case – cc – ccomp – compound – compound:lvc – flat – mark – nmod – nmod:poss – nsubj – nummod – obj – obl – punct – root
Tokenization and Word Segmentation
- This corpus contains 10 sentences and 78 tokens.
- This corpus contains 11 tokens (14%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus does not contain words that contain both letters and punctuation.
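Counts like the ones above can be recomputed from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `tokenization_stats` is hypothetical, and it assumes the standard 10-column CoNLL-U layout with `SpaceAfter=No` recorded in the MISC column. It counts syntactic words and skips multiword-token ranges and empty nodes, so its token count matches the page's only for treebanks without multiword tokens.

```python
def tokenization_stats(conllu_text):
    """Count sentences, words, and words not followed by a space."""
    sentences = 0
    tokens = 0
    no_space = 0
    in_sentence = False
    for line in conllu_text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            in_sentence = False
            continue
        if line.startswith("#"):
            # Sentence-level comment (sent_id, text, ...).
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        token_id = cols[0]
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in token_id or "." in token_id:
            continue
        if not in_sentence:
            sentences += 1
            in_sentence = True
        tokens += 1
        # MISC is the 10th column; attributes are separated by "|".
        if "SpaceAfter=No" in cols[9].split("|"):
            no_space += 1
    return {"sentences": sentences, "tokens": tokens, "no_space_after": no_space}
```

To use it, read a `.conllu` file as UTF-8 text and pass the contents to the function; the share of tokens without a following space is `no_space_after / tokens`.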
Morphology
Tags
- This corpus uses 11 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, NOUN, NUM, PRON, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: PROPN, DET, PART, INTJ, SYM, X
- This corpus contains 3 lemmas tagged as pronouns (PRON): او, م, مو
- This corpus contains no lemmas tagged as determiners (DET).
- This corpus contains 1 lemma tagged as an auxiliary (AUX): دار
- There is 1 (de)verbal form:
- VerbForm=Part
- VERB: بشتِیون, واجِن
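The tag inventory above can be checked against the 17 universal POS tags defined by UD. The sketch below is illustrative only: `upos_usage` is a hypothetical helper, and it assumes a 10-column CoNLL-U file in which UPOS is the fourth column.

```python
# The 17 universal part-of-speech tags defined by Universal Dependencies.
UPOS_TAGS = (
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
)

def upos_usage(conllu_text):
    """Return (sorted list of used UPOS tags, list of unused UPOS tags)."""
    used = set()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines (integer IDs).
        if len(cols) == 10 and cols[0].isdigit():
            used.add(cols[3])
    unused = [tag for tag in UPOS_TAGS if tag not in used]
    return sorted(used), unused
```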
Nominal Features
- Number=Plur
- NOUN: پلهها
- VERB-Part: واجِن
- Number=Sing
- ADV: رویی
- AUX: دارُن
- NOUN: اُو, بار, برنج, بنه, بِرا, تا, ساعت, سالی, صحبی, عبدولو
- PRON: م, مو, ُم, او, ُش
- VERB: بشتِیون, بِکَ, بینِدییه, ته, دِکفتُن, شو, شون, مِگا, نِشو, هاگیفت
- VERB-Part: بشتِیون
- Case=Loc
- ADV: تیگ
- Case=Tem
- ADV: ذِنُ, هِزِ
Degree and Polarity
- Degree=Pos
- ADJ: چِن
- Polarity=Neg
- VERB: بینِدییه, نِشو
Verbal Features
- Mood=Imp
- VERB: ته
- Mood=Sub
- VERB: شون
- Tense=Fut
- AUX: دارُن
- Tense=Past
- VERB: دِکفتُن, مِگا, هاگیفت
- Tense=Pres
- VERB: بِکَ, بینِدییه, شو, شون, نِشو, وا, واپوشن, کِرو
Pronouns, Determiners, Quantifiers
- PronType=Dem
- PRON: مو
- PronType=Prs
- PRON: م, ُم, او, مو, ُش
- NumType=Card
- NUM: دِه, یک
- Person=1
- AUX: دارُن
- PRON: م, ُم, مو
- VERB: بشتِیون, بینِدییه, دِکفتُن, شون, مِگا, هاگیفت, واپوشن
- VERB-Part: بشتِیون
- Person=2
- VERB: ته
- Person=3
- PRON: او, ُش
- VERB: بِکَ, شو, نِشو, وا, واجِن, کِرو
- VERB-Part: واجِن
Other Features
Syntax
Auxiliary Verbs and Copula
- This corpus does not contain copulas.
- This corpus uses 1 lemma as an auxiliary (aux). Example: دار.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (4)
- VERB--PRON (4)
- obj
- VERB--NOUN (3)
- VERB--PRON (1)
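The parent and child counts above can be approximated by pairing each dependent with the UPOS tag of its head. This is a minimal sketch under stated assumptions, not the tool that produced these numbers: `core_argument_pairs` is a hypothetical name, only exact `nsubj` and `obj` labels are matched (subtypes such as `nsubj:pass` are ignored), and "nouns or pronouns" is taken to mean the tags NOUN and PRON.

```python
from collections import Counter

def core_argument_pairs(conllu_text, relations=("nsubj", "obj")):
    """Count (relation, 'VERB--CHILD') pairs for noun and pronoun dependents."""
    counts = Counter()
    sentence = []

    def flush():
        # Map each word ID to its UPOS tag so heads can be looked up.
        upos_by_id = {cols[0]: cols[3] for cols in sentence}
        for cols in sentence:
            child_upos, head, deprel = cols[3], cols[6], cols[7]
            parent_upos = upos_by_id.get(head)
            if (deprel in relations and parent_upos == "VERB"
                    and child_upos in ("NOUN", "PRON")):
                counts[(deprel, "VERB--" + child_upos)] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends the sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            sentence.append(cols)
    flush()  # handle a final sentence without a trailing blank line
    return counts
```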