UD Madi Jarawara
Language: Madi (code: jaa
)
Family: Arawan
This treebank has been part of Universal Dependencies since the UD v2.10 release.
The following people have contributed to making this treebank part of UD: Fabrício Ferraz Gerardi.
Repository: UD_Madi-Jarawara
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: grammar-examples
Questions, comments? General annotation questions (either Madi-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [fabricio • gerardi (æt) uni-tuebingen • de]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually, natively in UD style |
XPOS | not available |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
UD_Madi-Jarawara is a collection of annotated sentences in Madí (Jarawara dialect) from a variety of sources, including grammar examples, oral stories, didatic material, and dictionary examples.
UD_Madi-Jarawara is a collection of annotated sentences in Madí (Jarawara dialect) from a variety of sources, including grammar examples, oral stories, didatic material, and dictionary examples. The texts are annotated by Fabrício Ferraz Gerardi and Julia Sattler.
…
Acknowledgments
The development of the treebank is supported by the by European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 834050).
References
- Vogel, Alan (2003) Jarawara verb-classes. PhD dissertation. University of Pittsburg.
- Dixon, R. M. W (2004) The Jarawara Language of Southern Amazonia. Oxford University Press.
- Vogel, Alan (2005) Dicionário Jarawara-Portguês. SIL.
- Vogel, Alan (?) Jarawara Interlinear Texts. SIL.
- Vogel, Alan (2019) Jarawara Interlinear Texts. Volume 2. Anápolis: SIL-Brasil.
- Vogel, Alan (2021) A Typology of Finite Subordinate Clauses in Jarawara. SIL.
Statistics of UD Madi Jarawara
POS Tags
ADV – AUX – NOUN – PART – PRON – PUNCT – VERB – X
Features
Aspect – Cfm – Clusivity – Comt – Decl – Dist – Evident – Gender – Intension – NCount – Number – Number[obj] – Number[subj] – Person – Person[obj] – Person[psor] – Person[subj] – PronType – Redup – Report – Tense
Relations
advcl – advmod – amod – aux – compound – cop – dep – discourse – expl – nmod – nsubj – obj – obl – parataxis – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 20 sentences, 114 tokens and 115 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus does not contain words that contain both letters and punctuation.
- This corpus contains 1 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 1 types of multi-word tokens. Examples: kama.
Morphology
Tags
- This corpus uses 8 UPOS tags out of 17 possible: ADV, AUX, NOUN, PART, PRON, PUNCT, VERB, X
- This corpus does not use the following tags: PROPN, ADJ, DET, NUM, ADP, SCONJ, CCONJ, INTJ, SYM
- This corpus contains 2 word types tagged as particles (PART): oke, otaake
- This corpus contains 9 lemmas tagged as pronouns (PRON): _, e, era, me, nafi, otaa, otara, te, tiwa
- This corpus contains 0 lemmas tagged as determiners (DET):
- This corpus contains 2 lemmas tagged as auxiliaries (AUX): ama, na
- This corpus does not use the VerbForm feature.
Nominal Features
- Fem
- ADV: faro
- AUX: na, ni, kanemetemone, kawahi, naba, naminaba, naminabone, ona, onahara, tinahaboneke
- NOUN: ini
- PART: oke, otaake
- VERB: mita, Otafara, amosarake, awahabana, mitahani, omita, omitani, sawiminahaba, tabaminahaba, tamiba
- Masc
- NOUN: ino
- Plur
- AUX: tinaharo
- PART: otaake
- PRON: me, te, e, eke, era, teke
- VERB: sawiminahaba
- Sing
- AUX: tinahaboneke
- PART: oke
- PRON: e, eke
Degree and Polarity
Verbal Features
- Hab
- VERB: kitate
- Fut
- AUX: naba, naminaba, naminabone
- VERB: awahabana, sawiminahaba, tabaminahaba, tamiba
- IPast
- AUX: onahara
- VERB: Otafara, amosarake, mitahani, omitani
- RPast
- AUX: kanemetemone
- Fh
- AUX: onahara
- VERB: Otafara
- Nfh
- AUX: kanemetemone
- VERB: amosarake, mitahani, omitani
Pronouns, Determiners, Quantifiers
- Prs
- PRON: e, me
- VERB: sawiminahaba
- Tot
- PRON: nafi
- 1
- PART: oke
- PRON: e, eke, era
- 2
- AUX: tinahaboneke, tinaharo
- PRON: te, teke
- VERB: sawiminahaba
- 3
- PRON: me
Other Features
- Cfm
- Morning
- AUX: naminaba, naminabone
- VERB: sawiminahaba, tabaminahaba
- Morning
- Clusivity
- Ex
- PART: otaake
- PRON: otaa, otara
- In
- PRON: e, eke, era
- Ex
- Comt
- Yes
- AUX: kanemetemone, kawahi
- Yes
- Decl
- F
- AUX: amake, naminaba, naminabone, tinahaboneke
- PRON: eke, teke
- VERB: amosarake
- F
- Dist
- Yes
- VERB: tiwari
- Yes
- Intension
- Yes
- AUX: naminabone, tinahaboneke
- VERB: tamabote
- Yes
- NCount
- Yes
- VERB: nafi
- Yes
- Number[obj]
- Plur
- PRON: otara
- Sing
- PRON: Tiwa
- Plur
- Number[subj]
- Plur
- PRON: otaa, te
- Sing
- AUX: ona, onahara
- VERB: Otafara, amosarake, omita, omitani
- Plur
- Person[obj]
- 1
- PRON: otara
- 2
- PRON: Tiwa
- 1
- Person[psor]
- 1
- PRON: e
- 2
- PRON: te
- 1
- Person[subj]
- 1
- AUX: ona, onahara
- PRON: otaa
- VERB: Otafara, amosarake, omita, omitani
- 2
- PRON: te
- 1
- Redup
- Yes
- VERB: titi
- Yes
- Report
- Yes
- AUX: kanemetemone
- Yes
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: ama.
- This corpus uses 1 lemmas as auxiliaries (aux). Examples: na.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (3)
- VERB--PRON (12)
- obj
- VERB--NOUN (6)
- VERB--PRON (8)