UD Coptic Scriptorium
Language: Coptic (code: cop
)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v1.4 release.
The following people have contributed to making this treebank part of UD: Mitchell Abrams, Elizabeth Davidson, Amir Zeldes.
Repository: UD_Coptic-Scriptorium
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY 4.0
Genre: bible, fiction, nonfiction
Questions, comments? General annotation questions (either Coptic-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [amir • zeldes (æt) georgetown • edu]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually in non-UD style, automatically converted to UD |
XPOS | annotated manually |
Features | assigned by a program, not checked manually |
Relations | annotated manually, natively in UD style |
Description
UD Coptic contains manually annotated Sahidic Coptic texts, including Biblical texts, sermons, letters, and hagiography.
The Coptic Universal Dependency Treebank is a manually annotated corpus of Sahidic Coptic texts, currently containing excerpts from the Sahidic New Testament Gospel of Mark, Works by Archmandrite Shenoute of Atripe, the Letters of Besa, lives of Sts. Cyrus and Onnophrius, Epistle of Pseudo-Ephrem, the Dormition of John the Apostle and short stories from the Apophthegmata Patrum (Sayings of the Desert Fathers). Detailed information about the treebank is available here:
http://copticscriptorium.org/treebank.html
The data was digitized or previously available in digital format, and annotated manually for part of speech in the project Coptic Scriptorium. For individual credit and further information see:
http://copticscriptorium.org/
Coptic POS tags come from the Coptic Scriptorium tag set, which is available from the project and treebank websites.
Acknowledgments
The underlying POS tagged material was produced as part of the projects Coptic Scriptorium, KOMeT and KELLIA, funded by the NEH in the USA and BMBF and DFG in Germany (see http://copticscriptorium.org/ for more details). Treebank annotation was done mainly by Mitchell Abrams, Liz Davidson and Amir Zeldes. Thanks are also due to Israel Avrahamy, Asael Benyami, Yinon Kahan and Oran Szachter for their contributions.
Statistics of UD Coptic Scriptorium
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
Definite – Foreign – Gender – Gender[psor] – Number – Number[psor] – NumType – Person – Polarity – Poss – PronType – Reflex – VerbForm
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – case – cc – ccomp – compound – conj – cop – csubj – dep – det – discourse – dislocated – expl – fixed – flat – iobj – mark – nmod – nmod:poss – nmod:unmarked – nsubj – nummod – obj – obl – obl:unmarked – orphan – parataxis – punct – reparandum – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 2203 sentences, 26837 tokens and 57098 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 11 types of words that contain both letters and punctuation. Examples: ·ⲻ, .....ⲟ..., [....]ⲥ, [...]ϥ, ϩⲏ[..]ⲉ, ⲉ......ⲙ...., ⲉ[.....], ⲉⲃ[........], ⲟⲩⲇ[.......], ⲡ[…]ⲡⲟⲥ, ⲡⲁ[…]ϥⲟϭⲉ
- This corpus contains 16846 multi-word tokens. On average, one multi-word token consists of 2.80 syntactic words.
- There are 9028 types of multi-word tokens. Examples: ⲛⲁϥ, ⲙⲙⲟⲥ, ⲙⲙⲟϥ, ⲉⲣⲟϥ, ⲡⲉϫⲁϥ, ⲙⲡⲛⲟⲩⲧⲉ, ⲛⲁⲩ, ⲛⲧϩⲉ, ⲛⲁⲓ, ⲙⲙⲟⲟⲩ, ⲧⲏⲣⲟⲩ, ⲉϥϫⲱ, ⲛϩⲏⲧϥ, ⲛⲁⲕ, ⲉⲣⲟⲓ, ⲉⲣⲟⲟⲩ, ⲙⲙⲟⲕ, ⲛⲏⲧⲛ, ⲉⲧⲙⲙⲁⲩ, ⲉⲧⲟⲩⲁⲁⲃ, ⲛⲁⲥ, ⲉⲣⲟⲕ, ⲛⲧⲉⲩⲛⲟⲩ, ⲛϩⲏⲧ, ⲛⲧⲉⲓϩⲉ, ⲛⲙⲙⲁϥ, ⲁϥⲉⲓ, ⲙⲡϫⲟⲉⲓⲥ, ⲛⲁⲛ, ⲛⲣⲱⲙⲉ, ⲁϥⲃⲱⲕ, ⲛⲟⲩⲱⲧ, ⲉⲧⲃⲉⲡⲁⲓ, ⲡⲛⲟⲩⲧⲉ, ⲧⲏⲣϥ, ⲙⲙⲟⲓ, ⲙⲡⲣⲱⲙⲉ, ⲛⲥⲱϥ, ⲡⲉⲭⲣⲓⲥⲧⲟⲥ, ⲉⲣⲟⲛ, ⲛϩⲏⲧⲟⲩ, ⲡⲟⲩⲁ, ⲛϩⲟⲟⲩ, ⲛⲧⲉⲡⲛⲟⲩⲧⲉ, ⲁϥϫⲟⲟⲥ, ⲁⲩⲉⲓ, ⲉⲣⲱⲧⲛ, ⲛϩⲟⲩⲟ, ⲡⲉϫⲁⲥ, ⲡⲁϣⲏⲣⲉ.
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: INTJ, SYM
- This corpus contains 42 word types tagged as particles (PART): ϩⲁⲙⲏⲛ, ϩⲏⲏⲛⲉ, ϩⲏⲏⲧⲉ, ϩⲛ, ϭⲉ, ⲁϩⲉ, ⲁϫⲛ, ⲁⲛⲧⲓ, ⲁⲣⲁ, ⲅⲁⲣ, ⲇⲉ, ⲉ, ⲉϩⲉ, ⲉⲓⲉ, ⲉⲓⲥ, ⲉⲛⲉ, ⲉⲛⲧ, ⲉⲣⲉ, ⲉⲧⲃⲉ, ⲙ, ⲙⲉⲛ, ⲙⲙⲟ, ⲙⲙⲟⲛ, ⲙⲛ, ⲙⲛⲛⲥⲁ, ⲛ, ⲛϭⲉ, ⲛϭⲓ, ⲛⲁ, ⲛⲉ, ⲛⲧ, ⲛⲧⲉ, ⲟⲩⲇⲉ, ⲟⲩⲛ, ⲟⲩⲟⲉⲓ, ⲟⲩⲟⲓ, ⲡⲉ, ⲣⲱ, ⲥⲉ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲱ
- This corpus contains 70 lemmas tagged as pronouns (PRON): ϩⲁ_ⲛⲧⲟ, ϩⲁϩⲧⲛ, ϩⲓϫⲛ_ⲛⲧⲟ, ϩⲓⲧⲛ_ⲁⲛⲟⲕ, ϩⲛ_ⲁⲛⲟⲕ, ϩⲱ_ⲁⲛⲟⲕ, ϩⲱⲱ_ⲁⲛⲟⲕ, ϫⲓ_ⲁⲛⲟⲕ, ϫⲡⲟ_ⲛⲧⲟ, ⲁ, ⲁ_ⲛⲧⲟ, ⲁϣ, ⲁⲛⲟⲕ, ⲁⲛⲟⲕ_ⲛⲧⲉ, ⲁⲛⲟⲛ, ⲁⲟⲩⲏⲣ, ⲅ, ⲉ_ⲛⲧⲟ, ⲉϫⲛ_ⲛⲧⲟ, ⲉⲓ, ⲉⲕⲉ, ⲉⲛⲉ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲕ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲛ, ⲉⲣϣⲁⲛ_ⲛⲧⲟ, ⲉⲣϣⲁⲛ_ⲛⲧⲟϥ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲕ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲟⲩ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲥ, ⲉⲣϣⲁⲛ_ⲛⲧⲱⲧⲛ, ⲉⲣⲉ_ⲁⲛⲟⲕ, ⲉⲣⲉ_ⲁⲛⲟⲛ, ⲉⲣⲉ_ⲛⲧⲟ, ⲉⲣⲉ_ⲛⲧⲟϥ, ⲉⲣⲉ_ⲛⲧⲟⲕ, ⲉⲣⲉ_ⲛⲧⲟⲟⲩ, ⲉⲣⲉ_ⲛⲧⲟⲥ, ⲉⲣⲉ_ⲛⲧⲱⲧⲛ, ⲉⲥ, ⲉⲧⲃⲉ_ⲁⲛⲟⲕ, ⲉⲧⲉⲣⲉ_ⲛⲧⲟ, ⲉⲧⲉⲧⲛϣⲁⲛ, ⲉⲧⲉⲧⲛⲉ, ⲕ, ⲙⲉⲩ, ⲙⲙⲓⲛⲙⲙⲟ_ⲛⲧⲟ, ⲙⲡⲉ_ⲛⲧⲟ, ⲛ_ⲛⲧⲟ, ⲛⲉⲣⲉ_ⲛⲧⲟ, ⲛⲓⲙ, ⲛⲥⲁ_ⲛⲧⲟ, ⲛⲧⲉ_ⲁⲛⲟⲕ, ⲛⲧⲉⲧⲛ, ⲛⲧⲛ_ⲁⲛⲟⲕ, ⲛⲧⲟ, ⲛⲧⲟϥ, ⲛⲧⲟⲕ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲛⲧⲱⲧⲛ, ⲟⲩ, ⲟⲩⲏⲣ, ⲡⲉ, ⲡⲱⲥ, ⲣⲁⲧ_ⲁⲛⲟⲕ, ⲣⲟ_ⲛⲧⲟ, ⲥϥ, ⲧⲉⲧ, ⲧⲉⲧⲛ, ⲧⲣⲉϥ
- This corpus contains 29 lemmas tagged as determiners (DET): ϩⲛ, ϭⲉ, ϯ, ⲕⲉ, ⲛ, ⲛⲁ, ⲛⲁⲓ, ⲛⲉⲓ, ⲛⲟⲩⲓ, ⲟⲩ, ⲡ, ⲡⲁ, ⲡⲁⲓ, ⲡⲉϥ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲥ, ⲡⲉⲧⲛ, ⲡⲉⲩ, ⲡⲏ, ⲡⲓ, ⲡⲟⲩ, ⲡⲟⲩⲕ, ⲡⲱϥ, ⲡⲱⲓ, ⲡⲱⲕ, ⲡⲱⲧⲛ, ⲧ
- Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: ⲟⲩ
- This corpus contains 26 lemmas tagged as auxiliaries (AUX): ϣ, ϣⲁ, ϣⲁⲛⲧⲉ, ϣⲁⲣⲉ, ϫⲡⲓ, ⲁ, ⲉϣ, ⲉⲣϣⲁⲛ, ⲉⲣⲉ, ⲙⲁⲣⲉ, ⲙⲉ, ⲙⲉⲣⲉ, ⲙⲛ, ⲙⲡⲁⲧⲉ, ⲙⲡⲉ, ⲙⲡⲣⲧⲣⲉ, ⲛⲁ, ⲛⲉ, ⲛⲉϣ, ⲛⲉⲣⲉ, ⲛⲛⲉ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲟⲩⲛ, ⲧⲁⲣ, ⲧⲁⲣⲉ
- Out of the above, 6 lemmas occurred sometimes as AUX and sometimes as VERB: ϣⲁ, ϫⲡⲓ, ⲙⲉ, ⲙⲛ, ⲛⲁ, ⲟⲩⲛ
- There are 2 (de)verbal forms:
- Fin
- AUX: ϣ, ⲉϣ, ϫⲡⲓ, ⲛⲉϣ
- PRON: ϫⲓⲧ, ϫⲡⲟ
- VERB: ⲡⲉϫⲁ, ⲉⲓ, ϫⲱ, ϣⲱⲡⲉ, ⲃⲱⲕ, ϫⲟⲟ, ⲛⲁⲩ, ϯ, ⲣ, ⲥⲱⲧⲙ
- Inf
- VERB: ϯ, ⲃⲱⲕ, ⲛⲁⲩ, ⲣ, ϭⲱ, ϫⲓ, ⲁⲁ, ⲕⲁ, ⲧⲁⲙⲟ, ϣⲁϫⲉ
Nominal Features
- Fem
- DET: ⲧ, ⲧⲉ, ⲧⲉϥ, ⲧⲉⲓ, ⲧⲁ, ⲧⲉⲕ, ⲧⲁⲓ, ⲧⲉⲩ, ⲧⲉⲥ, ⲧⲟⲩ
- PRON: ⲥ, ⲧⲉ, ⲉ, ⲛⲧⲟⲥ, ⲉⲣⲟ, ⲁ, ⲛⲧⲟ, ⲙⲙⲟ, ⲣ, ⲁⲣ
- Masc
- DET: ⲡ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲡⲁⲓ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲥ, ⲡⲉⲩ
- PRON: ϥ, ⲕ, ⲡⲉ, ⲅ, ⲛⲧⲟϥ, ⲡ, ⲛⲧⲟⲕ, ⲉⲕϣⲁⲛ, ⲉϥϣⲁⲛ, ⲉϥⲉ
- Plur
- DET: ⲛ, ⲛⲉ, ⲛⲉϥ, ⲛⲁⲓ, ⲛⲁ, ⲙ, ⲛⲉⲛ, ⲛⲉⲩ, ⲛⲉⲕ, ⲛⲉⲧⲛ
- PRON: ⲩ, ⲟⲩ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ, ⲛⲉ, ⲧⲏⲩⲧⲛ, ⲉⲩⲉ, ⲛⲧⲱⲧⲛ
- Sing
- DET: ⲡ, ⲧ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ, ⲡⲉⲓ
- PRON: ϥ, ⲥ, ⲓ, ⲕ, ⲡⲉ, ϯ, ⲧⲉ, ⲁⲛⲟⲕ, ⲅ, ⲛⲧⲟϥ
- Def
- DET: ⲡ, ⲧ, ⲛ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ, ⲛⲉ, ⲛⲉϥ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- Ind
- DET: ⲟⲩ, ϩⲉⲛ, ⲩ
Degree and Polarity
- Neg
- ADV: ⲁⲛ, ⲛ, ⲧⲙ, ⲙⲡⲣ, ⲙ, ⲟⲩ, ⲟⲩⲕ, ⲙⲉ, ⲟⲩⲇⲉ
- AUX: ⲙⲡ, ⲙⲡⲉ, ⲛⲛⲉ, ⲙⲉ, ⲙⲛ, ⲙⲡⲣⲧⲣⲉ, ⲙⲡⲁⲧ, ⲙⲁⲣⲉ, ⲙⲡⲁⲧⲉ, ⲛⲛ
- CCONJ: ⲟⲩⲇⲉ
- PART: ⲙⲙⲟⲛ, ⲟⲩⲇⲉ
- PRON: ⲙⲡⲉ
- SCONJ: ⲟⲩⲇⲉ
- VERB: ⲙⲛ, ⲙⲛⲧ, ⲙⲛⲧⲁ, ⲙⲙⲛ, ⲙⲙⲛⲧ, ⲙⲛⲧⲏ, ⲙⲙⲛⲧⲁ, ⲙⲙⲛⲧⲉ, ⲙⲙⲛⲧⲏ
- X: ⲟⲩ
Verbal Features
Pronouns, Determiners, Quantifiers
- Art
- DET: ⲡ, ⲧ, ⲛ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲕⲉ, ⲧⲉ, ⲛⲉ, ⲩ
- Dem
- DET: ⲡⲁⲓ, ⲛⲁⲓ, ⲡⲉⲓ, ⲧⲉⲓ, ⲧⲁⲓ, ⲛⲉⲓ, ⲡⲓ, ⲛⲓ, ⲡⲏ, ⲛⲏ
- Ind
- PRON: ⲛⲓⲙ, ⲟⲩ
- Int
- ADV: ⲧⲱⲛ
- PRON: ⲟⲩ, ⲁϣ, ⲛⲓⲙ, ⲟⲩⲏⲣ, ⲁⲟⲩⲏⲣ, ⲡⲱⲥ
- Prs
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲛⲁ, ⲡⲉⲕ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- Rcp
- NOUN: ⲉⲣⲏⲩ
- Tot
- NOUN: ⲧⲏⲣ
- Card
- NUM: ⲟⲩⲁ, ⲥⲛⲁⲩ, ϣⲉ, ϣⲟⲙⲛⲧ, ⲙⲛⲧⲥⲛⲟⲟⲩⲥ, ⲙⲏⲧ, ⲥⲁϣϥ, ⲟⲩⲉⲓ, ⲧⲃⲁ, ⲥⲛⲧⲉ
- Yes
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲛⲁ, ⲡⲉⲕ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- PRON: ϥ, ⲟⲩ, ⲕ, ⲥ, ⲛ, ⲧⲛ, ⲧ, ⲩ, ⲧⲏⲩⲧⲛ, ⲓ
- Yes
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ⲙⲙⲓⲛⲙⲙⲟ
- 1
- DET: ⲡⲁ, ⲡⲉⲛ, ⲛⲁ, ⲛⲉⲛ, ⲧⲁ, ⲧⲉⲛ, ⲡⲱⲓ, ⲛⲟⲩⲓ
- PRON: ⲓ, ⲛ, ϯ, ⲁⲛⲟⲕ, ⲧⲛ, ⲧⲁ, ⲧ, ⲁⲛⲅ, ⲁⲛⲟⲛ, ⲁ
- 2
- DET: ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲛⲉⲧⲛ, ⲡⲟⲩ, ⲧⲟⲩ, ⲡⲉⲧⲛ, ⲛⲟⲩ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- PRON: ⲕ, ⲧⲉⲧⲛ, ⲧⲛ, ⲅ, ⲧⲏⲩⲧⲛ, ⲉ, ⲛⲧⲱⲧⲛ, ⲧⲉ, ⲛⲧⲟⲕ, ⲉⲕϣⲁⲛ
- 3
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲛⲉⲩ, ⲡⲉⲥ, ⲡⲉⲩ, ⲧⲉⲩ, ⲧⲉⲥ, ⲛⲉⲥ, ⲡⲱϥ
- PRON: ϥ, ⲩ, ⲥ, ⲟⲩ, ⲥⲉ, ⲛⲧⲟϥ, ⲉⲩⲉ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲉϥϣⲁⲛ
- Fem
- DET: ⲡⲉⲥ, ⲧⲉⲥ, ⲡⲟⲩ, ⲧⲟⲩ, ⲛⲟⲩ, ⲛⲉⲥ
- Masc
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲡⲱⲕ, ⲡⲱϥ, ⲛⲟⲩⲕ, ⲧⲱϥ
- Plur
- DET: ⲡⲉⲛ, ⲛⲉⲛ, ⲛⲉⲩ, ⲛⲉⲧⲛ, ⲡⲉⲩ, ⲧⲉⲩ, ⲡⲉⲧⲛ, ⲧⲉⲛ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- Sing
- DET: ⲡⲉϥ, ⲡⲁ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲧⲁ, ⲧⲉⲕ, ⲛⲉⲕ, ⲡⲉⲥ
Other Features
- Foreign
- Yes
- ADJ: ⲕⲁⲑⲟⲗⲓⲕⲏ
- ADP: ⲕⲁⲧⲁ, ⲭⲱⲣⲓⲥ, ⲡⲁⲣⲁ, ⲡⲣⲟⲥ, ⲕⲁⲧⲁⲣⲟ, ⲙ, ⲡⲁⲣⲁⲣⲟ
- ADV: ⲕⲁⲗⲱⲥ, ⲧⲟⲧⲉ, ⲗⲟⲓⲡⲟⲛ, ⲙⲁⲗⲓⲥⲧⲁ, ⲉⲧⲓ, ⲏⲇⲏ, ⲡⲁⲗⲓⲛ, ϩⲟⲗⲱⲥ, ⲉⲓⲧⲁ, ⲟⲩⲕ
- AUX: ⲟⲩⲛ
- CCONJ: ⲁⲗⲗⲁ, ⲏ, ⲟⲩⲇⲉ, ⲉⲓⲧⲉ, ⲙⲏ, ⲡⲗⲏⲛ, ⲕⲁⲓ, ⲟⲩⲧⲉ, ⲕⲁⲛ, ϩⲟⲧⲁⲛ
- DET: ⲛⲁⲓ
- NOUN: ⲭⲣⲓⲥⲧⲟⲥ, ⲡⲛⲉⲩⲙⲁ, ⲁⲡⲁ, ⲕⲟⲥⲙⲟⲥ, ⲥⲱⲙⲁ, ⲙⲁⲑⲏⲧⲏⲥ, ⲗⲁⲟⲥ, ⲥⲁⲣⲝ, ⲙⲟⲛⲁⲭⲟⲥ, ⲡⲟⲗⲓⲥ
- NUM: ⲟⲩⲁ, ⲥⲉ
- PART: ⲇⲉ, ⲅⲁⲣ, ⲱ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ, ⲁⲣⲁ, ⲟⲩⲛ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲁⲛⲧⲓ
- PRON: ⲟⲩ
- PROPN: ⲓⲏⲥⲟⲩⲥ, ⲓⲱϩⲁⲛⲛⲏⲥ, ⲃⲟⲉⲥ, ϩⲣⲟⲩⲑ, ⲛⲟⲉⲙⲓⲛ, ⲥⲁⲧⲁⲛⲁⲥ, ⲓⲁⲕⲱⲃ, ⲇⲏⲙⲏⲧⲣⲓⲟⲥ, ⲇⲓⲁⲃⲟⲗⲟⲥ, ⲡⲁⲩⲗⲟⲥ
- SCONJ: ϩⲱⲥⲧⲉ, ϩⲱⲥ, ⲉⲓⲙⲏⲧⲓ, ⲉⲡⲉⲓⲇⲏ, ⲙⲏⲡⲟⲧⲉ, ϩⲟⲧⲁⲛ, ⲕⲁⲛ, ⲙⲏⲧⲓ, ⲙⲏ, ⲅⲁⲣ
- VERB: ⲥⲧⲁⲩⲣⲟⲩ, ⲕⲣⲓⲛⲉ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲡⲓⲥⲧⲉⲩⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲁⲥⲡⲁⲍⲉ, ⲕⲗⲏⲣⲟⲛⲟⲙⲉⲓ, ⲛⲏⲥⲧⲉⲩⲉ
- VERB-Fin: ⲥⲧⲁⲩⲣⲟⲩ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲡⲓⲥⲧⲉⲩⲉ, ⲕⲣⲓⲛⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲕⲗⲏⲣⲟⲛⲟⲙⲉⲓ, ⲛⲏⲥⲧⲉⲩⲉ, ⲛⲟⲉⲓ
- VERB-Inf: ⲧⲁⲗⲉⲇⲱⲣⲟⲛ, ⲡⲁⲣⲁⲅⲉ, ϯⲛⲧⲟⲗⲏ, ⲁⲛⲉⲭⲉ, ⲁⲡⲟⲧⲁⲥⲥⲉ, ⲁⲥⲡⲁⲍⲉ, ⲃⲁⲡⲧⲓⲍⲉ, ⲇⲓⲁⲕⲣⲓⲛⲉ, ⲉⲡⲓⲃⲟⲩⲗⲉⲩⲉ, ⲉⲩⲁⲅⲅⲉⲗⲓⲍⲉ
- X: ⲙⲟⲛⲟⲛ, ⲁⲣⲭⲏⲉⲡⲓⲥⲕⲟⲡⲟⲩ, ⲁⲩⲧⲟⲩ, ⲉⲡⲓⲥⲧⲟⲗⲏ, ⲉⲡⲫⲁⲑⲁ, ⲕⲟⲩⲙ, ⲟⲩⲇ[.......], ⲧⲟⲩ, ⲭⲉⲓⲙⲏⲧⲓ
- Yes
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: ⲡⲉ.
- This corpus uses 26 lemmas as auxiliaries (aux). Examples: ⲁ, ⲛⲁ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲛⲉⲣⲉ, ⲙⲡⲉ, ϣⲁⲣⲉ, ⲙⲁⲣⲉ, ϣ, ⲛⲛⲉ, ⲉⲣϣⲁⲛ, ⲙⲉⲣⲉ, ϣⲁⲛⲧⲉ, ⲙⲛ, ⲙⲡⲁⲧⲉ, ⲟⲩⲛ, ⲙⲡⲣⲧⲣⲉ, ⲉⲣⲉ, ⲧⲁⲣ, ⲧⲁⲣⲉ, ϣⲁ, ϫⲡⲓ, ⲉϣ, ⲙⲉ, ⲛⲉ, ⲛⲉϣ.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (114)
- VERB--NOUN-ADP(ⲛ) (2)
- VERB--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB--PRON (32)
- VERB-Fin--NOUN (561)
- VERB-Fin--NOUN-ADP(ϩⲓⲣⲛ) (1)
- VERB-Fin--NOUN-ADP(ⲛ) (7)
- VERB-Fin--PRON (4317)
- obj
- VERB--NOUN (41)
- VERB--NOUN-ADP(ⲛ) (13)
- VERB--PRON (176)
- VERB-Fin--NOUN (396)
- VERB-Fin--NOUN-ADP(ⲙ) (3)
- VERB-Fin--NOUN-ADP(ⲛ) (483)
- VERB-Fin--NOUN-ADP(ⲛ)-ADP(ⲛ) (12)
- VERB-Fin--PRON (763)
- VERB-Fin--PRON-ADP(ⲉ) (1)
- VERB-Fin--PRON-ADP(ⲛ) (243)
- VERB-Fin--PRON-ADP(ⲧⲟⲟⲧ) (1)
- VERB-Inf--NOUN (25)
- VERB-Inf--NOUN-ADP(ⲛ) (19)
- VERB-Inf--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB-Inf--PRON (52)
- VERB-Inf--PRON-ADP(ⲛ) (4)
- iobj
- VERB--NOUN (5)
- VERB--PRON (73)
- VERB--PRON-ADP(ⲛ) (2)
- VERB-Fin--NOUN (1)
- VERB-Fin--PRON (1)
Relations Overview
- This corpus uses 4 relation subtypes: acl:relcl, nmod:poss, nmod:unmarked, obl:unmarked
- The following 3 relation types are not used in this corpus at all: clf, list, goeswith