home edit page issue tracker

This page pertains to UD version 2.

UD Komi Zyrian Lattice

Language: Komi Zyrian (code: kpv)
Family: Uralic, Permic

This treebank has been part of Universal Dependencies since the UD v2.2 release.

The following people have contributed to making this treebank part of UD: Niko Partanen, KyungTae Lim, Thierry Poibeau, Jack Rueter.

Repository: UD_Komi_Zyrian-Lattice
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-SA 4.0

Genre: fiction

Questions, comments? General annotation questions (either Komi Zyrian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [nikotapiopartanen (æt) gmail • com]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas assigned by a program, with some manual corrections, but not a full manual verification
UPOS annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
XPOS assigned by a program, with some manual corrections, but not a full manual verification
Features assigned by a program, with some manual corrections, but not a full manual verification
Relations annotated manually, natively in UD style

Description

UD Komi-Zyrian Lattice is a treebank of written standard Komi-Zyrian.

The treebank contains continuous texts in the written Komi-Zyrian language. All of the data comes from openly available sources, or are individual sentences used here with exact references to the original works. The largest segments are from Lev Uspenskiy’s book Нёль боевӧй случай (URN http://urn.fi/URN:NBN:fi-fe2014102045428), which was digitalized in the National Library of Finland’s Fenno-Ugrica project, and Ivan Belyx’s short story which was added to Komi Nebögain online library by the author himself in 2013. Some articles from the newspaper Выль туйӧд’s 1939 number 67 have also been included as complete texts. Several texts are used for individual examples, and these are marked with the tag -ind, to distinguish them from the running text. The complete list of sources with additional information and further links is available in this README file below.

Acknowledgments

This work has been developed within the framework of the LAKME project funded by a grant from Paris Sciences et Lettres (IDEX PSL reference ANR-10-IDEX-0001-02). Thierry Poibeau is also partially supported by a RGNF-CNRS (grant between the LATTICE-CNRS Laboratory and the Russian State University for the Humanities in Moscow). In the UD releases 2.7 and 2.8. Kone Foundation-funded research project Language Documentation meets Language Technology: The Next Step in the Description of Komi, led by Rogier Blokland and Michael Rießler, expanded the treebank. In this phase the annotation work was done by Niko Partanen and Jack Rueter. GiellaLT infrastucture was used in morphological analysis.

If you use this treebank in your work, please cite:

Sources used

Sentence id start Publication name Publishing year Link to Fenno-Ugrica Link to Komi Nebögain  
belykh1997a Иван Белых 1997: Шера зэр 1997   http://komikyv.org/kpv/content/шера-зэр  
belykh2005a Иван Белых 2005: Таысь унаыс оз и ков 2005   http://komikyv.org/kpv/node/26865  
falkner1927a Фалькнер В.Н. 1927: Лоам ми ёнӧсь, дзоньвидзаӧсь да!: мӧд нига” 1927 http://urn.fi/URN:NBN:fi-fe2014102045431 http://komikyv.org/kpv/contents/loam-mi-yonos-dzonvidzaos-da  
goffensefer1927a Гоффеншефер С. 1927: Чакотка 1927 http://urn.fi/URN:NBN:fi-fe201604159701 http://komikyv.org/kpv/contents/chahotka  
koskabyr1925a Кос Кабыр 1925: Мувыв тӧдмалӧм. Медводдза нига = Ывлавыв тӧдмалӧм. Арся уджъяс. 1925 http://urn.fi/URN:NBN:fi-fe2014070132058 http://komikyv.org/kpv/contents/yvlavyv-todmalom-1  
kpv_1-7_Vyl_tujod_1939_09_10_0001 Выль туйӧд 9.10.1939, page 1 1939 http://urn.fi/URN:NBN:fi-fe201802013020    
kpv_1-7_Vyl_tujod_1939_09_10_0002 Выль туйӧд 9.10.1939, page 2 1939 http://urn.fi/URN:NBN:fi-fe201802013020    
rossinskiy1925a Российскӧй Д. 1925: Гут разӧдӧ висьӧмъяс 1925 http://urn.fi/URN:NBN:fi-fe2014070332096 http://komikyv.org/kpv/content/гут-разӧдӧ-висьӧмъяс  
uspensky1940a Успенский, Лев 1940: Нёль боевӧй случай 1940 http://urn.fi/URN:NBN:fi-fe2014102045428    
Minin:OjsjaCvettez Minjin I. A. (Минин И. А.) 1962. Vojsya dzoridzjas (Войся дзоридзьяс) (висьт) // Войвыв кодзув (1962. №1) 1962   http://komikyv.org/kpv/node/31062  

In release 2.7 various sentences used in the Komi-language grammar Ӧнія коми кыв (2000) were included. These sentences are marked with sent_id’s that contain components OKK:2000:page:n-th sentence:original author. These sentences were included to cover different grammatical phenomena in Komi more thoroughly. When refering to these sentences, we advise you also cite the original source:

Statistics of UD Komi Zyrian Lattice

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJVERB

Features

AbbrAdpTypeAdvTypeAnimacyCaseCliticConnegativeDefiniteDegreeDerivationEvidentMoodNounTypeNumberNumber[psor]Number[subj]NumTypePartFormPersonPerson[psor]PolarityPronTypeReflexTenseVariantVerbFormVoice

Relations

aclacl:relcladvcladvcl:evaladvcl:lcladvcl:tcladvmodadvmod:cauadvmod:degadvmod:emphadvmod:evaladvmod:focadvmod:freqadvmod:lmodadvmod:mmodadvmod:tmodamodapposauxaux:aspectaux:necaux:negaux:potaux:tensecasecccc:preconjccompcompoundconjcopcsubjcsubj:copdetdiscoursedislocatedfixedflatflat:nameflat:numlistmarknmodnmod:apposnmod:lmodnmod:objnmod:possnmod:subjnsubjnsubj:copnummodobjoblobl:agentobl:cauobl:cmpobl:lmodobl:mclobl:tmodorphanparataxispunctreparandumrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview