UD for Azerbaijani
Tokenization and Word Segmentation
- In general, words are delimited by whitespace characters.
- Punctuation marks tokenized as separate tokens (words).
- Hyphenated fixed and compounds are kept as three tokens: tez-tez, sil-süpür.
Morphology
Tags
- Azerbaijani uses 14 universal POS categories, including:
ADJ
yaxçı, böyük, ect.ADP
qədər, için, -ki, etc.ADV
daha, də, fәqәt, etc.AUX
i, dəyil, etc.CCONJ
amma, və, ya, etc.DET
o, bir, bu, etc.INTJ
Pəs.NOUN
pәncәrә, maşɪn, etc.NUM
bir, neçә, beş, etc.PRON
o, mən, etc.PROPN
Deniz, Sam, etc.PUNCT
-, ?, ., etc.SCONJ
ki, əyər, etc.VERB
getdi, fikr eliyәm, yudurtdu, yazdığını, etc.
Features
- Azerbaijani nouns are typically suffixed. However, a small number of prefixes are borrowed mostly from Persian.
- Azerbaijani, like all other Turkic languages, is devoid of grammatical gender. But there are ways to express gender lexically, if required.
- In Number marking of nouns, there are two values: Singular and Plural; Singular forms are unmarked.
- There are seven Cases in Azerbaijani: NOM, GEN, DAT, ACC, COM/INS, LOC, ABL. The NOM case does not take a suffix.
- Yes-no questions can be constructed in two ways: (i) by using interrogative intonation contour, and (ii) by adding the interrogative particle aya.
Inflectional Structure Representation
- Noun + (Plural marker) + (Possessive pronoun) + (Case marker)
- Adjective + (Comparative/Superlative degree marker)
- Verb stem + (Reciprocal marker) + (Causative marker) + (Voice marker) + (Reflexive marker) + (Negation marker) + (Mood marker) + {(Imperfective Aspect marker) + (Perfective Aspect marker)} + (Tense marker) + Person and Number marker.
Syntax
Core Arguments, Oblique Arguments and Adjuncts
- Nominal subject
nsubj
is a noun phrase in the nominative case, without adposition.- A subordinate clause may serve as the subject and is labeled
csubj
.
- A subordinate clause may serve as the subject and is labeled
- Object
obj
is a noun phrase without adposition and typically in the accusative case. - Oblique
obl
is a non-core nominal (noun, pronoun, noun phrase) argument or adjunct.
Relations Overview
- The following relation subtypes are used in Azerbaijani:
compound:redup
for reduplicated compoundscsubj:outer
for outer clause clausal subjectnsubj:outer
outer clause nominal subjectadvmod:emph
for emphasizing word, intensifier- compound:lvc` for light-verb constructions
- The following relation types are not used in Azerbaijani at all:
dep
,dislocated
,expl
,goeswith
,iobj
,list
,reparandum
Treebanks
There is 1 Azerbaijani UD treebank: