UD for Haitian Creole
Kreyòl (Kreyòl Ayisyen, Haitian Creole, iso-639-1: ht) is the main language of Haïti. It is spoken by more than 10M speakers both in Haïti and in the diaspora (Canada and USA mainly). It is an official language of Haïti. The dialect spoken in the Cap Haïtien dialect differs slightly in its lexicon from the Center and South varieties.
Tokenization and Word Segmentation
- Words are delimited by whitespace characters.
- Some sequences of grammatical words are often written in one othographic word:
- PRON-AUX: lap = li + ap, yap = yo + ap, kap = ki + ap, nap = nou + ap;
- sakap = sa + ki + ap;
- ADV-AUX: pat = pa + te, pata = pa + ta, pap = pa + ap;
- AUX-AUX: tap = t + ap, taka = ta + ka; sete = se + te.
Morphology
Haitian Creole is an isolating language.
Tags
PART
has not been used.- Haïtian has four preverbal particules that have been tagged
AUX
: te, ap, ta, pral. PotentialAUX
(not taggedAUX
): ka ‘can’, pa ‘not’. - The
DET
-PRON
distinction is unclear and still under studies. We decided to tagDET
the pronominal words depending on aNOUN
, but their forms are similar toPRON
s and they are positioned after theNOUN
as the noun complements.
Features
PRON
havePerson
andNumber
features. PronominalDET
also. SomeDET
have aDefinite
feature.AUX
have a feature:- te:
Tense=Past
; - ap:
Aspect=Prog
; - ta:
Mood=Cnd
; - pral:
Tense=Fut
; - se: copula.
- te:
- Other POS have no morphosyntactic features.
- The treebank is glossed (
Gloss
feature) and translated (text_fr
) in French, the lexifier language of this creole.
Syntax
- Haïtian is an SVO language.
- It has double object constructions, with the Recipient before the Theme. The recipient has been tagged
iobj
and the Themeobj
. We do not know any property to distinguish them except their resepective order and their semantic roles. - Some
ADJ
s,NOUN
s, andADV
s can root a clause. There is a copula, se, which is used in existential constructions. - It has
nmod
s without markers. - It has
DET
s before and after theNOUN
.DET
s after theNOUN
could benmod
. A further study is needed. - Some
ADJ
s must be before theNOUN
, while others must be after theNOUN
. The repartition is similar to French. - It has some Serial Verb Constructions, annotated
compound:svc
. - We have one cleft construction in our corpus, annotated advcl:cleft.
- The parataxis:insert relation is used for parenthetical clauses which could not be considered as independent sentences, since they are not saturated.
- Oblique modifiers have been distinguished by the obl:mod sub-relation.
Treebanks
There is 1 Haitian Creole UD treebank: