
This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-FTB: Features: Clitic

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 9 combinations have been observed: Han|Ka, Han|Kin, Han|Ko, Han|Pa, Ka|S, Kaan|Ko, Kin|Ko, Ko|S, Pa|S.
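As a rough illustration of how the single and combined values listed above could be read from annotated data, the sketch below parses a CoNLL-U FEATS string. It assumes the treebank files follow the standard CoNLL-U conventions, where features are separated by `|` and multiple values of one feature by a comma (so a combination appears as `Clitic=Han,Ko`). The helper names `parse_feats` and `clitic_values` are hypothetical and are not part of any UD tool.

```python
def parse_feats(feats: str) -> dict[str, list[str]]:
    """Parse a CoNLL-U FEATS string into {feature name: [values]}."""
    if feats in ("", "_"):  # "_" marks an empty FEATS column
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        parsed[name] = value.split(",")  # combined values are comma-separated
    return parsed


def clitic_values(feats: str) -> list[str]:
    """Return the Clitic values of a token, or an empty list if it has none."""
    return parse_feats(feats).get("Clitic", [])


# Illustrative FEATS string (assumed layout, modelled on the rows on this page).
example = "Clitic=Han,Ko|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
print(clitic_values(example))  # → ['Han', 'Ko']
```

A token with an empty FEATS column (`_`) yields an empty list, which is what "non-empty value of Clitic" is tested against in the counts below.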

2949 tokens (2%) have a non-empty value of Clitic. 1730 types (4%) occur at least once with a non-empty value of Clitic. 762 lemmas (4%) occur at least once with a non-empty value of Clitic. The feature is used with 11 part-of-speech tags: VERB (965; 1% of instances), AUX (708; 0% of instances), NOUN (354; 0% of instances), PRON (312; 0% of instances), ADV (217; 0% of instances), PART (119; 0% of instances), ADJ (110; 0% of instances), DET (78; 0% of instances), PROPN (57; 0% of instances), NUM (21; 0% of instances), ADP (8; 0% of instances).
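Counts of this kind (tokens, types, lemmas, and per-tag totals) could be reproduced along the following lines. This is a minimal sketch, not the script that generated this page: it assumes standard ten-column CoNLL-U input, counts word forms case-sensitively as types, and runs on a made-up toy sentence rather than treebank data. The names `clitic_stats`, `row`, and `SAMPLE` are hypothetical.

```python
from collections import Counter


def has_clitic(feats: str) -> bool:
    """True if the FEATS column carries a Clitic feature."""
    return any(pair.startswith("Clitic=") for pair in feats.split("|"))


def clitic_stats(conllu_text: str) -> dict:
    """Count tokens, distinct forms, distinct lemmas and UPOS tags with Clitic."""
    tokens = 0
    types, lemmas = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        form, lemma, upos, feats = cols[1], cols[2], cols[3], cols[5]
        if has_clitic(feats):
            tokens += 1
            types.add(form)    # assumption: forms are compared case-sensitively
            lemmas.add(lemma)
            by_upos[upos] += 1
    return {"tokens": tokens, "types": len(types),
            "lemmas": len(lemmas), "by_upos": dict(by_upos)}


def row(*cols: str) -> str:
    """Join ten CoNLL-U columns with tabs."""
    return "\t".join(cols)


# Toy input, invented for illustration; annotations are simplified.
SAMPLE = "\n".join([
    "# toy sentence, not taken from the treebank",
    row("1", "Onko", "olla", "VERB", "_", "Clitic=Ko", "0", "root", "_", "_"),
    row("2", "se", "se", "PRON", "_", "Case=Nom", "1", "nsubj", "_", "_"),
    row("3", "lapsikin", "lapsi", "NOUN", "_", "Case=Nom|Clitic=Kin|Number=Sing", "1", "dep", "_", "_"),
    row("4", "?", "?", "PUNCT", "_", "_", "1", "punct", "_", "_"),
])

stats = clitic_stats(SAMPLE)
print(stats)
```

Dividing each count by the corresponding total for the whole treebank would give percentages like those quoted above.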

VERB

965 VERB tokens (4% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: PartForm=EMPTY (924; 96%), InfForm=EMPTY (910; 94%), Voice=Act (888; 92%), Case=EMPTY (869; 90%), VerbForm=Fin (868; 90%), Number=Sing (689; 71%), Mood=Ind (681; 71%), Tense=Pres (504; 52%).
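The co-occurrence figures can be read as counts over the tokens of one part of speech that carry Clitic. The sketch below shows one way such a tally might be computed, assuming that `EMPTY` marks tokens on which the named feature is absent. The function name `cooccurrence` and the two FEATS strings are illustrative and are not taken from the treebank.

```python
from collections import Counter


def cooccurrence(feats_list: list[str], tracked: list[str]) -> Counter:
    """Tally Feature=Value pairs, using EMPTY when a tracked feature is absent."""
    counts = Counter()
    for feats in feats_list:
        present = dict(p.split("=", 1) for p in feats.split("|") if "=" in p)
        for name in tracked:
            counts[f"{name}={present.get(name, 'EMPTY')}"] += 1
    return counts


# Two made-up VERB tokens that carry Clitic.
verb_feats = [
    "Clitic=Ko|Mood=Ind|VerbForm=Fin|Voice=Act",
    "Case=Gen|Clitic=Kaan|PartForm=Past|VerbForm=Part|Voice=Act",
]
counts = cooccurrence(verb_feats, ["PartForm", "VerbForm", "Voice"])
for key, n in counts.most_common():
    print(f"{key} ({n}; {100 * n // len(verb_feats)}%)")
```

Under this reading, a high share for `PartForm=EMPTY` simply means that most VERB tokens with Clitic are not participles.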

VERB tokens may have the following values of Clitic:

Paradigm olla (Clitic columns, in order: Han; Han,Ko; Kaan; Kin; Ko; Ko,S; Pa; Pa,S; S)
Case=Gen|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: olleenkaan
Case=Gen|Number=Sing|PartForm=Pres|VerbForm=Part|Voice=Act: olevankaan
Case=Ine|InfForm=2|VerbForm=Inf|Voice=Act: ollessakaan
Case=Lat|InfForm=1|VerbForm=Inf|Voice=Act: ollakaan; Ollako; Ollapa
Case=Nom|Number=Sing|PartForm=Past|Style=Coll|VerbForm=Part|Voice=Act: ollukki; ollukko
Case=Nom|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Act: ollutkaan
Case=Nom|Number=Plur|PartForm=Past|VerbForm=Part|Voice=Act: olleetkin
Connegative=Yes|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act: ollutkaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: ookin
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act: olekaan
Mood=Cnd|Number=Sing|Person=3|Style=Coll|VerbForm=Fin|Voice=Act: Oiskohan; oliskin; oisko, Olisko
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=Act: olisikin; olisiko; Olisipa
Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Act: olekin
Mood=Ind|Number=Sing|Person=1|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: Oonksmä, ooks, oonko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act: Olenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: ootko, ootsä, Ookkonää, oleks
Mood=Ind|Number=Sing|Person=2|Tense=Past|VerbForm=Fin|Voice=Act: Olithan; Olitkos
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act: oletkaan; Oletko
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Act: olikii; Oliks
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act: onki; onks, onk
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olikohan; olikaan; olikin; oliko
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: onhan; Onkohan; onkaan; onkin; onko; onkos; Onpa
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act: olemmeko
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Act: Olitteks
Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act: Oletteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: olivatkaan
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: ovatkin; ovatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Pass: Ollaas
Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Pass: Ollaanpas
Mood=Pot|Number=Sing|Person=3|VerbForm=Fin|Voice=Act: Liekö

AUX

708 AUX tokens (7% of all AUX tokens) have a non-empty value of Clitic.

The most frequent other feature values with which AUX and Clitic co-occurred: VerbForm=Fin (703; 99%), Voice=Act (697; 98%), Number=Sing (624; 88%), Person=3 (519; 73%), Tense=EMPTY (403; 57%), Polarity=EMPTY (371; 52%).

AUX tokens may have the following values of Clitic:

Paradigm ei (Clitic columns, in order: Han; Han,Ko; Han,Pa; Ka; Ko; Ko,S; Pa; Pa,S; S)
Mood=Imp|Number=Sing|Person=2: Älähän; äläkä; Äläs
Mood=Imp|Number=Plur|Person=2: Älkääs
Number=Sing|Person=1|Style=Coll: enhä
Number=Sing|Person=1: enhän; Enköhän; enkä; enkö; enkös; Enpä; Enpäs
Number=Sing|Person=2|Style=Coll: Eksää, Etsä, eks
Number=Sing|Person=2: ethän; etkä; etkö
Number=Sing|Person=3|Style=Coll: eihä; eiks, eiks'; eipa
Number=Sing|Person=3: eihän; eiköhän; Eipähän; eikä; eikö; Eikös; eipä; eipäs
Number=Plur|Person=1: emmekä
Number=Plur|Person=2: ettekä; Ettekö
Number=Plur|Person=3: Eivätköhän; eivätkä; eivätkö; eivätpä

NOUN

354 NOUN tokens (1% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (263; 74%).

NOUN tokens may have the following values of Clitic:

Paradigm lapsi (Clitic columns, in order: Han; Han,Kin; Kaan; Kin)
Case=Ade|Number=Plur: lapsillakin
Case=Ela|Number=Sing: Lapsestakin
Case=Ill|Number=Plur: Lapsiinhan
Case=Nom|Number=Sing: lapsikaan; lapsikin
Case=Nom|Number=Plur: lapsetkin
Case=Par|Number=Plur|Style=Coll: lapsijakkiihan

Clitic seems to be a lexical feature of NOUN: 92% of lemmas (249) occur with only one value of Clitic.
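The "lexical feature" observation rests on how many lemmas are seen with exactly one value of the feature. A minimal sketch of that check follows, assuming the input is a list of (lemma, Clitic value) pairs already extracted for one part of speech. The function name `single_value_share` and the toy pairs are hypothetical and do not come from the treebank.

```python
from collections import defaultdict


def single_value_share(pairs: list[tuple[str, str]]) -> tuple[int, int, float]:
    """Return (lemmas with one value, all lemmas, share) for (lemma, value) pairs."""
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    share = single / total if total else 0.0
    return single, total, share


# Toy observations: "lapsi" is seen with two values, the others with one.
toy_pairs = [
    ("lapsi", "Kin"), ("lapsi", "Kaan"),
    ("talo", "Kin"), ("talo", "Kin"),
    ("koira", "Han"),
]
single, total, share = single_value_share(toy_pairs)
print(f"{single} of {total} lemmas ({share:.0%}) occur with only one value")
```

A high share can also arise simply because most lemmas are observed with a clitic only once or twice, so the figure is a heuristic signal rather than proof that the feature is lexical.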

PRON

312 PRON tokens (3% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Number=Sing (238; 76%), Person=EMPTY (206; 66%), Case=Nom (178; 57%).

PRON tokens may have the following values of Clitic:

Paradigm se (Clitic columns, in order: Han; Kaan; Kaan,Ko; Kin; Ko; Pa; Pa,S)
Case=Ade: silläkin
Case=Ela: siitähän; siitäkään; Siitäkin; Siitäpä
Case=Gen: senhän; senkään
Case=Ill: siihenkin; Siihenkö
Case=Ine: siinäkin; siinäpä
Case=Ine|Style=Coll: siinähä
Case=Nom: sehän; sekään; sekin; sekö; Sepä; Sepäs
Case=Nom|Style=Coll: seki
Case=Par: Sitähän; sitäkään; sitäkäänkö; sitäkin; Sitäkö

ADV

217 ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADV and Clitic co-occurred: PronType=EMPTY (132; 61%).

ADV tokens may have the following values of Clitic:

Paradigm miksi (Clitic columns, in order: Han; Han,Ka; Han,Ko; Ko; Pa)
miksihän; Miksikähän; Miksiköhän; Miksikö; miksipä

PART

119 PART tokens (2% of all PART tokens) have a non-empty value of Clitic.

PART tokens may have the following values of Clitic:

Paradigm kyllä (Clitic columns, in order: Han; Kaan; Kin; Pa; Pa,S)
_: kyllähän; kylläkään; kylläkin; Kylläpä; kylläpäs
Style=Coll: kylhän

ADJ

110 ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (70; 64%).

ADJ tokens may have the following values of Clitic:

Paradigm oma (Clitic columns, in order: Kin; Pa)
Case=Ela|Number=Plur|Style=Coll: omistaki
Case=Nom|Number=Sing: Omapa
Case=Nom|Number=Plur: omatkin

Clitic seems to be a lexical feature of ADJ: 95% of lemmas (73) occur with only one value of Clitic.

DET

78 DET tokens (2% of all DET tokens) have a non-empty value of Clitic.

The most frequent other feature values with which DET and Clitic co-occurred: Number=Sing (47; 60%).

DET tokens may have the following values of Clitic:

Paradigm tämä (Clitic columns, in order: Han; Kaan; Kin; Ko)
Case=Ess: Tänäkään; tänäkin
Case=Gen: tämänkään; tämänkin
Case=Ine: tässäkin
Case=Nom: Tämähän; Tämäkään; Tämäkö
Case=Par|Style=Coll: tätäkä

PROPN

57 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (56; 98%).

PROPN tokens may have the following values of Clitic:

Paradigm suomi (Clitic columns, in order: Kin; Ko)
Case=Ela: Suomestakin
Case=Gen: Suomenkin
Case=Ine: Suomessakin; Suomessako
Case=Nom: Suomikin

Clitic seems to be a lexical feature of PROPN: 98% of lemmas (44) occur with only one value of Clitic.

NUM

21 NUM tokens (1% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: NumType=Card (21; 100%), Number=Sing (20; 95%), Case=Nom (15; 71%).

NUM tokens may have the following values of Clitic:

Paradigm yksi (Clitic columns, in order: Kaan; Kin)
Case=Ess: yhtenäkään
Case=Gen: yhdenkin
Case=Nom: yksikään; yksikin

ADP

8 ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

ADP tokens may have the following values of Clitic: