
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-BOUN: POS Tags: AUX

There are 8 AUX lemmas (0%), 242 AUX types (1%) and 3752 AUX tokens (3%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 8 in number of types and 8 in number of tokens.

All 8 AUX lemmas, ordered by frequency: y, ol, i, mi, değil, null, dur, bulun

The 10 most frequent AUX types: du, olarak, dı, ken, değil, tı, di, dır, ti, sa

The 7 ambiguous AUX lemmas (lemmas that also occur with other tags), ordered by AUX frequency: y (AUX 2325, PART 5, ADJ 2), ol (VERB 1754, AUX 448, ADP 33, NOUN 6, ADV 5, ADJ 1), i (AUX 398, PART 115, VERB 6, NOUN 4), mi (AUX 292, NOUN 1, VERB 1), değil (AUX 243, VERB 6, CCONJ 3), dur (VERB 111, AUX 7, NOUN 2), bulun (VERB 202, AUX 1)

The 10 most frequent ambiguous types: olarak (AUX 269, ADP 33, VERB 2, ADV 1), değil (AUX 154, CCONJ 3), mi (AUX 95, NOUN 1), ydi (AUX 80, PART 1), (AUX 70, PUNCT 1), tir (AUX 58, ADV 2), se (AUX 47, PART 2), tur (AUX 46, NOUN 4), dur (AUX 44, VERB 4, PART 1), olan (VERB 213, AUX 42)


The form / lemma ratio of AUX is 30.250000 (the average of all parts of speech is 2.412899).
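The ratio above can be reproduced from the counts given earlier on this page. The sketch below assumes that the ratio is simply the number of distinct AUX types (word forms) divided by the number of distinct AUX lemmas; the function name `form_lemma_ratio` is hypothetical and not part of any UD tool.

```python
def form_lemma_ratio(num_types: int, num_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    return num_types / num_lemmas

# Counts taken from this page: 242 AUX types and 8 AUX lemmas.
aux_ratio = form_lemma_ratio(242, 8)
print(f"{aux_ratio:.6f}")  # 30.250000
```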

The 1st highest number of forms (121) was observed with the lemma “y”: ‘di, ‘dı, ‘tü, ‘ydi, ‘ymiş, MİŞ, acaksa, acaktı, acaktım, di, dik, dikleri, diler, dim, din, diniz, dir, dirler, du, duk, dukları, duktan, dum, dun, dunuz, dur, duğun, dü, dük, düler, düm, dür, dı, dık, dım, dınız, dır, dırlar, ecekti, ecektik, idi, idiler, idim, idiyse, iken, imiş, iseniz, ken, lar, lardır, ler, lerdir, miş, mişler, muş, muşum, muşuz, müş, mış, mışız, sa, sak, sam, san, sanız, se, sek, sem, sen, seniz, sin, siniz, sinizdir, sın, sınız, ti, tik, tim, tin, tir, tu, tuk, tum, tur, tü, tük, tüm, tür, tı, tık, tım, tır, ydi, ydik, ydiler, ydim, ydin, ydu, yduk, ydum, ydü, ydı, ydık, ydılar, ydım, ydın, ydınız, yim, yken, ymiş, ymuş, ymış, ysa, ysan, ysanız, yse, yseniz, yum, yım, yız, ız.

The 2nd highest number of forms (58) was observed with the lemma “ol”: olabilir, olabilmekte, olacak, olacaklar, olacağı, olacağını, olacağız, olan, olandır, olanlar, olanlarla, olanların, olanı, olanımız, olarak, oldu, olduklarına, olduklarını, oldum, olduğu, olduğum, olduğumuz, olduğun, olduğuna, olduğunda, olduğundan, olduğunu, olduğunuzu, olmadığı, olmadığını, olmak, olmaktan, olmalı, olmamakla, olmamdır, olmamış, olmanın, olması, olmasına, olmasını, olmasının, olmasıydı, olmaya, olmayacaktır, olmayan, olmayanlar, olmaz, olmuş, olmuşlar, olmuştur, olsa, olup, olur, olurken, olurlar, olursunuz, oluyor, oluşundan.

The 3rd highest number of forms (47) was observed with the lemma “i”: ‘dir, ‘ydi, dik, dir, dirler, du, dur, durlar, dür, dı, dır, dırlar, idi, iken, imiş, ise, iseniz, kalacaktır, ken, lardır, mişim, mış, mışsın, sa, sak, se, sede, sem, tir, tur, tür, tı, tır, ydi, ydik, ydim, ydü, ydı, yim, yken, ymış, ysa, yse, yum, yım, yız, İdi.

AUX occurs with 12 features: Number (3106; 83% instances), Person (3106; 83% instances), Tense (2630; 70% instances), Polarity (2400; 64% instances), Evident (1577; 42% instances), Aspect (1337; 36% instances), Mood (977; 26% instances), VerbForm (600; 16% instances), Case (293; 8% instances), Number[psor] (81; 2% instances), Person[psor] (81; 2% instances), Voice (20; 1% instances)
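The percentages in the list above appear to be each feature's token count divided by the total number of AUX tokens (3752), rounded to the nearest whole percent. A minimal sketch of that arithmetic, using only the counts reported on this page (the variable names are illustrative):

```python
AUX_TOKENS = 3752  # total AUX tokens reported on this page

# Feature -> number of AUX tokens carrying that feature (from the list above).
feature_counts = {
    "Number": 3106, "Person": 3106, "Tense": 2630, "Polarity": 2400,
    "Evident": 1577, "Aspect": 1337, "Mood": 977, "VerbForm": 600,
    "Case": 293, "Number[psor]": 81, "Person[psor]": 81, "Voice": 20,
}

# Share of AUX tokens per feature, rounded to a whole percent (assumed rounding).
feature_share = {
    name: round(100 * count / AUX_TOKENS)
    for name, count in feature_counts.items()
}

for name, share in feature_share.items():
    print(f"{name} ({feature_counts[name]}; {share}% instances)")
```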

AUX occurs with 40 feature-value pairs: Aspect=Hab, Aspect=Imp, Aspect=Perf, Aspect=Prog, Aspect=Prosp, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Evident=Fh, Evident=Nfh, Mood=Cnd, Mood=Des, Mood=Gen, Mood=Imp, Mood=Ind, Mood=Nec, Mood=Pot, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Part, VerbForm=Vnoun, Voice=Pass

AUX occurs with 187 feature combinations. The most frequent feature combination is Evident=Fh|Number=Sing|Person=3|Polarity=Pos|Tense=Past (764 tokens). Examples: du, dı, di, tı, ti, tu, tü, acaktı, dü, ecekti


AUX nodes are attached to their parents using 16 different relations: cop (2909; 78% instances), aux (458; 12% instances), discourse:q (287; 8% instances), root (30; 1% instances), conj (24; 1% instances), advcl (9; 0% instances), discourse (8; 0% instances), compound (5; 0% instances), acl (4; 0% instances), aux:q (4; 0% instances), compound:lvc (4; 0% instances), ccomp (3; 0% instances), obj (3; 0% instances), obl (2; 0% instances), compound:redup (1; 0% instances), parataxis (1; 0% instances)
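A relation distribution like the one above could be obtained by counting the DEPREL column of every AUX token in the treebank's CoNLL-U files. The sketch below is not the script that generated this page; it runs on a tiny fragment invented for illustration, and the helper name `count_relations` is hypothetical.

```python
from collections import Counter

# Toy CoNLL-U rows, invented for illustration (not taken from UD_Turkish-BOUN).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
ROWS = [
    ("1", "ev", "ev", "NOUN", "_", "Case=Nom|Number=Sing|Person=3", "0", "root", "_", "_"),
    ("2", "di", "y", "AUX", "_", "Number=Sing|Person=3|Tense=Past", "1", "cop", "_", "_"),
    ("3", "mi", "mi", "AUX", "_", "_", "1", "discourse:q", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

def count_relations(conllu_text: str, upos: str = "AUX") -> Counter:
    """Count the dependency relations of all tokens with the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts

relation_counts = count_relations(SAMPLE)
total = sum(relation_counts.values())
for rel, n in relation_counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

To apply this to a real treebank, the contents of a `.conllu` file would be passed in place of `SAMPLE`.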

Parents of AUX nodes belong to 14 different parts of speech: VERB (2039; 54% instances), NOUN (1056; 28% instances), ADJ (390; 10% instances), PRON (84; 2% instances), ADV (61; 2% instances), PROPN (59; 2% instances), [no part of speech: the artificial root node] (30; 1% instances), AUX (9; 0% instances), NUM (9; 0% instances), ADP (6; 0% instances), DET (5; 0% instances), CCONJ (2; 0% instances), INTJ (1; 0% instances), PUNCT (1; 0% instances)

3666 (98%) AUX nodes are leaves.

38 (1%) AUX nodes have one child.

11 (0%) AUX nodes have two children.

37 (1%) AUX nodes have three or more children.

The highest child degree of an AUX node is 8.
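As a consistency check, the four child-degree classes above should partition the 3752 AUX tokens reported on this page. The short sketch below verifies the sum and recomputes the percentages, assuming they are rounded to the nearest whole percent.

```python
AUX_TOKENS = 3752  # total AUX tokens reported on this page

# Number of AUX nodes per child-degree class (from the lines above).
degree_counts = {
    "leaves": 3666,
    "one child": 38,
    "two children": 11,
    "three or more children": 37,
}

# Every AUX node falls into exactly one class, so the counts must add up.
assert sum(degree_counts.values()) == AUX_TOKENS

degree_share = {
    label: round(100 * n / AUX_TOKENS) for label, n in degree_counts.items()
}
for label, n in degree_counts.items():
    print(f"{n} ({degree_share[label]}%) AUX nodes: {label}")
```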

Children of AUX nodes are attached using 19 different relations: punct (72; 32% instances), conj (28; 13% instances), nsubj (21; 9% instances), obj (21; 9% instances), obl (16; 7% instances), advmod (13; 6% instances), nmod (11; 5% instances), amod (9; 4% instances), cc (6; 3% instances), discourse (6; 3% instances), discourse:q (6; 3% instances), advcl (3; 1% instances), acl (2; 1% instances), advmod:emph (2; 1% instances), ccomp (2; 1% instances), cop (2; 1% instances), case (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances)

Children of AUX nodes belong to 15 different parts of speech: PUNCT (72; 32% instances), NOUN (50; 22% instances), VERB (31; 14% instances), ADJ (16; 7% instances), ADV (13; 6% instances), PRON (12; 5% instances), AUX (9; 4% instances), CCONJ (8; 4% instances), INTJ (4; 2% instances), PART (2; 1% instances), PROPN (2; 1% instances), ADP (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)