home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-BOUN: POS Tags: NOUN

There are 6291 NOUN lemmas (41%), 17008 NOUN types (46%) and 38351 NOUN tokens (31%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: yer, gün, insan, yıl, var, zaman, şey, el, iç, ara

The 10 most frequent NOUN types: var, yok, zaman, gün, şey, içinde, yıl, arasında, yer, üzerine

The 10 most frequent ambiguous lemmas: yer (NOUN 358, VERB 4, NUM 1, PROPN 1), gün (NOUN 339, ADV 8, PROPN 5), insan (NOUN 329, VERB 5, ADV 1, PRON 1, PROPN 1), yıl (NOUN 329, ADV 9, PROPN 3, VERB 2), var (NOUN 306, VERB 108, ADJ 5), zaman (NOUN 284, ADV 12, PROPN 1, VERB 1), şey (NOUN 282, ADV 4, PRON 1), el (NOUN 229, PROPN 6, ADJ 2, ADV 2, NUM 1), (NOUN 229, ADJ 51, VERB 37, PROPN 3, ADV 2, NUM 2, PRON 2, ADP 1), ara (NOUN 214, VERB 54, ADJ 35, ADV 11, ADP 1)

The 10 most frequent ambiguous types: var (NOUN 286, VERB 17, ADJ 5), yok (NOUN 176, ADV 8, ADJ 6), zaman (NOUN 169, ADV 10), gün (NOUN 164, ADV 2), içinde (NOUN 135, ADJ 22, ADV 2, ADP 1), yıl (NOUN 110, ADV 4), arasında (NOUN 113, ADJ 15, ADP 1), yer (NOUN 100, VERB 2), su (NOUN 77, ADV 1), insan (NOUN 82, VERB 1)

Morphology

The form / lemma ratio of NOUN is 2.703545 (the average of all parts of speech is 2.412899).

The 1st highest number of forms (41) was observed with the lemma “arkadaş”: Arkadaşlarımla, Arkadaşlarımın, Arkadaşlarımıza, Arkadaşınızın, arkadaş, arkadaşa, arkadaşla, arkadaşlar, arkadaşlarla, arkadaşları, arkadaşlarım, arkadaşlarıma, arkadaşlarımı, arkadaşlarımız, arkadaşlarımızdan, arkadaşlarımızı, arkadaşların, arkadaşlarına, arkadaşlarından, arkadaşlarınla, arkadaşlarını, arkadaşlarının, arkadaşlarınıza, arkadaşlarınızla, arkadaşlarınızı, arkadaşlarınızın, arkadaşlarıyla, arkadaşlıkları, arkadaştan, arkadaşı, arkadaşım, arkadaşımdan, arkadaşımla, arkadaşımın, arkadaşımız, arkadaşın, arkadaşına, arkadaşını, arkadaşının, arkadaşınız, arkadaşıyla.

The 2nd highest number of forms (38) was observed with the lemma “iş”: iş, işe, işi, işim, işimde, işimize, işimizi, işin, işinden, işine, işini, işinin, işiyle, işle, işler, işlerde, işleri, işlerimi, işlerimiz, işlerin, işlerinde, işlerine, işlerini, işlerinin, işleriyle, işlerle, işsizlik, işte, işten, iştir, İş, İşe, İşi, İşim, İşin, İşler, İşleri, İşte.

The 3rd highest number of forms (36) was observed with the lemma “göz”: GÖZLERLE, Gözlerimden, Gözlerin, Gözünüz, göz, gözde, gözden, göze, gözle, gözler, gözlerden, gözleri, gözlerim, gözlerimi, gözlerinde, gözlerinden, gözlerine, gözlerini, gözlerinin, gözleriniz, gözleriyle, gözü, gözüm, gözümde, gözüme, gözümün, gözümüzde, gözümüzü, gözümüzün, gözün, gözünde, gözünden, gözüne, gözünü, gözünün, gözüyle.

NOUN occurs with 8 features: Person (38302; 100% instances), Number (38301; 100% instances), Case (37858; 99% instances), Number[psor] (12927; 34% instances), Person[psor] (12927; 34% instances), Polarity (697; 2% instances), Abbr (13; 0% instances), PronType (2; 0% instances)

NOUN occurs with 22 feature-value pairs: Abbr=Yes, Case=Abl, Case=Acc, Case=Dat, Case=Equ, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos, PronType=Int

NOUN occurs with 193 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (12711 tokens). Examples: zaman, gün, şey, yıl, yer, insan, su, kabul, söz, kez

Relations

NOUN nodes are attached to their parents using 33 different relations: obl (9282; 24% instances), nmod:poss (6730; 18% instances), obj (5603; 15% instances), nsubj (5501; 14% instances), conj (3071; 8% instances), root (2716; 7% instances), nmod (1311; 3% instances), amod (1019; 3% instances), advcl (486; 1% instances), compound (450; 1% instances), acl (354; 1% instances), obl:tmod (334; 1% instances), ccomp (246; 1% instances), flat (232; 1% instances), iobj (166; 0% instances), compound:redup (147; 0% instances), nmod:part (105; 0% instances), csubj (96; 0% instances), parataxis (90; 0% instances), appos (70; 0% instances), vocative (70; 0% instances), discourse (69; 0% instances), xcomp (60; 0% instances), case (36; 0% instances), clf (29; 0% instances), nsubj:outer (19; 0% instances), orphan (17; 0% instances), list (14; 0% instances), dislocated (10; 0% instances), nummod (7; 0% instances), compound:lvc (6; 0% instances), dep (4; 0% instances), fixed (1; 0% instances)

Parents of NOUN nodes belong to 17 different parts of speech: VERB (18108; 47% instances), NOUN (14552; 38% instances), (2716; 7% instances), ADJ (1187; 3% instances), PROPN (818; 2% instances), ADV (302; 1% instances), PRON (259; 1% instances), NUM (183; 0% instances), ADP (71; 0% instances), DET (62; 0% instances), AUX (50; 0% instances), CCONJ (20; 0% instances), INTJ (11; 0% instances), PUNCT (8; 0% instances), PART (2; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

11852 (31%) NOUN nodes are leaves.

14478 (38%) NOUN nodes have one child.

6684 (17%) NOUN nodes have two children.

5337 (14%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 14.

Children of NOUN nodes are attached using 44 different relations: nmod:poss (9061; 18% instances), punct (6245; 13% instances), amod (5790; 12% instances), det (4467; 9% instances), conj (2927; 6% instances), acl (2792; 6% instances), obl (2196; 4% instances), nsubj (1920; 4% instances), compound (1510; 3% instances), cc (1347; 3% instances), case (1317; 3% instances), nummod (1305; 3% instances), nmod (1175; 2% instances), compound:lvc (1084; 2% instances), advmod (1018; 2% instances), obj (842; 2% instances), advmod:emph (753; 2% instances), cop (726; 1% instances), advcl (536; 1% instances), dep:der (441; 1% instances), aux (222; 0% instances), discourse (192; 0% instances), ccomp (157; 0% instances), flat (150; 0% instances), appos (139; 0% instances), csubj (131; 0% instances), compound:redup (122; 0% instances), parataxis (105; 0% instances), discourse:q (85; 0% instances), obl:tmod (80; 0% instances), cc:preconj (78; 0% instances), mark (55; 0% instances), nmod:part (32; 0% instances), list (29; 0% instances), clf (26; 0% instances), vocative (21; 0% instances), orphan (18; 0% instances), iobj (15; 0% instances), dislocated (5; 0% instances), nsubj:outer (4; 0% instances), xcomp (4; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), aux:q (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (14552; 30% instances), VERB (6975; 14% instances), PUNCT (6245; 13% instances), ADJ (5005; 10% instances), DET (4509; 9% instances), PROPN (2851; 6% instances), CCONJ (1680; 3% instances), NUM (1576; 3% instances), ADV (1290; 3% instances), PART (1122; 2% instances), ADP (1105; 2% instances), AUX (1056; 2% instances), PRON (1048; 2% instances), SCONJ (60; 0% instances), INTJ (51; 0% instances), X (2; 0% instances)