home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Penn: POS Tags: X

There are 172 X lemmas (1%), 196 X types (1%) and 323 X tokens (0%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: _, %15, %9, tefek, %8, %2, %5, %7.875, %1, %10

The 10 most frequent X types: %15, %9, tefek, %8, %2, %5, %7.875, de, %1, %10

The 10 most frequent ambiguous lemmas: %15 (X 9, NOUN 2), %9 (X 9, NOUN 2), %8 (X 7, NOUN 1), %5 (X 6, NOUN 1), %10 (X 5, NOUN 1), %12 (X 5, NOUN 1), %30 (X 4, NOUN 1), %20 (X 3, NOUN 1), %50 (X 3, NOUN 1), değiş (VERB 79, NOUN 23, ADV 18, ADJ 16, X 2)

The 10 most frequent ambiguous types: de (CCONJ 635, X 6, NOUN 2, ADV 1, VERB 1), arası (NOUN 20, X 3), ki (SCONJ 103, X 3, NOUN 1), çok (ADV 262, ADJ 193, DET 24, ADP 3, X 3, NOUN 1), da (CCONJ 642, X 2, ADV 1), den (X 2, VERB 1), değiş (X 2, NOUN 1), 3.4 (NUM 2, X 1), dışı (NOUN 42, X 1), nın (NOUN 1, X 1)

Morphology

The form / lemma ratio of X is 1.139535 (the average of all parts of speech is 2.012465).

The 1st highest number of forms (25) was observed with the lemma “_”: ‘daki, ‘de, ‘e, ‘nin, ‘ye, ‘yu, ‘ın, .’e, .’un, 3.4, arası, da, dan, de, den, dışı, ki, nun, nın, s, sonu, sunu, un, çok, çoğu.

The 2nd highest number of forms (1) was observed with the lemma “%0.2”: %0.2.

The 3rd highest number of forms (1) was observed with the lemma “%0.25”: %0.25.

X does not occur with any features.

Relations

X nodes are attached to their parents using 13 different relations: amod (91; 28% instances), nmod (70; 22% instances), obl (62; 19% instances), goeswith (46; 14% instances), compound (12; 4% instances), conj (12; 4% instances), obj (7; 2% instances), nsubj (6; 2% instances), list (5; 2% instances), fixed (4; 1% instances), root (4; 1% instances), appos (3; 1% instances), advcl (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: NOUN (163; 50% instances), VERB (49; 15% instances), PROPN (25; 8% instances), ADV (23; 7% instances), ADJ (20; 6% instances), NUM (19; 6% instances), X (17; 5% instances), (4; 1% instances), DET (3; 1% instances)

243 (75%) X nodes are leaves.

62 (19%) X nodes have one child.

12 (4%) X nodes have two children.

6 (2%) X nodes have three or more children.

The highest child degree of a X node is 5.

Children of X nodes are attached using 16 different relations: conj (15; 14% instances), punct (15; 14% instances), nmod (14; 13% instances), amod (12; 11% instances), cc (11; 10% instances), compound (8; 7% instances), advmod (7; 7% instances), obj (6; 6% instances), case (5; 5% instances), list (5; 5% instances), obl (3; 3% instances), nsubj (2; 2% instances), advcl (1; 1% instances), det (1; 1% instances), fixed (1; 1% instances), flat (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: NOUN (22; 21% instances), ADJ (18; 17% instances), X (17; 16% instances), PUNCT (15; 14% instances), CCONJ (11; 10% instances), PROPN (8; 7% instances), ADV (6; 6% instances), ADP (4; 4% instances), NUM (4; 4% instances), DET (1; 1% instances), VERB (1; 1% instances)