Treebank Statistics: UD_Turkish-Penn: POS Tags: X
There are 172 X
lemmas (1%), 196 X
types (1%) and 323 X
tokens (0%).
Out of 15 observed tags, the rank of X
is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.
The 10 most frequent X
lemmas: _, %15, %9, tefek, %8, %2, %5, %7.875, %1, %10
The 10 most frequent X
types: %15, %9, tefek, %8, %2, %5, %7.875, de, %1, %10
The 10 most frequent ambiguous lemmas: %15 (X 9, NOUN 2), %9 (X 9, NOUN 2), %8 (X 7, NOUN 1), %5 (X 6, NOUN 1), %10 (X 5, NOUN 1), %12 (X 5, NOUN 1), %30 (X 4, NOUN 1), %20 (X 3, NOUN 1), %50 (X 3, NOUN 1), değiş (VERB 79, NOUN 23, ADV 18, ADJ 16, X 2)
The 10 most frequent ambiguous types: de (CCONJ 635, X 6, NOUN 2, ADV 1, VERB 1), arası (NOUN 20, X 3), ki (SCONJ 103, X 3, NOUN 1), çok (ADV 262, ADJ 193, DET 24, ADP 3, X 3, NOUN 1), da (CCONJ 642, X 2, ADV 1), den (X 2, VERB 1), değiş (X 2, NOUN 1), 3.4 (NUM 2, X 1), dışı (NOUN 42, X 1), nın (NOUN 1, X 1)
- de
- arası
- ki
- çok
- ADV 262: Cuma günü bunların üzerine düşünüldüğünde çok geç kalmış olunacaktı .
- ADJ 193: Bu piyasa çok kötü hasar aldı .
- DET 24: Ama pek çok marka alan için yarışırken , artık durum böyle değil .
- ADP 3: Ama cihaz ışıklandırmaktan çok karartıyor .
- X 3: Bir çok çalışan dün evde kaldı , ama müşteri hizmeti sürdürülmeye devam edildi .
- NOUN 1: Maidenform çok karlı olduğunu ama özelliklerin temininin reddedildiğini söylüyor .
- da
- den
- değiş
- 3.4
- dışı
- nın
Morphology
The form / lemma ratio of X
is 1.139535 (the average of all parts of speech is 2.012465).
The 1st highest number of forms (25) was observed with the lemma “_”: ‘daki, ‘de, ‘e, ‘nin, ‘ye, ‘yu, ‘ın, .’e, .’un, 3.4, arası, da, dan, de, den, dışı, ki, nun, nın, s, sonu, sunu, un, çok, çoğu.
The 2nd highest number of forms (1) was observed with the lemma “%0.2”: %0.2.
The 3rd highest number of forms (1) was observed with the lemma “%0.25”: %0.25.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 13 different relations: amod (91; 28% instances), nmod (70; 22% instances), obl (62; 19% instances), goeswith (46; 14% instances), compound (12; 4% instances), conj (12; 4% instances), obj (7; 2% instances), nsubj (6; 2% instances), list (5; 2% instances), fixed (4; 1% instances), root (4; 1% instances), appos (3; 1% instances), advcl (1; 0% instances)
Parents of X
nodes belong to 9 different parts of speech: NOUN (163; 50% instances), VERB (49; 15% instances), PROPN (25; 8% instances), ADV (23; 7% instances), ADJ (20; 6% instances), NUM (19; 6% instances), X (17; 5% instances), (4; 1% instances), DET (3; 1% instances)
243 (75%) X
nodes are leaves.
62 (19%) X
nodes have one child.
12 (4%) X
nodes have two children.
6 (2%) X
nodes have three or more children.
The highest child degree of a X
node is 5.
Children of X
nodes are attached using 16 different relations: conj (15; 14% instances), punct (15; 14% instances), nmod (14; 13% instances), amod (12; 11% instances), cc (11; 10% instances), compound (8; 7% instances), advmod (7; 7% instances), obj (6; 6% instances), case (5; 5% instances), list (5; 5% instances), obl (3; 3% instances), nsubj (2; 2% instances), advcl (1; 1% instances), det (1; 1% instances), fixed (1; 1% instances), flat (1; 1% instances)
Children of X
nodes belong to 11 different parts of speech: NOUN (22; 21% instances), ADJ (18; 17% instances), X (17; 16% instances), PUNCT (15; 14% instances), CCONJ (11; 10% instances), PROPN (8; 7% instances), ADV (6; 6% instances), ADP (4; 4% instances), NUM (4; 4% instances), DET (1; 1% instances), VERB (1; 1% instances)