home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-PUD: POS Tags: PROPN

There are 1005 PROPN lemmas (20%), 1167 PROPN types (16%) and 1525 PROPN tokens (9%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: _, Çin, Amerika, Avrupa, İngiliz, Trump, Akdeniz, Avustralya, Roma, Abd

The 10 most frequent PROPN types: İngiliz, Çin, Alman, Amerika, ABD, Hong, Trump, Akdeniz, Avrupa, Avustralya

The 10 most frequent ambiguous lemmas: _ (ADJ 133, NOUN 83, AUX 68, PUNCT 62, NUM 30, PROPN 26, VERB 19, ADV 15, ADP 7, PRON 5, X 5, SYM 4, DET 1), amerikan (PROPN 5, NOUN 2), de (ADV 61, VERB 29, PROPN 5, ADJ 1, NOUN 1, X 1), Amerikalı (PROPN 3, ADJ 2), Çinli (PROPN 3, ADJ 1), and (PROPN 2, CCONJ 1, NOUN 1), el (NOUN 21, PROPN 2), san (PROPN 2, VERB 1), . (PUNCT 988, PROPN 1), America (PROPN 1, X 1)

The 10 most frequent ambiguous types: Amerikan (PROPN 5, NOUN 2), de (ADV 61, PROPN 4, X 1), Amerikalı (ADJ 2, PROPN 2), and (CCONJ 1, NOUN 1, PROPN 1), America (PROPN 1, X 1), Birleşik (ADJ 7, PROPN 1), Devletleri (NOUN 2, PROPN 1), Kral (NOUN 4, PROPN 1), Post (NOUN 1, PROPN 1), Savaşı’nda (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.161194 (the average of all parts of speech is 1.517471).

The 1st highest number of forms (22) was observed with the lemma “_”: Alice, B., B.C., Bellingham, Gaunt’lı, Halifax, Hindistan, I., Knightley’in, Maldivler, Marat, Milice, Mısırlı, Sade’de, St., Ya, Z., a, al-Jardaan, Ötzi, İspanyolca, İspanyolcaya.

The 2nd highest number of forms (5) was observed with the lemma “İngiltere”: İngiltere, İngiltere’de, İngiltere’den, İngiltere’ye, İngiltere’yi.

The 3rd highest number of forms (4) was observed with the lemma “Almanya”: Almanya, Almanya’da, Almanya’nın, Almanya’yı.

PROPN occurs with 5 features: Number (1500; 98% instances), Case (697; 46% instances), Number[psor] (14; 1% instances), Person[psor] (14; 1% instances), Person (1; 0% instances)

PROPN occurs with 12 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Sing, Person=3, Person[psor]=3

PROPN occurs with 26 feature combinations. The most frequent feature combination is Number=Sing (789 tokens). Examples: İngiliz, Çin, Alman, Amerika, ABD, Hong, Akdeniz, Avrupa, Fransız, Hint

Relations

PROPN nodes are attached to their parents using 17 different relations: nmod:poss (435; 29% instances), flat (284; 19% instances), nsubj (281; 18% instances), obl (113; 7% instances), conj (90; 6% instances), appos (89; 6% instances), amod (79; 5% instances), obj (61; 4% instances), nmod (38; 2% instances), iobj (19; 1% instances), compound (17; 1% instances), advcl (7; 0% instances), root (7; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), ccomp (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: NOUN (759; 50% instances), PROPN (332; 22% instances), VERB (295; 19% instances), ADJ (86; 6% instances), X (18; 1% instances), ADV (17; 1% instances), PRON (7; 0% instances), (7; 0% instances), NUM (4; 0% instances)

868 (57%) PROPN nodes are leaves.

402 (26%) PROPN nodes have one child.

180 (12%) PROPN nodes have two children.

75 (5%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 23 different relations: flat (299; 29% instances), punct (225; 22% instances), case (99; 10% instances), conj (90; 9% instances), cc (69; 7% instances), acl (56; 5% instances), amod (52; 5% instances), appos (27; 3% instances), compound (20; 2% instances), cop (17; 2% instances), advmod:emph (10; 1% instances), nmod (10; 1% instances), nmod:poss (8; 1% instances), nsubj (8; 1% instances), det (7; 1% instances), advmod (6; 1% instances), obl (5; 0% instances), nummod (4; 0% instances), cc:preconj (2; 0% instances), orphan (2; 0% instances), aux (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of PROPN nodes belong to 13 different parts of speech: PROPN (332; 33% instances), PUNCT (225; 22% instances), NOUN (134; 13% instances), ADP (99; 10% instances), CCONJ (71; 7% instances), ADJ (67; 7% instances), NUM (20; 2% instances), AUX (19; 2% instances), X (19; 2% instances), ADV (17; 2% instances), DET (7; 1% instances), VERB (6; 1% instances), PRON (3; 0% instances)