Treebank Statistics: UD_Tatar-NMCTT: POS Tags: PROPN
There are 83 PROPN
lemmas (9%), 96 PROPN
types (8%) and 167 PROPN
tokens (7%).
Out of 14 observed tags, the rank of PROPN
is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent PROPN
lemmas: Татарстан, Кама, Казан, Төмән, Марат, Россия, Чиләбе, рамил, Әхмәтов, Василий
The 10 most frequent PROPN
types: Татарстан, Кама, Төмән, Марат, Татарстанның, Казан, Рамил, Татарстанда, Чиләбе, Әхмәтов
The 10 most frequent ambiguous lemmas: Россия (PROPN 4, NOUN 1, PRON 1), Курск (PROPN 2, NOUN 1), татар (NOUN 11, PROPN 1)
The 10 most frequent ambiguous types: Россия (PROPN 3, PRON 1), Татар (NOUN 3, PROPN 1)
- Россия
- Татар
Morphology
The form / lemma ratio of PROPN
is 1.156627 (the average of all parts of speech is 1.414579).
The 1st highest number of forms (3) was observed with the lemma “Казан”: Казан, Казанда, Казанның.
The 2nd highest number of forms (3) was observed with the lemma “Кама”: Кама, Камага, Каманың.
The 3rd highest number of forms (3) was observed with the lemma “Кырым”: Кырым, Кырымда, Кырымнан.
PROPN
occurs with 3 features: Case (165; 99% instances), Number (165; 99% instances), Person[psor] (2; 1% instances)
PROPN
occurs with 7 feature-value pairs: Case=Abl
, Case=Dat
, Case=Gen
, Case=Loc
, Case=Nom
, Number=Sing
, Person[psor]=3
PROPN
occurs with 8 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing
(137 tokens).
Examples: Татарстан, Кама, Төмән, Марат, Казан, Рамил, Чиләбе, Әхмәтов, Василий, Россия
Relations
PROPN
nodes are attached to their parents using 7 different relations: nmod (75; 45% instances), flat (43; 26% instances), appos (18; 11% instances), obl (11; 7% instances), nsubj (10; 6% instances), compound (7; 4% instances), conj (3; 2% instances)
Parents of PROPN
nodes belong to 4 different parts of speech: NOUN (107; 64% instances), PROPN (33; 20% instances), VERB (20; 12% instances), ADJ (7; 4% instances)
117 (70%) PROPN
nodes are leaves.
40 (24%) PROPN
nodes have one child.
7 (4%) PROPN
nodes have two children.
3 (2%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 5.
Children of PROPN
nodes are attached using 10 different relations: flat (31; 48% instances), punct (14; 22% instances), amod (6; 9% instances), conj (6; 9% instances), compound (3; 5% instances), acl (1; 2% instances), case (1; 2% instances), ccomp (1; 2% instances), nmod (1; 2% instances), parataxis (1; 2% instances)
Children of PROPN
nodes belong to 6 different parts of speech: PROPN (33; 51% instances), PUNCT (14; 22% instances), ADJ (10; 15% instances), VERB (4; 6% instances), NOUN (3; 5% instances), ADP (1; 2% instances)