Treebank Statistics: UD_Russian-PUD: POS Tags: PROPN
There are 832 PROPN lemmas (16%), 969 PROPN types (12%) and 1209 PROPN tokens (6%).
Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.
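Counts of this kind can be derived from a CoNLL-U file by collecting the distinct lemmas and distinct word forms of every token carrying the tag. The following is a minimal sketch, assuming that a "type" is a distinct word form; the `SAMPLE` fragment and the helper names `read_tokens` and `pos_counts` are hypothetical and are not taken from the treebank or its tooling.

```python
# Tiny hand-made CoNLL-U fragment (illustrative only, not treebank data).
SAMPLE = """\
1\tТрамп\tТрамп\tPROPN\t_\tAnimacy=Anim|Case=Nom|Gender=Masc|Number=Sing\t2\tnsubj\t_\t_
2\tпосетил\tпосетить\tVERB\t_\t_\t0\troot\t_\t_
3\tКитай\tКитай\tPROPN\t_\tAnimacy=Inan|Case=Acc|Gender=Masc|Number=Sing\t2\tobj\t_\t_
4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

1\tВизит\tвизит\tNOUN\t_\t_\t0\troot\t_\t_
2\tТрампа\tТрамп\tPROPN\t_\tAnimacy=Anim|Case=Gen|Gender=Masc|Number=Sing\t1\tnmod\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(text, upos):
    """Return (number of lemmas, number of types, number of tokens) for a tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "PROPN"))
```

On the sample, the two forms Трамп and Трампа share one lemma, so the type count exceeds the lemma count.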
The 10 most frequent PROPN lemmas: США, Китай, Америка, Трамп, Великобритания, Европа, Австралия, Гонконг, Италия, Франция
The 10 most frequent PROPN types: США, Великобритании, Америки, Италии, Китай, Клинтон, Австралии, Европы, Онтарио, BBC
The 10 most frequent ambiguous lemmas: де (PART 3, PROPN 1)
The 10 most frequent ambiguous types: По (ADP 15, PROPN 1), Сад (NOUN 1, PROPN 1)
- По
  - ADP 15: По линии бабушки Мисима был прямым потомком Токугавы Иэясу .
  - PROPN 1: Основные европейские реки , такие как Рейн , Рона , Инн , Тичино и По , текут из Швейцарии , все они имеют истоки в Альпах , протекают через соседние страны и впадают в Северное , Средиземное , Адриатическое и Черное моря .
- Сад
Morphology
The form / lemma ratio of PROPN is 1.164663 (the average of all parts of speech is 1.496727).
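The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of the page. A small arithmetic check, assuming that this is how the figure is obtained (the function name `form_lemma_ratio` is hypothetical):

```python
# Counts quoted earlier on this page.
PROPN_LEMMAS = 832
PROPN_TYPES = 969


def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


ratio = form_lemma_ratio(PROPN_TYPES, PROPN_LEMMAS)
print(f"{ratio:.6f}")  # prints 1.164663
```

A value close to 1 means that most proper nouns appear in a single inflected form, which is lower than the average across all parts of speech in this treebank.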
The 1st highest number of forms (5) was observed with the lemma “Трамп”: Трамп, Трампа, Трампе, Трампом, Трампу.
The 2nd highest number of forms (4) was observed with the lemma “Великобритания”: Великобританией, Великобритании, Великобританию, Великобритания.
The 3rd highest number of forms (4) was observed with the lemma “Гонконг”: Гонконг, Гонконга, Гонконге, Гонконгу.
PROPN occurs with 6 features: Animacy (1101; 91% instances), Case (1101; 91% instances), Gender (1101; 91% instances), Number (1101; 91% instances), Foreign (108; 9% instances), Abbr (53; 4% instances)
PROPN occurs with 15 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
PROPN occurs with 52 feature combinations.
The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing (254 tokens).
Examples: Джон, Джордж, Мисима, Рафферти, Сигал, Трамп, Уинстон, Шэнь, Август, Анайя
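The three levels of feature statistics above (feature names, feature-value pairs, and full combinations) can be tallied from the FEATS column of a CoNLL-U file. The sketch below assumes the FEATS values for the PROPN tokens have already been extracted; the list `PROPN_FEATS` is hand-made for illustration and the helper `feature_stats` is hypothetical.

```python
from collections import Counter

# Hand-made FEATS values for a few PROPN tokens (illustrative, not treebank data).
PROPN_FEATS = [
    "Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing",
    "Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing",
    "Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing",
    "Foreign=Yes",
    "Abbr=Yes|Animacy=Inan|Case=Nom|Gender=Neut|Number=Plur",
]


def feature_stats(feats_list):
    """Count feature names, feature-value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_list:
        combos[feats] += 1
        # "_" marks a token without any features in CoNLL-U.
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


names, pairs, combos = feature_stats(PROPN_FEATS)
total = len(PROPN_FEATS)
for name, count in names.most_common():
    # Same layout as the listing above: count, then share of all instances.
    print(f"{name} ({count}; {round(100 * count / total)}% instances)")
print("Most frequent combination:", combos.most_common(1)[0])
```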
Relations
PROPN nodes are attached to their parents using 18 different relations: nmod (379; 31% instances), nsubj (214; 18% instances), flat:name (200; 17% instances), obl (128; 11% instances), conj (72; 6% instances), obj (47; 4% instances), appos (40; 3% instances), flat (37; 3% instances), flat:foreign (35; 3% instances), iobj (27; 2% instances), nsubj:pass (20; 2% instances), parataxis (3; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), obl:agent (1; 0% instances), orphan (1; 0% instances)
Parents of PROPN nodes belong to 12 different parts of speech: NOUN (542; 45% instances), VERB (402; 33% instances), PROPN (222; 18% instances), ADJ (14; 1% instances), AUX (6; 0% instances), X (6; 0% instances), DET (5; 0% instances), ADP (4; 0% instances), ADV (4; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)
641 (53%) PROPN nodes are leaves.
358 (30%) PROPN nodes have one child.
137 (11%) PROPN nodes have two children.
73 (6%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 7.
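The relation and child-degree figures above come from the HEAD and DEPREL columns of the dependency trees. As a rough sketch of how such a tally might work, the example below uses one hand-made sentence stored as tuples; the sentence, its annotation, and the helpers `degree_histogram` and `parent_relations` are illustrative assumptions rather than treebank data or the actual statistics script.

```python
from collections import Counter

# One hand-made sentence as (id, form, upos, head, deprel); illustrative only.
SENTENCE = [
    (1, "В", "ADP", 2, "case"),
    (2, "Италии", "PROPN", 5, "obl"),
    (3, "и", "CCONJ", 4, "cc"),
    (4, "Франции", "PROPN", 2, "conj"),
    (5, "живут", "VERB", 0, "root"),
    (6, "люди", "NOUN", 5, "nsubj"),
    (7, ".", "PUNCT", 5, "punct"),
]


def degree_histogram(sentence, upos):
    """Count nodes tagged `upos` by number of children (key 3 means 3 or more)."""
    # Number of children per node id, derived from the HEAD column.
    children = Counter(head for _id, _form, _tag, head, _rel in sentence)
    buckets = Counter()
    for node_id, _form, tag, _head, _rel in sentence:
        if tag == upos:
            buckets[min(children[node_id], 3)] += 1
    return buckets


def parent_relations(sentence, upos):
    """Count the relations that attach nodes tagged `upos` to their parents."""
    return Counter(rel for _id, _form, tag, _head, rel in sentence if tag == upos)


print(degree_histogram(SENTENCE, "PROPN"))
print(parent_relations(SENTENCE, "PROPN"))
```

In the sample, Италии governs a preposition and a conjunct (two children), while Франции governs only the conjunction (one child).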
Children of PROPN nodes are attached using 25 different relations: case (239; 27% instances), punct (164; 18% instances), flat:name (117; 13% instances), conj (83; 9% instances), amod (70; 8% instances), cc (59; 7% instances), appos (40; 4% instances), flat:foreign (32; 4% instances), acl:relcl (15; 2% instances), nmod (14; 2% instances), acl (11; 1% instances), flat (10; 1% instances), advmod (9; 1% instances), det (6; 1% instances), parataxis (6; 1% instances), nsubj (4; 0% instances), cop (3; 0% instances), mark (3; 0% instances), orphan (3; 0% instances), nummod (2; 0% instances), advcl (1; 0% instances), nummod:gov (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), xcomp (1; 0% instances)
Children of PROPN nodes belong to 15 different parts of speech: ADP (227; 25% instances), PROPN (222; 25% instances), PUNCT (164; 18% instances), ADJ (86; 10% instances), NOUN (61; 7% instances), CCONJ (58; 6% instances), VERB (23; 3% instances), SCONJ (16; 2% instances), ADV (14; 2% instances), DET (6; 1% instances), NUM (6; 1% instances), PART (4; 0% instances), X (4; 0% instances), AUX (3; 0% instances), PRON (1; 0% instances)