Treebank Statistics: UD_Russian-Poetry: POS Tags: PROPN
There are 376 PROPN
lemmas (4%), 448 PROPN
types (2%) and 588 PROPN
tokens (1%).
Out of 17 observed tags, the rank of PROPN
is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
The 10 most frequent PROPN
lemmas: А, Москва, Волга, Русь, Воронский, В, Россия, Анжелина, Иуда, Майков
The 10 most frequent PROPN
types: А., В., Воронский, Волге, Анжелина, Москва, Москве, Н., Русь, Восток
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: Восток (PROPN 4, NOUN 1), Земля (NOUN 8, PROPN 2), Лакримоза (NOUN 4, PROPN 2), Г. (NOUN 1, PROPN 1), Дукат (NOUN 1, PROPN 1), Кастусь (NOUN 1, PROPN 1), Лира (NOUN 1, PROPN 1), Луна (NOUN 6, PROPN 1), Севера (NOUN 1, PROPN 1), Холодной (ADJ 1, PROPN 1)
- Восток
- Земля
- Лакримоза
- Г.
- Дукат
- Кастусь
- NOUN 1: Чтоб , их повторяя потом наизусть , Я б душу обрадовал в ней поселеньем Огня , на котором горел Кастусь По каплям , по дням , по селеньям .
- PROPN 1: Кастусь , ты со мною , ты сызнова здесь С упреком ( что может быть горше ? ) , Ты здесь – и я смешан , как верст этих смесь , Как смешанный говор под Оршей .
- Лира
- Луна
- Севера
- Холодной
Morphology
The form / lemma ratio of PROPN
is 1.191489 (the average of all parts of speech is 1.831021).
The 1st highest number of forms (7) was observed with the lemma “Москва”: МОСКВА, МОСКВЕ, МОСКВОЙ, Москва, Москве, Москвою, Москвы.
The 2nd highest number of forms (3) was observed with the lemma “Берлин”: Берлин, Берлине, Берлином.
The 3rd highest number of forms (3) was observed with the lemma “Волга”: Волга, Волге, Волгой.
PROPN
occurs with 7 features: NameType (587; 100% instances), Animacy (529; 90% instances), Case (529; 90% instances), Gender (529; 90% instances), Number (529; 90% instances), Abbr (57; 10% instances), InflClass (42; 7% instances)
PROPN
occurs with 23 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, InflClass=Ind
, NameType=Com
, NameType=Geo
, NameType=Giv
, NameType=Pat
, NameType=Pro
, NameType=Prs
, NameType=Sur
, NameType=Zoo
, Number=Plur
, Number=Sing
PROPN
occurs with 80 feature combinations.
The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Number=Sing
(60 tokens).
Examples: Воронский, Гинсбург, Гитлер, Ленин, Майков, Пустота, Сперанский, Аверроэс, Аткинс, БЕЛЫЙ
Relations
PROPN
nodes are attached to their parents using 16 different relations: nmod (104; 18% instances), nsubj (96; 16% instances), flat:name (86; 15% instances), root (69; 12% instances), obl (61; 10% instances), vocative (40; 7% instances), conj (36; 6% instances), appos (35; 6% instances), obj (26; 4% instances), iobj (19; 3% instances), parataxis (6; 1% instances), xcomp (4; 1% instances), advcl (2; 0% instances), nsubj:pass (2; 0% instances), obl:agent (1; 0% instances), orphan (1; 0% instances)
Parents of PROPN
nodes belong to 10 different parts of speech: VERB (210; 36% instances), NOUN (159; 27% instances), PROPN (112; 19% instances), (69; 12% instances), ADJ (13; 2% instances), ADV (10; 2% instances), PRON (10; 2% instances), DET (3; 1% instances), NUM (1; 0% instances), PART (1; 0% instances)
264 (45%) PROPN
nodes are leaves.
189 (32%) PROPN
nodes have one child.
92 (16%) PROPN
nodes have two children.
43 (7%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 7.
Children of PROPN
nodes are attached using 25 different relations: punct (160; 30% instances), case (97; 18% instances), flat:name (85; 16% instances), amod (65; 12% instances), conj (40; 7% instances), cc (16; 3% instances), det (16; 3% instances), advmod (11; 2% instances), appos (6; 1% instances), nmod (6; 1% instances), nsubj (6; 1% instances), acl (4; 1% instances), acl:relcl (4; 1% instances), orphan (4; 1% instances), parataxis (4; 1% instances), discourse (3; 1% instances), cop (2; 0% instances), iobj (2; 0% instances), parataxis:discourse (2; 0% instances), advcl (1; 0% instances), dislocated (1; 0% instances), flat (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances), obl (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PUNCT (160; 30% instances), PROPN (112; 21% instances), ADP (92; 17% instances), ADJ (61; 11% instances), NOUN (34; 6% instances), DET (18; 3% instances), CCONJ (16; 3% instances), VERB (13; 2% instances), PART (9; 2% instances), SCONJ (6; 1% instances), ADV (5; 1% instances), PRON (5; 1% instances), INTJ (4; 1% instances), AUX (2; 0% instances), NUM (2; 0% instances)