home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: PROPN

There are 2544 PROPN lemmas (32%), 4248 PROPN types (21%) and 7694 PROPN tokens (7%). Out of 17 observed tags, the rank of PROPN is: 1 in number of lemmas, 2 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Полоцкъ, Иванъ, Рига, Александръ, Станиславъ, Глебовичъ, Ивашко, Андрей, Ганусъ, Сопега

The 10 most frequent PROPN types: Полоцку, Ивана, Ризѣ, Иван, Полоцкꙋ, Александръ, Сопега, иванъ, Ризе, Станиславъ

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: Иванова (PROPN 3, ADJ 2), Беломꙋ (ADJ 2, PROPN 1), Белыи (ADJ 1, PROPN 1), Болваницкого (ADJ 1, PROPN 1), Брумарева (ADJ 1, PROPN 1), Василева (ADJ 1, PROPN 1), Вл(а)д(ы)ка (NOUN 1, PROPN 1), Дьяку (NOUN 1, PROPN 1), Иванову (ADJ 1, PROPN 1), Ивановых (ADJ 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.669811 (the average of all parts of speech is 2.589846).

The 1st highest number of forms (40) was observed with the lemma “Полоцкъ”: […]цка, Пол(о)цка, Пол(о)цкоу, Пол(о)цку, Пол(о)цкꙋ, Полотеско, Полотескъ, Полотско, Полотску, Полотскꙋ, Полотсце, Полотськѣ, Полотъску, Полотьска, Полотьсце, Полоц(ку), Полоцка, Полоцком, Полоцкомъ, Полоцкоу, Полоцку, Полоцкъ, Полоцкꙋ, Полоцъку, Полоцъкꙋ, Полоцька, Полоцьку, Полоцькꙋ, Полочку, Полочьк(у), Полочьку, Полочьска, Полочьсце, Полтеск, Полтескъ, Полтѣскъ, Полътескъ, Польтескъ, Польтескь, полотскѹ.

The 2nd highest number of forms (19) was observed with the lemma “Иванъ”: Івана, Іванъ, Ив(а)н, Ив(ан), Иван, Иван(а), Ивана, Иване, Иваном, Иваномъ, Иваноу, Ивану, Ивань, Иванꙋ, Ывана, Ываном, Ываномъ, ива(н̑), иванъ.

The 3rd highest number of forms (18) was observed with the lemma “Судимонтовичъ”: Соудимонтович, Соудимонтовича, Соудимонтовиччя, Суди(монтович), Судимонтович, Судимонтович(а), Судимонтовича, Судимонтовичу, Судимонтовичя, Судимонътовича, Судимонътовичу, Судимонътовичъ, Сꙋдимонт(о)вич(а), Сꙋдимонтовича, Сꙋдимонтовичу, Сꙋдимонътович, Сꙋдимонътовича, Сꙋдимонътовичу.

PROPN occurs with 7 features: Case (7693; 100% instances), Gender (7693; 100% instances), NameType (7693; 100% instances), Number (7693; 100% instances), Animacy (514; 7% instances), Typo (2; 0% instances), Abbr (1; 0% instances)

PROPN occurs with 22 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NameType=Geo, NameType=Giv, NameType=Hus, NameType=Oth, NameType=Pat, NameType=Prs, NameType=Sur, Number=Plur, Number=Sing, Typo=Yes

PROPN occurs with 121 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|NameType=Giv|Number=Sing (1615 tokens). Examples: Иван, Александръ, иванъ, Станиславъ, Жикгимонт, Ивашко, Кгастовтъ, Кушлеико, Логвинъ, Станислав

Relations

PROPN nodes are attached to their parents using 22 different relations: appos (2131; 28% instances), flat:name (1762; 23% instances), conj (1262; 16% instances), obl (769; 10% instances), nmod (626; 8% instances), root (307; 4% instances), nsubj (298; 4% instances), orphan (255; 3% instances), iobj (139; 2% instances), obj (79; 1% instances), parataxis (29; 0% instances), acl:relcl (8; 0% instances), compound (7; 0% instances), advcl (6; 0% instances), nsubj:pass (4; 0% instances), vocative (3; 0% instances), xcomp (3; 0% instances), reparandum (2; 0% instances), amod (1; 0% instances), dep (1; 0% instances), list (1; 0% instances), obl:agent (1; 0% instances)

Parents of PROPN nodes belong to 11 different parts of speech: NOUN (3063; 40% instances), PROPN (2869; 37% instances), VERB (1240; 16% instances), (307; 4% instances), PRON (113; 1% instances), ADJ (71; 1% instances), ADV (18; 0% instances), DET (7; 0% instances), NUM (4; 0% instances), PART (1; 0% instances), X (1; 0% instances)

3087 (40%) PROPN nodes are leaves.

2494 (32%) PROPN nodes have one child.

1085 (14%) PROPN nodes have two children.

1028 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 26.

Children of PROPN nodes are attached using 30 different relations: flat:name (1759; 20% instances), case (1733; 19% instances), punct (1656; 19% instances), conj (1349; 15% instances), cc (973; 11% instances), appos (460; 5% instances), nmod (230; 3% instances), amod (200; 2% instances), det (180; 2% instances), orphan (126; 1% instances), nsubj (53; 1% instances), acl:relcl (37; 0% instances), advmod (37; 0% instances), obl (24; 0% instances), parataxis (16; 0% instances), acl (14; 0% instances), cop (12; 0% instances), advcl (6; 0% instances), list (4; 0% instances), mark (4; 0% instances), dep (3; 0% instances), iobj (3; 0% instances), nummod (3; 0% instances), compound (2; 0% instances), obj (2; 0% instances), reparandum (2; 0% instances), discourse (1; 0% instances), expl (1; 0% instances), obl:tmod (1; 0% instances), parataxis:discourse (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (2869; 32% instances), ADP (1731; 19% instances), PUNCT (1656; 19% instances), NOUN (1093; 12% instances), CCONJ (971; 11% instances), ADJ (220; 2% instances), DET (190; 2% instances), VERB (59; 1% instances), PART (29; 0% instances), PRON (29; 0% instances), ADV (15; 0% instances), AUX (12; 0% instances), NUM (7; 0% instances), SCONJ (7; 0% instances), SYM (4; 0% instances)