home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Veps-VWT: POS Tags: PROPN

There are 21 PROPN lemmas (5%), 28 PROPN types (5%) and 45 PROPN tokens (3%). Out of 13 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Kalag’, Kaskez, Natalja, Päžar, Silakova, Jevgenjevna, Piter, Päžar’, vepsän, Änižjärv

The 10 most frequent PROPN types: Kalag’, Natalja, Päžarvehe, Kaskez, Kaskezaspäi, Piterin, Silakova, Vepsän, Änižjärven, Himjogi

The 10 most frequent ambiguous lemmas: vepsän (ADJ 18, PROPN 2)

The 10 most frequent ambiguous types: Vepsän (ADJ 3, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.333333 (the average of all parts of speech is 1.526854).

The 1st highest number of forms (4) was observed with the lemma “Kaskez”: Kaskez, Kaskezaha, Kaskezas, Kaskezaspäi.

The 2nd highest number of forms (2) was observed with the lemma “Jevgenjevna”: Jevgenjevna, Jevgenjevnad.

The 3rd highest number of forms (2) was observed with the lemma “Kalag’”: Kalag’, Kalages.

PROPN occurs with 2 features: Case (45; 100% instances), Number (45; 100% instances)

PROPN occurs with 11 feature-value pairs: Case=Abl, Case=Ade, Case=Com, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Ter, Number=Sing

PROPN occurs with 10 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (18 tokens). Examples: Kalag’, Natalja, Kaskez, Silakova, Himjogi, Jevgenjevna, Kalarand, Päžar’

Relations

PROPN nodes are attached to their parents using 9 different relations: nmod (20; 44% instances), obl (10; 22% instances), flat (5; 11% instances), nsubj (3; 7% instances), conj (2; 4% instances), root (2; 4% instances), appos (1; 2% instances), nsubj:cop (1; 2% instances), obj (1; 2% instances)

Parents of PROPN nodes belong to 5 different parts of speech: NOUN (22; 49% instances), VERB (13; 29% instances), PROPN (7; 16% instances), (2; 4% instances), PRON (1; 2% instances)

27 (60%) PROPN nodes are leaves.

15 (33%) PROPN nodes have one child.

1 (2%) PROPN nodes have two children.

2 (4%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 12 different relations: punct (6; 22% instances), flat (5; 19% instances), aux (2; 7% instances), cc (2; 7% instances), conj (2; 7% instances), cop (2; 7% instances), nmod (2; 7% instances), nsubj:cop (2; 7% instances), advcl (1; 4% instances), advmod (1; 4% instances), amod (1; 4% instances), obl (1; 4% instances)

Children of PROPN nodes belong to 9 different parts of speech: PROPN (7; 26% instances), PUNCT (6; 22% instances), AUX (4; 15% instances), NOUN (4; 15% instances), CCONJ (2; 7% instances), ADJ (1; 4% instances), ADV (1; 4% instances), PRON (1; 4% instances), VERB (1; 4% instances)