home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: PROPN

There are 112 PROPN lemmas (6%), 120 PROPN types (5%) and 307 PROPN tokens (3%). Out of 17 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kemper, Kembre, Naoned, Divi

The 10 most frequent PROPN types: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kembre, Naoned, Divi, Europa

The 10 most frequent ambiguous lemmas: Ofis (X 4, PROPN 1)

The 10 most frequent ambiguous types: Brezhoneg (PROPN 4, NOUN 1), Bremañ (ADV 10, PROPN 2), Kemener (PROPN 2, NOUN 1), Ofis (X 4, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.071429 (the average of all parts of speech is 1.395664).

The 1st highest number of forms (3) was observed with the lemma “Pariz”: Bariz, Paris, Pariz.

The 2nd highest number of forms (2) was observed with the lemma “Breizh”: Breizh, Vreizh.

The 3rd highest number of forms (2) was observed with the lemma “Gwened”: Gwened, Wened.

PROPN occurs with 2 features: Number (307; 100% instances), Gender (107; 35% instances)

PROPN occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 4 feature combinations. The most frequent feature combination is Number=Sing (199 tokens). Examples: Breizh, Yann, Pariz, Frañs, Kembre, Yannig, Lenaig, Naoned, Europa, Karaez

Relations

PROPN nodes are attached to their parents using 14 different relations: nsubj (87; 28% instances), nmod:gen (67; 22% instances), obl (53; 17% instances), flat:name (21; 7% instances), nmod (21; 7% instances), conj (16; 5% instances), obj (11; 4% instances), obl:agent (11; 4% instances), root (8; 3% instances), appos (7; 2% instances), nmod:poss (2; 1% instances), dep (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: VERB (144; 47% instances), NOUN (113; 37% instances), PROPN (32; 10% instances), (8; 3% instances), ADJ (5; 2% instances), PRON (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)

162 (53%) PROPN nodes are leaves.

98 (32%) PROPN nodes have one child.

25 (8%) PROPN nodes have two children.

22 (7%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 10.

Children of PROPN nodes are attached using 18 different relations: case (95; 40% instances), punct (35; 15% instances), flat:name (19; 8% instances), conj (17; 7% instances), dep (16; 7% instances), det (12; 5% instances), appos (6; 3% instances), cc (6; 3% instances), cop (5; 2% instances), nmod:gen (5; 2% instances), advmod (4; 2% instances), aux (4; 2% instances), nsubj (4; 2% instances), obl (3; 1% instances), acl (2; 1% instances), amod (2; 1% instances), nmod (2; 1% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: ADP (94; 39% instances), PUNCT (35; 15% instances), PROPN (32; 13% instances), NOUN (20; 8% instances), X (16; 7% instances), DET (12; 5% instances), AUX (9; 4% instances), CCONJ (6; 3% instances), ADV (4; 2% instances), ADJ (3; 1% instances), VERB (3; 1% instances), NUM (2; 1% instances), INTJ (1; 0% instances), PRON (1; 0% instances)