Treebank Statistics: UD_Breton-KEB: POS Tags: PROPN
There are 112 PROPN
lemmas (6%), 120 PROPN
types (5%) and 307 PROPN
tokens (3%).
Out of 17 observed tags, the rank of PROPN
is: 6 in number of lemmas, 6 in number of types and 9 in number of tokens.
The 10 most frequent PROPN
lemmas: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kemper, Kembre, Naoned, Divi
The 10 most frequent PROPN
types: Breizh, Yann, Yannig, Lenaig, Pariz, Frañs, Kembre, Naoned, Divi, Europa
The 10 most frequent ambiguous lemmas: Ofis (X 4, PROPN 1)
The 10 most frequent ambiguous types: Brezhoneg (PROPN 4, NOUN 1), Bremañ (ADV 10, PROPN 2), Kemener (PROPN 2, NOUN 1), Ofis (X 4, PROPN 1)
- Brezhoneg
- Bremañ
- Kemener
- Ofis
Morphology
The form / lemma ratio of PROPN
is 1.071429 (the average of all parts of speech is 1.395664).
The 1st highest number of forms (3) was observed with the lemma “Pariz”: Bariz, Paris, Pariz.
The 2nd highest number of forms (2) was observed with the lemma “Breizh”: Breizh, Vreizh.
The 3rd highest number of forms (2) was observed with the lemma “Gwened”: Gwened, Wened.
PROPN
occurs with 2 features: Number (307; 100% instances), Gender (107; 35% instances)
PROPN
occurs with 4 feature-value pairs: Gender=Fem
, Gender=Masc
, Number=Plur
, Number=Sing
PROPN
occurs with 4 feature combinations.
The most frequent feature combination is Number=Sing
(199 tokens).
Examples: Breizh, Yann, Pariz, Frañs, Kembre, Yannig, Lenaig, Naoned, Europa, Karaez
Relations
PROPN
nodes are attached to their parents using 14 different relations: nsubj (87; 28% instances), nmod:gen (67; 22% instances), obl (53; 17% instances), flat:name (21; 7% instances), nmod (21; 7% instances), conj (16; 5% instances), obj (11; 4% instances), obl:agent (11; 4% instances), root (8; 3% instances), appos (7; 2% instances), nmod:poss (2; 1% instances), dep (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances)
Parents of PROPN
nodes belong to 9 different parts of speech: VERB (144; 47% instances), NOUN (113; 37% instances), PROPN (32; 10% instances), (8; 3% instances), ADJ (5; 2% instances), PRON (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)
162 (53%) PROPN
nodes are leaves.
98 (32%) PROPN
nodes have one child.
25 (8%) PROPN
nodes have two children.
22 (7%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 10.
Children of PROPN
nodes are attached using 18 different relations: case (95; 40% instances), punct (35; 15% instances), flat:name (19; 8% instances), conj (17; 7% instances), dep (16; 7% instances), det (12; 5% instances), appos (6; 3% instances), cc (6; 3% instances), cop (5; 2% instances), nmod:gen (5; 2% instances), advmod (4; 2% instances), aux (4; 2% instances), nsubj (4; 2% instances), obl (3; 1% instances), acl (2; 1% instances), amod (2; 1% instances), nmod (2; 1% instances), parataxis (1; 0% instances)
Children of PROPN
nodes belong to 14 different parts of speech: ADP (94; 39% instances), PUNCT (35; 15% instances), PROPN (32; 13% instances), NOUN (20; 8% instances), X (16; 7% instances), DET (12; 5% instances), AUX (9; 4% instances), CCONJ (6; 3% instances), ADV (4; 2% instances), ADJ (3; 1% instances), VERB (3; 1% instances), NUM (2; 1% instances), INTJ (1; 0% instances), PRON (1; 0% instances)