Treebank Statistics: UD_Kaapor-TuDeT: POS Tags: PROPN
There are 16 PROPN lemmas (8%), 18 PROPN types (8%) and 20 PROPN tokens (5%).
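As a rough illustration of how the lemma, type and token counts above could be obtained, the following sketch tallies them for one tag from CoNLL-U text. The function name `pos_counts` and the toy input are hypothetical. Only the forms and lemmas "Maíra", "Maírake" and "Purutu" come from this page, and the remaining columns are placeholders. Types are counted here as distinct case-sensitive word forms, which is an assumption about how the page's figures were computed.

```python
def pos_counts(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy input (not real treebank sentences): three PROPN tokens and one placeholder verb.
rows = [
    ["1", "Maíra", "Maíra", "PROPN", "_", "_", "4", "nsubj", "_", "_"],
    ["2", "Maírake", "Maíra", "PROPN", "_", "_", "4", "obj", "_", "_"],
    ["3", "Purutu", "Purutu", "PROPN", "_", "_", "4", "obl", "_", "_"],
    ["4", "X", "X", "VERB", "_", "_", "0", "root", "_", "_"],
]
sample = "\n".join("\t".join(r) for r in rows) + "\n"

print(pos_counts(sample, "PROPN"))  # (2, 3, 3)
```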
Out of 14 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.
The 10 most frequent PROPN lemmas: Maíra, Purutu, _, ʃaʔe, Ana, Arauxu, Kaninde, Oropo, Tuti, kaitā
The 10 most frequent PROPN types: Xõru, ʃaʔe, Anake, Arauxu, Kaninde, Maiɾ, Mataru, Maíra, Maírake, Oropo
The 10 most frequent ambiguous lemmas: _ (VERB 3, NOUN 2, PROPN 2, ADV 1, NUM 1, PRON 1), kamarar (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types: kamarar (NOUN 1, PROPN 1)
Morphology
The form / lemma ratio of PROPN is 1.125000 (the average of all parts of speech is 1.154229).
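The PROPN ratio is consistent with dividing the 18 PROPN types by the 16 PROPN lemmas reported at the top of this page. A minimal check, assuming that this is how the figure is derived (the variable names are illustrative):

```python
n_types = 18   # PROPN types reported on this page
n_lemmas = 16  # PROPN lemmas reported on this page

ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 1.125000
```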
The 1st highest number of forms (2) was observed with the lemma “Maíra”: Maíra, Maírake.
The 2nd highest number of forms (2) was observed with the lemma “Purutu”: Purutu, Purutuke.
The 3rd highest number of forms (1) was observed with the lemma “Ana”: Anake.
PROPN occurs with 3 features: Animacy (2; 10% instances), Case (2; 10% instances), Number (2; 10% instances)
PROPN occurs with 3 feature-value pairs: Animacy=Hum, Case=Aff, Number=Sing
PROPN occurs with 3 feature combinations.
The most frequent feature combination is _ (16 tokens), i.e. no features at all.
Examples: Xõru, Anake, Arauxu, Kaninde, Maiɾ, Maíra, Maírake, Oropo, Purutu, Tuti
Relations
PROPN nodes are attached to their parents using 4 different relations: nsubj (14; 70% instances), obj (3; 15% instances), nmod (2; 10% instances), obl (1; 5% instances)
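The percentages above are each relation's share of the 20 PROPN tokens. A small sketch of that arithmetic follows. The helper name `shares` is hypothetical, and rounding to whole percentages is an assumption about how the page formats its figures.

```python
def shares(counts):
    """Map each label to its percentage of the total, rounded to a whole number."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Counts taken from the relation list above.
deprel_counts = {"nsubj": 14, "obj": 3, "nmod": 2, "obl": 1}

print(shares(deprel_counts))  # {'nsubj': 70, 'obj': 15, 'nmod': 10, 'obl': 5}
```

The same helper reproduces the parent part-of-speech shares: `shares({"VERB": 18, "NOUN": 2})` gives 90 and 10.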
Parents of PROPN nodes belong to 2 different parts of speech: VERB (18; 90% instances), NOUN (2; 10% instances)
15 (75%) PROPN nodes are leaves.
5 (25%) PROPN nodes have one child.
The highest child degree of a PROPN node is 1.
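Leaf status and child degree follow from the HEAD column of a sentence: a node's child degree is the number of other nodes that name it as their head, and a leaf has degree 0. The sketch below illustrates this on a made-up four-node tree. The function name `child_degrees` and the example heads are hypothetical and are not drawn from the treebank.

```python
def child_degrees(heads):
    """Given 1-based HEAD values (0 = root), return the number of children per node."""
    degrees = [0] * len(heads)
    for head in heads:
        if head > 0:
            degrees[head - 1] += 1  # node `head` gains one child
    return degrees


# Toy tree: node 2 is the root, nodes 1 and 3 attach to it, node 4 attaches to node 3.
heads = [2, 0, 2, 3]
degrees = child_degrees(heads)

print(degrees)  # [0, 2, 1, 0]
leaves = [i + 1 for i, d in enumerate(degrees) if d == 0]
print(leaves)   # [1, 4]
```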
Children of PROPN nodes are attached using 3 different relations: discourse (3; 60% instances), case (1; 20% instances), nmod (1; 20% instances)
Children of PROPN nodes belong to 3 different parts of speech: PART (3; 60% instances), ADP (1; 20% instances), PRON (1; 20% instances)