Treebank Statistics: UD_Tamil-MWTT: POS Tags: PROPN
There are 7 PROPN
lemmas (1%), 32 PROPN
types (4%) and 315 PROPN
tokens (12%).
Out of 13 observed tags, the rank of PROPN
is: 10 in number of lemmas, 7 in number of types and 4 in number of tokens.
The 10 most frequent PROPN
lemmas: குமார், ராஜா, அமெரிக்கா, சென்னை, கமலா, பாண்டிச்சேரி, ராமன்
The 10 most frequent PROPN
types: குமார், குமாருக்கு, குமாரை, ராஜா, ராஜாவை, குமாருக்குத், குமாருக்குப், ராஜாவுக்கு, அமெரிகாவுக்கு, குமாருக்குச்
The 10 most frequent ambiguous lemmas: ராஜா (PROPN 21, ADV 1), சென்னை (PROPN 2, NOUN 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN
is 4.571429 (the average of all parts of speech is 1.743028).
The 1st highest number of forms (18) was observed with the lemma “குமார்”: குமாரது, குமாரால், குமாராவது, குமாரிடம், குமாருக்கு, குமாருக்குச், குமாருக்குத், குமாருக்குப், குமாருடையது, குமாரும், குமாரே, குமாரை, குமாரைத், குமாரைப், குமாரோ, குமாரோடு, குமார், குமார்தான்.
The 2nd highest number of forms (8) was observed with the lemma “ராஜா”: ராஜா, ராஜாவிடம், ராஜாவுக்கு, ராஜாவும், ராஜாவை, ராஜாவைத், ராஜாவைப், ராஜாவோ.
The 3rd highest number of forms (2) was observed with the lemma “சென்னை”: சென்னைக்கு, சென்னையில்.
PROPN
occurs with 5 features: Case (315; 100% instances), Number (315; 100% instances), Person (315; 100% instances), Polite (39; 12% instances), Gender (2; 1% instances)
PROPN
occurs with 11 feature-value pairs: Case=Acc
, Case=Com
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Neut
, Number=Sing
, Person=3
, Polite=Form
PROPN
occurs with 12 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3
(202 tokens).
Examples: குமார், ராஜா, குமாரும், குமாரோ, குமாராவது, குமாரே, குமார்தான், ராஜாவும், ராஜாவோ
Relations
PROPN
nodes are attached to their parents using 11 different relations: nsubj (228; 72% instances), nsubj:nc (38; 12% instances), obj (17; 5% instances), obl (9; 3% instances), root (7; 2% instances), iobj (5; 2% instances), nsubj:pass (4; 1% instances), conj (3; 1% instances), nmod (2; 1% instances), obl:cmpr (1; 0% instances), obl:lmod (1; 0% instances)
Parents of PROPN
nodes belong to 6 different parts of speech: VERB (282; 90% instances), NOUN (18; 6% instances), (7; 2% instances), PROPN (4; 1% instances), ADV (2; 1% instances), PRON (2; 1% instances)
299 (95%) PROPN
nodes are leaves.
9 (3%) PROPN
nodes have one child.
6 (2%) PROPN
nodes have two children.
1 (0%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 3.
Children of PROPN
nodes are attached using 7 different relations: punct (7; 29% instances), case (6; 25% instances), nsubj (4; 17% instances), conj (3; 13% instances), obl (2; 8% instances), advcl (1; 4% instances), cc (1; 4% instances)
Children of PROPN
nodes belong to 7 different parts of speech: PUNCT (7; 29% instances), ADP (6; 25% instances), NOUN (4; 17% instances), PROPN (4; 17% instances), ADV (1; 4% instances), CCONJ (1; 4% instances), PRON (1; 4% instances)