Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: PROPN
There are 55 PROPN
lemmas (7%), 55 PROPN
types (7%) and 55 PROPN
tokens (3%).
Out of 15 observed tags, the rank of PROPN
is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent PROPN
lemmas: Abdul, Arthur, Ba, Bangalore, Benjamin, Cenote, Châu Á, Con, Dolarhyde, F
The 10 most frequent PROPN
types: Abdul, Arthur, Ba, Bangalore, Benjamin, Cenote, Châu Á, Con, Dolarhyde, F
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “Abdul”: Abdul.
The 2nd highest number of forms (1) was observed with the lemma “Arthur”: Arthur.
The 3rd highest number of forms (1) was observed with the lemma “Ba”: Ba.
PROPN
occurs with 4 features: NameType (54; 98% instances), Foreign (21; 38% instances), Case (1; 2% instances), Typo (1; 2% instances)
PROPN
occurs with 11 feature-value pairs: Case=Voc
, Foreign=Yes
, NameType=Com
, NameType=Geo
, NameType=Giv
, NameType=Nat
, NameType=Oth
, NameType=Pro
, NameType=Prs
, NameType=Sur
, Typo=Yes
PROPN
occurs with 14 feature combinations.
The most frequent feature combination is Foreign=Yes|NameType=Oth
(11 tokens).
Examples: Famillia, Federation, Future, Knights, Los, Michoacana, Sinaloa, Templar, The, You
Relations
PROPN
nodes are attached to their parents using 12 different relations: flat (17; 31% instances), compound (10; 18% instances), nmod (8; 15% instances), obj (6; 11% instances), nsubj (3; 5% instances), vocative (3; 5% instances), conj (2; 4% instances), obl (2; 4% instances), advcl (1; 2% instances), appos (1; 2% instances), nmod:poss (1; 2% instances), parataxis (1; 2% instances)
Parents of PROPN
nodes belong to 4 different parts of speech: PROPN (20; 36% instances), NOUN (19; 35% instances), VERB (14; 25% instances), NUM (2; 4% instances)
29 (53%) PROPN
nodes are leaves.
15 (27%) PROPN
nodes have one child.
4 (7%) PROPN
nodes have two children.
7 (13%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 4.
Children of PROPN
nodes are attached using 12 different relations: flat (17; 38% instances), case (9; 20% instances), punct (8; 18% instances), cc (2; 4% instances), conj (2; 4% instances), acl:relcl (1; 2% instances), advcl (1; 2% instances), advmod (1; 2% instances), amod (1; 2% instances), compound (1; 2% instances), nmod (1; 2% instances), nsubj (1; 2% instances)
Children of PROPN
nodes belong to 9 different parts of speech: PROPN (20; 44% instances), ADP (9; 20% instances), PUNCT (8; 18% instances), CCONJ (2; 4% instances), VERB (2; 4% instances), ADJ (1; 2% instances), ADV (1; 2% instances), NOUN (1; 2% instances), PRON (1; 2% instances)