home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-TueCL: POS Tags: PROPN

There are 55 PROPN lemmas (7%), 55 PROPN types (7%) and 55 PROPN tokens (3%). Out of 15 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent PROPN lemmas: Abdul, Arthur, Ba, Bangalore, Benjamin, Cenote, Châu Á, Con, Dolarhyde, F

The 10 most frequent PROPN types: Abdul, Arthur, Ba, Bangalore, Benjamin, Cenote, Châu Á, Con, Dolarhyde, F

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “Abdul”: Abdul.

The 2nd highest number of forms (1) was observed with the lemma “Arthur”: Arthur.

The 3rd highest number of forms (1) was observed with the lemma “Ba”: Ba.

PROPN occurs with 4 features: NameType (54; 98% instances), Foreign (21; 38% instances), Case (1; 2% instances), Typo (1; 2% instances)

PROPN occurs with 11 feature-value pairs: Case=Voc, Foreign=Yes, NameType=Com, NameType=Geo, NameType=Giv, NameType=Nat, NameType=Oth, NameType=Pro, NameType=Prs, NameType=Sur, Typo=Yes

PROPN occurs with 14 feature combinations. The most frequent feature combination is Foreign=Yes|NameType=Oth (11 tokens). Examples: Famillia, Federation, Future, Knights, Los, Michoacana, Sinaloa, Templar, The, You

Relations

PROPN nodes are attached to their parents using 12 different relations: flat (17; 31% instances), compound (10; 18% instances), nmod (8; 15% instances), obj (6; 11% instances), nsubj (3; 5% instances), vocative (3; 5% instances), conj (2; 4% instances), obl (2; 4% instances), advcl (1; 2% instances), appos (1; 2% instances), nmod:poss (1; 2% instances), parataxis (1; 2% instances)

Parents of PROPN nodes belong to 4 different parts of speech: PROPN (20; 36% instances), NOUN (19; 35% instances), VERB (14; 25% instances), NUM (2; 4% instances)

29 (53%) PROPN nodes are leaves.

15 (27%) PROPN nodes have one child.

4 (7%) PROPN nodes have two children.

7 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 4.

Children of PROPN nodes are attached using 12 different relations: flat (17; 38% instances), case (9; 20% instances), punct (8; 18% instances), cc (2; 4% instances), conj (2; 4% instances), acl:relcl (1; 2% instances), advcl (1; 2% instances), advmod (1; 2% instances), amod (1; 2% instances), compound (1; 2% instances), nmod (1; 2% instances), nsubj (1; 2% instances)

Children of PROPN nodes belong to 9 different parts of speech: PROPN (20; 44% instances), ADP (9; 20% instances), PUNCT (8; 18% instances), CCONJ (2; 4% instances), VERB (2; 4% instances), ADJ (1; 2% instances), ADV (1; 2% instances), NOUN (1; 2% instances), PRON (1; 2% instances)