Treebank Statistics: UD_Indonesian-PUD: POS Tags: PROPN
There are 1282 PROPN lemmas (30%), 1282 PROPN types (26%) and 2113 PROPN tokens (11%).
Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.
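The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is made up for illustration, `read_tokens` and `pos_counts` are hypothetical helper names, and it assumes that a "type" is a distinct, case-sensitive word form.

```python
# Made-up CoNLL-U fragment (NOT taken from the treebank); columns are tab-separated.
SAMPLE = """\
1\tAmerika\tAmerika\tPROPN\t_\t_\t2\tnsubj\t_\t_
2\tmenang\tmenang\tVERB\t_\t_\t0\troot\t_\t_
3\tdi\tdi\tADP\t_\t_\t4\tcase\t_\t_
4\tEropa\tEropa\tPROPN\t_\t_\t2\tobl\t_\t_

1\tAmerika\tAmerika\tPROPN\t_\t_\t0\troot\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "PROPN"))
```

On the real treebank file the same function would be called with the file contents instead of `SAMPLE`; the percentages are then each count divided by the corresponding total over all tags.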
The 10 most frequent PROPN lemmas: Amerika, Inggris, Eropa, of, Tiongkok, the, Prancis, Spanyol, Yunani, Oktober
The 10 most frequent PROPN types: Amerika, Inggris, Eropa, of, Tiongkok, the, Prancis, Spanyol, Yunani, Oktober
The most frequent ambiguous lemmas (only 3 are observed): de (PROPN 5, X 1), pulau (NOUN 11, PROPN 1), panglima (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types: Perjanjian (PROPN 8, NOUN 2), Selatan (PROPN 8, NOUN 3), Utara (PROPN 7, NOUN 1), de (PROPN 5, X 1), Kaisar (NOUN 5, PROPN 5), Perang (NOUN 6, PROPN 5), III (PROPN 4, NUM 1), Barat (PROPN 3, NOUN 1), Dunia (NOUN 4, PROPN 3), Kota (PROPN 3, NOUN 2)
- Utara
  - PROPN 7: Aljazair Utara berada dalam zona suhu dan memiliki iklim sedang Mediterania .
  - NOUN 1: Mulai abad ke-16 , orang Eropa yang mengunjungi wilayah Karibia menyebut nya “ Laut Selatan ” ( Samudra Pasifik , di sebelah selatan tanah genting Panama ) , bukan “ Laut Utara ” ( Laut Karibia , di sebelah utara tanah genting yang sama ) .
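An "ambiguous type" here is a word form that appears with more than one tag, such as Utara (PROPN 7, NOUN 1). The sketch below shows one way to find such forms; the input pairs are made up for illustration and `ambiguous_types` is a hypothetical helper name, so treat it as an illustration of the idea rather than the page's actual method.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged):
    """tagged: iterable of (form, upos) pairs.

    Return a list of (form, Counter of tags) for forms seen with two or
    more tags, most frequent form first.
    """
    tags_by_form = defaultdict(Counter)
    for form, upos in tagged:
        tags_by_form[form][upos] += 1
    ambiguous = [(f, c) for f, c in tags_by_form.items() if len(c) > 1]
    # Sort by total frequency of the form, highest first.
    ambiguous.sort(key=lambda item: -sum(item[1].values()))
    return ambiguous


# Made-up observations, not the treebank counts.
TAGGED = [
    ("Utara", "PROPN"), ("Utara", "PROPN"), ("Utara", "NOUN"),
    ("Kota", "PROPN"), ("Kota", "NOUN"),
    ("Amerika", "PROPN"),
]

for form, tags in ambiguous_types(TAGGED):
    print(form, dict(tags))
```

Passing (lemma, upos) pairs instead of (form, upos) pairs gives the corresponding list of ambiguous lemmas.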
Morphology
The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.137428).
The 1st highest number of forms (1) was observed with the lemma “’s”: ’s.
The 2nd highest number of forms (1) was observed with the lemma “A”: A.
The 3rd highest number of forms (1) was observed with the lemma “A.S”: A.S.
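A ratio of exactly 1 is consistent with the counts at the top of the page: 1282 types over 1282 lemmas, so every PROPN lemma has a single form. The sketch below computes such a ratio under the assumption that it means "distinct (lemma, form) pairs divided by distinct lemmas"; that definition, the helper name `form_lemma_ratio` and the example pairs are assumptions made for illustration.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) for one part of speech.

    Returns (ratio, forms_by_lemma). Assumes at least one pair is given.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)


# Made-up pairs: every proper noun has one form, as on this page.
PROPN_PAIRS = [("Amerika", "Amerika"), ("Eropa", "Eropa"), ("Amerika", "Amerika")]
# Made-up pairs for a tag where one lemma has two forms.
OTHER_PAIRS = [("makan", "makan"), ("memakan", "makan"), ("tidur", "tidur")]

print(form_lemma_ratio(PROPN_PAIRS)[0])
print(form_lemma_ratio(OTHER_PAIRS)[0])
```

The second return value maps each lemma to its set of forms, which is what a "highest number of forms" ranking would be sorted on.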
PROPN occurs with 1 feature: Abbr (76; 4% instances)
PROPN occurs with 1 feature-value pair: Abbr=Yes
PROPN occurs with 2 feature combinations.
The most frequent feature combination is _ (2037 tokens).
Examples: Amerika, Inggris, Eropa, of, Tiongkok, the, Prancis, Spanyol, Yunani, Oktober
Relations
PROPN nodes are attached to their parents using 17 different relations: nmod (624; 30% instances), flat:name (607; 29% instances), nsubj (262; 12% instances), obl (159; 8% instances), conj (97; 5% instances), appos (77; 4% instances), obj (77; 4% instances), nsubj:pass (47; 2% instances), nmod:tmod (45; 2% instances), nmod:poss (39; 2% instances), flat (31; 1% instances), obl:agent (29; 1% instances), root (10; 0% instances), obl:tmod (4; 0% instances), acl:relcl (3; 0% instances), amod (1; 0% instances), orphan (1; 0% instances)
Parents of PROPN nodes belong to 9 different parts of speech: PROPN (765; 36% instances), NOUN (734; 35% instances), VERB (547; 26% instances), NUM (26; 1% instances), ADJ (15; 1% instances), no tag, i.e. the artificial root (10; 0% instances), X (10; 0% instances), PRON (5; 0% instances), SYM (1; 0% instances)
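Both parent-side distributions can be gathered in one pass over the DEPREL and HEAD columns of a CoNLL-U file. The sketch below is an illustration under stated assumptions: the `SAMPLE` sentences are made up, `sentences` and `parent_stats` are hypothetical helper names, and a token whose head is 0 is counted with an empty parent tag, which is one plausible reading of the entry without a part-of-speech name in the list above.

```python
from collections import Counter

# Made-up CoNLL-U fragment (NOT taken from the treebank); columns are tab-separated.
SAMPLE = """\
1\tAljazair\tAljazair\tPROPN\t_\t_\t3\tnsubj\t_\t_
2\tUtara\tUtara\tPROPN\t_\t_\t1\tflat:name\t_\t_
3\tberada\tberada\tVERB\t_\t_\t0\troot\t_\t_
4\tdi\tdi\tADP\t_\t_\t5\tcase\t_\t_
5\tAfrika\tAfrika\tPROPN\t_\t_\t3\tobl\t_\t_

1\tAmerika\tAmerika\tPROPN\t_\t_\t0\troot\t_\t_
"""


def sentences(text):
    """Yield each sentence as a list of (id, upos, head, deprel) tuples."""
    sent = []
    for line in text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword ranges and empty nodes
            sent.append((int(cols[0]), cols[3], int(cols[6]), cols[7]))
    if sent:
        yield sent


def parent_stats(text, upos):
    """Count incoming relations and parent tags for all nodes tagged `upos`."""
    rels, parent_pos = Counter(), Counter()
    for sent in sentences(text):
        tag_of = {tok_id: tag for tok_id, tag, _, _ in sent}
        for _, tag, head, deprel in sent:
            if tag == upos:
                rels[deprel] += 1
                # head 0 is the artificial root and has no tag: use "".
                parent_pos[tag_of.get(head, "")] += 1
    return rels, parent_pos


rels, parent_pos = parent_stats(SAMPLE, "PROPN")
total = sum(rels.values())
for rel, n in rels.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

With the page's own figures, the same rounding gives 624 / 2113 ≈ 30% for nmod and 10 / 2113 ≈ 0% for root.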
1209 (57%) PROPN nodes are leaves.
477 (23%) PROPN nodes have one child.
238 (11%) PROPN nodes have two children.
189 (9%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 8.
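The four child-degree groups above add up to the 2113 PROPN tokens (1209 + 477 + 238 + 189). A breakdown of this kind can be computed by counting, per sentence, how often each token ID appears in the HEAD column. The sketch below assumes a made-up `SAMPLE` fragment and hypothetical helper names (`sentences`, `child_degree_buckets`); it is not the generator used for this page.

```python
from collections import Counter

# Made-up CoNLL-U fragment (NOT taken from the treebank); columns are tab-separated.
SAMPLE = """\
1\tAljazair\tAljazair\tPROPN\t_\t_\t3\tnsubj\t_\t_
2\tUtara\tUtara\tPROPN\t_\t_\t1\tflat:name\t_\t_
3\tberada\tberada\tVERB\t_\t_\t0\troot\t_\t_
4\tdi\tdi\tADP\t_\t_\t5\tcase\t_\t_
5\tAfrika\tAfrika\tPROPN\t_\t_\t3\tobl\t_\t_
"""


def sentences(text):
    """Yield each sentence as a list of (id, upos, head) tuples."""
    sent = []
    for line in text.splitlines():
        if not line.strip():
            if sent:
                yield sent
                sent = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        if cols[0].isdigit():  # skip multiword ranges and empty nodes
            sent.append((int(cols[0]), cols[3], int(cols[6])))
    if sent:
        yield sent


def child_degree_buckets(text, upos):
    """Count nodes tagged `upos` with 0, 1, 2 and 3-or-more children (key 3)."""
    buckets = Counter()
    for sent in sentences(text):
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets


buckets = child_degree_buckets(SAMPLE, "PROPN")
total = sum(buckets.values())
for degree in sorted(buckets):
    print(degree, buckets[degree], f"{round(100 * buckets[degree] / total)}%")
```

Key 0 corresponds to leaves and key 3 collects every node with three or more children; the highest child degree would be the maximum of `n_children[tok_id]` before it is capped.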
Children of PROPN nodes are attached using 24 different relations: flat:name (607; 37% instances), case (365; 22% instances), punct (218; 13% instances), conj (102; 6% instances), nmod (80; 5% instances), cc (78; 5% instances), acl:relcl (39; 2% instances), appos (36; 2% instances), nmod:lmod (21; 1% instances), flat (19; 1% instances), amod (14; 1% instances), nmod:tmod (13; 1% instances), nsubj (13; 1% instances), cop (10; 1% instances), acl (8; 0% instances), det (7; 0% instances), advmod (6; 0% instances), nummod (5; 0% instances), nmod:poss (3; 0% instances), advcl (2; 0% instances), obl (2; 0% instances), cc:preconj (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances)
Children of PROPN nodes belong to 15 different parts of speech: PROPN (765; 46% instances), ADP (365; 22% instances), PUNCT (218; 13% instances), NOUN (105; 6% instances), CCONJ (79; 5% instances), VERB (45; 3% instances), NUM (25; 2% instances), ADJ (15; 1% instances), AUX (10; 1% instances), DET (8; 0% instances), PRON (7; 0% instances), ADV (5; 0% instances), X (2; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)