Treebank Statistics: UD_Indonesian-GSD: POS Tags: PROPN
There are 10349 PROPN lemmas (52%), 10472 PROPN types (47%) and 22418 PROPN tokens (18%).
Out of 17 observed tags, the rank of PROPN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.
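The lemma, type and token counts and the per-tag ranks reported above could be derived from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that produced this page: the sample data is made up, the helper names `read_tokens`, `pos_counts` and `rank` are hypothetical, and it assumes that types are counted case-sensitively.

```python
from collections import defaultdict

# Tiny made-up CoNLL-U fragment (hypothetical data, not taken from the treebank).
SAMPLE = (
    "1\tJawa\tjawa\tPROPN\t_\t_\t0\troot\t_\t_\n"
    "2\tTimur\ttimur\tPROPN\t_\t_\t1\tflat:name\t_\t_\n"
    "3\tdi\tdi\tADP\t_\t_\t4\tcase\t_\t_\n"
    "4\ttimur\ttimur\tNOUN\t_\t_\t1\tnmod\t_\t_\n"
)

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]

def pos_counts(conllu_text):
    """Map each UPOS tag to its number of distinct lemmas, distinct types and tokens."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: word forms are compared case-sensitively
        tokens[upos] += 1
    return {
        tag: {"lemmas": len(lemmas[tag]), "types": len(types[tag]), "tokens": tokens[tag]}
        for tag in tokens
    }

def rank(counts, tag, key):
    """1-based rank of `tag` among all tags by counts[tag][key], highest first."""
    ordered = sorted(counts, key=lambda t: counts[t][key], reverse=True)
    return ordered.index(tag) + 1

counts = pos_counts(SAMPLE)
print(counts["PROPN"], rank(counts, "PROPN", "tokens"))
```

The percentages in the text would then be each PROPN count divided by the corresponding total over all tags.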
The 10 most frequent PROPN lemmas: indonesia, kabupaten, kecamatan, jawa, provinsi, amerika, timur, barat, the, jepang
The 10 most frequent PROPN types: indonesia, kabupaten, kecamatan, Jawa, provinsi, Amerika, Timur, Barat, the, jepang
The 10 most frequent ambiguous lemmas: indonesia (PROPN 437, NOUN 5), kabupaten (PROPN 225, NOUN 41), jawa (PROPN 155, NOUN 2), provinsi (PROPN 135, NOUN 20), timur (PROPN 97, NOUN 29), barat (PROPN 94, NOUN 42), tengah (PROPN 87, NOUN 34, ADJ 6, ADP 2), kota (NOUN 206, PROPN 83), utara (PROPN 79, NOUN 43), selatan (PROPN 71, NOUN 40, ADJ 1)
The 10 most frequent ambiguous types: kabupaten (NOUN 33, PROPN 7), kecamatan (PROPN 116, NOUN 90), Jawa (PROPN 155, NOUN 1), provinsi (PROPN 53, NOUN 17), Timur (PROPN 97, NOUN 1), Barat (PROPN 94, NOUN 4), tengah (NOUN 13, ADJ 4, ADP 2, PROPN 1), kota (NOUN 162, PROPN 14), utara (NOUN 41, PROPN 4), Selatan (PROPN 71, ADJ 1)
- kabupaten
- kecamatan
- Jawa
- provinsi
- Timur
  - PROPN 97: Maumere adalah ibu kota Kabupaten Sikka , Nusa Tenggara Timur .
  - NOUN 1: Kabupaten Bener Meriah yang beribukota di Simpang Tiga Redelong memiliki luas 1.919,69 km2 terdiri dari 10 Kecamatan dan 233 desa , Bener Meriah terletak 4 ° 33 50 - 4 ° 54 50 Lintang Utara dan 96 ° 40 75 - 97 ° 17 50 Bujur Timur dengan tinggi rata-rata di atas permukaan laut 100 - 2.500 mdpl Kabupaten Bener Meriah mencakup bagian utara Kabupaten Aceh Tengah yang berbatasan dengan Komoditi unggulan Kabupaten Bener Meriah yaitu sektor Perkebunan dan jasa .
- Barat
- tengah
  - NOUN 13: Provinsi ini terletak di bagian tengah di negara itu .
  - ADJ 4: David Vernon Watson ( ) , adalah mantan pemain sepak bola asal Inggris yang berposisi sebagai bek tengah .
  - ADP 2: Di sini letak desa nya berada di bagian tengah Sumbawa .
  - PROPN 1: Di desa Candinata Kecamatan Kutasari Kabupaten Purbalingga Jawa tengah sampai dengan tahun 2011 terdapat terdapat Tempat Pendidikan formal dan Nonformal .
- kota
- utara
- Selatan
  - PROPN 71: Luas wilayah nya terbesar ketiga di Korea Selatan .
  - ADJ 1: Begitu pula pada Peristiwa Permesta tahun 1957 sampai dengan tahun 1959 ditempat ini pula terjadi nya pertempuran pertama antara Tentara Pusat atau TNI yang dating dari arah Selatan dan dibantu oleh masyarakat setempat yang dikomandoi oleh Kapten Daan Olii berhadapan dengan tentara Permesta dikomandoi oleh Kapten Mawikere dan Kapten Pantouw yang dating dari arah Kotamobagu dengan dibatasi oleh Sungai Ongkag yang membelah Desa Mopait sebagai pusat pertahanan masing-masing pihak . ( Kapten Daan Olii tercatat pernah menjabat Bupati di Bolaang Mongondow ) .
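The ambiguous lemma and type lists earlier in this section pair each item with its per-tag frequencies. One way such a list might be computed is sketched below. The input pairs are made up for illustration and the function name `ambiguous` is hypothetical. The same function works for lemmas or for word forms, depending on what is passed as the first element of each pair.

```python
from collections import Counter, defaultdict

# Hypothetical (item, upos) pairs; counts are illustrative, not treebank figures.
PAIRS = [
    ("timur", "PROPN"), ("timur", "PROPN"), ("timur", "NOUN"),
    ("jepang", "PROPN"),
    ("kota", "NOUN"), ("kota", "NOUN"), ("kota", "PROPN"),
]

def ambiguous(pairs, tag):
    """Return items seen with `tag` and at least one other tag.

    Each result is (item, [(upos, count), ...]) with the tag list sorted by
    descending count; items are ordered by their frequency under `tag`.
    """
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    hits = [
        (item, tags.most_common())
        for item, tags in by_item.items()
        if tag in tags and len(tags) > 1
    ]
    return sorted(hits, key=lambda hit: by_item[hit[0]][tag], reverse=True)

result = ambiguous(PAIRS, "PROPN")
for item, tags in result:
    print(item, ", ".join(f"{upos} {n}" for upos, n in tags))
```

Unambiguous items such as `jepang` in the sample are filtered out because they occur with a single tag only.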
Morphology
The form / lemma ratio of PROPN is 1.011885 (the average of all parts of speech is 1.120343).
The 1st highest number of forms (20) was observed with the lemma “_”: ’, ‘’, Aphrodite’s, Bird’s, He’s, Heaven’s, It’s, Ja’s, Jane’s, Levy’s, Lloyd’s, Moody’s, Pepper’s, Punk’s, Robert’s, Rubik’s, She’s, TV’s, Valve’s, Woman’s.
The 2nd highest number of forms (3) was observed with the lemma “dr”: DR, Dr, Dr..
The 3rd highest number of forms (2) was observed with the lemma “a”: Amu, a.
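The reported form / lemma ratio is numerically consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page (10472 / 10349). The sketch below checks that arithmetic and shows one possible way to rank lemmas by their number of distinct forms. The function names are hypothetical, and the small `pairs` list only reuses forms quoted above.

```python
from collections import defaultdict

def form_lemma_ratio(n_types, n_lemmas):
    """Ratio of distinct word forms to distinct lemmas (assumed definition)."""
    return n_types / n_lemmas

def forms_per_lemma(pairs):
    """Group distinct forms by lemma, lemmas with the most forms first."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return sorted(forms.items(), key=lambda kv: len(kv[1]), reverse=True)

# Figures taken from this page: 10472 PROPN types, 10349 PROPN lemmas.
ratio = form_lemma_ratio(10472, 10349)
print(f"{ratio:.6f}")

# (form, lemma) pairs built from the forms listed in this section.
pairs = [("DR", "dr"), ("Dr", "dr"), ("Dr.", "dr"), ("Amu", "a"), ("a", "a"), ("Jawa", "jawa")]
ranking = forms_per_lemma(pairs)
print(ranking[0])
```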
PROPN occurs with 2 features: Abbr (4; 0% instances), Typo (1; 0% instances)
PROPN occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes
PROPN occurs with 3 feature combinations.
The most frequent feature combination is _ (22413 tokens).
Examples: indonesia, kabupaten, kecamatan, Jawa, provinsi, Amerika, Timur, Barat, the, jepang
Relations
PROPN nodes are attached to their parents using 27 different relations: flat:name (8080; 36% instances), nmod (4933; 22% instances), nsubj (2390; 11% instances), appos (1820; 8% instances), conj (1516; 7% instances), obl (1514; 7% instances), obj (775; 3% instances), nsubj:pass (478; 2% instances), amod (375; 2% instances), root (194; 1% instances), flat (179; 1% instances), advcl (27; 0% instances), acl (25; 0% instances), dep (25; 0% instances), obl:tmod (21; 0% instances), parataxis (19; 0% instances), ccomp (8; 0% instances), iobj (8; 0% instances), vocative (8; 0% instances), list (7; 0% instances), compound (4; 0% instances), obl:agent (4; 0% instances), discourse (2; 0% instances), nmod:tmod (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances)
Parents of PROPN nodes belong to 14 different parts of speech: PROPN (11504; 51% instances), NOUN (5807; 26% instances), VERB (4585; 20% instances), no part of speech, i.e. the node is the root (194; 1% instances), NUM (127; 1% instances), ADJ (115; 1% instances), PRON (33; 0% instances), ADV (19; 0% instances), X (12; 0% instances), SYM (9; 0% instances), ADP (6; 0% instances), CCONJ (4; 0% instances), PART (2; 0% instances), AUX (1; 0% instances)
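Each entry in the relation and parent lists above gives a raw count and a whole-number share of all PROPN tokens. A table of that shape could be produced as sketched here. The helper name `share_table` is hypothetical, the sample labels are made up, and rounding to the nearest whole percent is an assumption about how the page was generated.

```python
from collections import Counter

def share_table(labels):
    """Return (label, count, percent) rows, most frequent label first.

    Percentages are rounded to the nearest whole number (assumed rounding rule).
    """
    counts = Counter(labels)
    total = sum(counts.values())
    return [
        (label, n, int(100 * n / total + 0.5))
        for label, n in counts.most_common()
    ]

# Made-up relation labels of four PROPN nodes.
rows = share_table(["flat:name", "flat:name", "nmod", "root"])
for label, n, pct in rows:
    print(f"{label} ({n}; {pct}% instances)")

# Cross-check against a figure from this page: 8080 flat:name out of 22418 tokens.
reported = int(100 * 8080 / 22418 + 0.5)
print(reported)
```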
9280 (41%) PROPN nodes are leaves.
7103 (32%) PROPN nodes have one child.
3409 (15%) PROPN nodes have two children.
2626 (12%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 28.
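The four child-degree figures above add up to the 22418 PROPN tokens (9280 + 7103 + 3409 + 2626). A minimal sketch of how nodes might be bucketed by number of children follows. The sentence is made up and `child_degree_buckets` is a hypothetical helper, not the script behind this page.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
SENT = [
    (1, "PROPN", 0),
    (2, "PROPN", 1),
    (3, "PUNCT", 1),
    (4, "PROPN", 1),
    (5, "ADP", 4),
]

def child_degree_buckets(sentence, tag):
    """Count nodes tagged `tag` with 0, 1, 2, or 3-or-more children.

    The returned Counter uses keys 0, 1, 2 and 3, where 3 means "three or more".
    """
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, upos, _ in sentence:
        if upos == tag:
            buckets[min(n_children[node_id], 3)] += 1
    return buckets

buckets = child_degree_buckets(SENT, "PROPN")
print(buckets[0], buckets[1], buckets[2], buckets[3])

# Consistency check on the figures reported on this page.
total = 9280 + 7103 + 3409 + 2626
print(total)
```

For a whole treebank the same function would be applied per sentence and the resulting counters summed, since node ids restart in every sentence.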
Children of PROPN nodes are attached using 36 different relations: flat:name (8072; 34% instances), punct (4795; 20% instances), case (2872; 12% instances), conj (1565; 7% instances), appos (1539; 6% instances), nummod (974; 4% instances), cc (767; 3% instances), nmod (711; 3% instances), amod (526; 2% instances), compound (373; 2% instances), acl:relcl (317; 1% instances), det (284; 1% instances), nsubj (167; 1% instances), cop (139; 1% instances), advmod (96; 0% instances), flat (83; 0% instances), acl (81; 0% instances), dep (50; 0% instances), nmod:poss (42; 0% instances), advcl (37; 0% instances), mark (30; 0% instances), nmod:tmod (25; 0% instances), obl (25; 0% instances), advmod:emph (24; 0% instances), parataxis (23; 0% instances), obj (20; 0% instances), xcomp (12; 0% instances), nmod:lmod (9; 0% instances), ccomp (7; 0% instances), list (6; 0% instances), csubj (2; 0% instances), aux (1; 0% instances), cc:preconj (1; 0% instances), discourse (1; 0% instances), goeswith (1; 0% instances), nsubj:pass (1; 0% instances)
Children of PROPN nodes belong to 17 different parts of speech: PROPN (11504; 49% instances), PUNCT (4795; 20% instances), ADP (2875; 12% instances), NOUN (1189; 5% instances), NUM (1046; 4% instances), CCONJ (769; 3% instances), VERB (515; 2% instances), DET (282; 1% instances), ADJ (241; 1% instances), AUX (140; 1% instances), ADV (94; 0% instances), PRON (89; 0% instances), SYM (47; 0% instances), PART (40; 0% instances), SCONJ (29; 0% instances), X (22; 0% instances), INTJ (1; 0% instances)