home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-Modern: POS Tags: PROPN

There are 878 PROPN lemmas (14%), 1134 PROPN types (11%) and 2743 PROPN tokens (3%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 4 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: ísland, þingmaður, ólympíuleikar, hrafnhildur, alþingi, rúv, ríó, em, íslendingur, evrópusamband

The 10 most frequent PROPN types: þm., Íslands, RÚV, Hrafnhildur, Ríó, EM, Ísland, Ólympíuleikunum, Alþingi, Íslandi

The 10 most frequent ambiguous lemmas: þingmaður (NOUN 299, PROPN 82, ADV 1), em (PROPN 32, X 1), evrópumót (PROPN 21, NOUN 1), london (PROPN 20, ADV 1), hm (PROPN 17, ADV 1), danmörk (PROPN 15, X 1), forseti (NOUN 409, PROPN 11), fram (ADP 223, PROPN 11, ADV 4), de (PROPN 10, ADV 1, X 1), helgi (PROPN 10, NOUN 7)

The 10 most frequent ambiguous types: þm. (PROPN 80, NOUN 1), EM (PROPN 32, X 1), London (PROPN 20, ADV 1), HM (PROPN 17, ADV 1), Forseti (NOUN 132, PROPN 11), Fram (PROPN 11, ADP 1), de (PROPN 10, ADV 1, X 1), Millar (PROPN 6, ADV 1), Djokovic (PROPN 5, ADV 1), pírata (NOUN 2, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.291572 (the average of all parts of speech is 1.738114).

The 1st highest number of forms (7) was observed with the lemma “ólympíuleikar”: Ólympíuleika, Ólympíuleikana, Ólympíuleikanna, Ólympíuleikar, Ólympíuleikarnir, Ólympíuleikum, Ólympíuleikunum.

The 2nd highest number of forms (6) was observed with the lemma “evrópumót”: Evrópumót, Evrópumóti, Evrópumótinu, Evrópumótið, Evrópumóts, Evrópumótsins.

The 3rd highest number of forms (6) was observed with the lemma “framsóknarflokkur”: Framsóknarflokki, Framsóknarflokkinn, Framsóknarflokknum, Framsóknarflokksins, Framsóknarflokkur, Framsóknarflokkurinn.

PROPN occurs with 5 features: Case (2036; 74% instances), Definite (2036; 74% instances), Number (2036; 74% instances), Gender (1994; 73% instances), Foreign (88; 3% instances)

PROPN occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 50 feature combinations. The most frequent feature combination is _ (619 tokens). Examples: þm., RÚV, EM, H., HM, London, Collins, KSÍ, United, KR

Relations

PROPN nodes are attached to their parents using 18 different relations: obl (694; 25% instances), nsubj (516; 19% instances), flat:name (481; 18% instances), dep (325; 12% instances), nmod:poss (313; 11% instances), conj (149; 5% instances), appos (94; 3% instances), obj (75; 3% instances), root (29; 1% instances), iobj (27; 1% instances), xcomp (17; 1% instances), advcl (6; 0% instances), acl:relcl (5; 0% instances), amod (5; 0% instances), ccomp (3; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), discourse (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: PROPN (1030; 38% instances), VERB (865; 32% instances), NOUN (668; 24% instances), ADJ (56; 2% instances), PRON (40; 1% instances), (29; 1% instances), ADV (22; 1% instances), AUX (13; 0% instances), DET (8; 0% instances), ADP (6; 0% instances), NUM (3; 0% instances), PART (3; 0% instances)

1175 (43%) PROPN nodes are leaves.

759 (28%) PROPN nodes have one child.

471 (17%) PROPN nodes have two children.

338 (12%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 18.

Children of PROPN nodes are attached using 28 different relations: case (699; 23% instances), flat:name (481; 16% instances), punct (426; 14% instances), dep (322; 11% instances), obl (272; 9% instances), conj (152; 5% instances), cc (134; 4% instances), amod (133; 4% instances), nmod:poss (68; 2% instances), acl:relcl (65; 2% instances), appos (48; 2% instances), cop (46; 2% instances), advmod (44; 1% instances), nsubj (22; 1% instances), nummod (15; 1% instances), mark (13; 0% instances), compound:prt (12; 0% instances), det (10; 0% instances), acl (7; 0% instances), xcomp (7; 0% instances), obj (4; 0% instances), nmod (3; 0% instances), parataxis (3; 0% instances), aux (2; 0% instances), expl (2; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PROPN (1030; 34% instances), ADP (712; 24% instances), PUNCT (426; 14% instances), NOUN (217; 7% instances), ADJ (142; 5% instances), CCONJ (134; 4% instances), NUM (92; 3% instances), VERB (67; 2% instances), AUX (53; 2% instances), ADV (48; 2% instances), PRON (32; 1% instances), DET (23; 1% instances), SCONJ (12; 0% instances), X (3; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)