Treebank Statistics: UD_Icelandic-Modern: POS Tags: PROPN
There are 878 PROPN
lemmas (14%), 1134 PROPN
types (11%) and 2743 PROPN
tokens (3%).
Out of 16 observed tags, the rank of PROPN
is: 2 in number of lemmas, 4 in number of types and 12 in number of tokens.
The 10 most frequent PROPN
lemmas: ísland, þingmaður, ólympíuleikar, hrafnhildur, alþingi, rúv, ríó, em, íslendingur, evrópusamband
The 10 most frequent PROPN
types: þm., Íslands, RÚV, Hrafnhildur, Ríó, EM, Ísland, Ólympíuleikunum, Alþingi, Íslandi
The 10 most frequent ambiguous lemmas: þingmaður (NOUN 299, PROPN 82, ADV 1), em (PROPN 32, X 1), evrópumót (PROPN 21, NOUN 1), london (PROPN 20, ADV 1), hm (PROPN 17, ADV 1), danmörk (PROPN 15, X 1), forseti (NOUN 409, PROPN 11), fram (ADP 223, PROPN 11, ADV 4), de (PROPN 10, ADV 1, X 1), helgi (PROPN 10, NOUN 7)
The 10 most frequent ambiguous types: þm. (PROPN 80, NOUN 1), EM (PROPN 32, X 1), London (PROPN 20, ADV 1), HM (PROPN 17, ADV 1), Forseti (NOUN 132, PROPN 11), Fram (PROPN 11, ADP 1), de (PROPN 10, ADV 1, X 1), Millar (PROPN 6, ADV 1), Djokovic (PROPN 5, ADV 1), pírata (NOUN 2, PROPN 1)
- þm.
- PROPN 80: Virðulegi forseti . Ég þakka hv. þm. Pétri H. Blöndal fyrir svarið .
- NOUN 1: Herra forseti . Ég vil spyrja hv. 4. þm. Reykv. svo , Pétur H. Blöndal , út í eitt má segja lagatæknilegt eða þinglegt atriði sem varðar atriði þessa máls og það er sú staðreynd að inn í þetta frumvarp eiga nú að bætast samkvæmt breytingartillögum meiri hlutans þrír nýir kaflar eða það á að opna upp löggjöf á þremur nýjum sviðum hér við 2. umr. málsins .
- EM
- London
- HM
- Forseti
- Fram
- PROPN 11: Fram hefur 2 stig eftir fyrstu þrjár umferðirnar en Haukar hafa 4 stig .
- ADP 1: Fram kom í formála ráðherra að þarna væru í rauninni þrjár sviðsmyndir : Það er að hægja á verkefninu , fara strax í könnunarviðræður eða halda áfram að safna þessum upplýsingum , eins og ráðgjafarhópurinn gerir grein fyrir í skýrslunni , og sú varð niðurstaðan .
- de
- Millar
- Djokovic
- pírata
- NOUN 2: Ég hef sérstaklega nefnt 8. gr. og líka 11. gr. sem eru í stefnu okkar pírata og ég sé ekki hvers vegna ekki mátti hafa þær með .
- PROPN 1: Ég sé að hér er samstaða milli hv. þingmanns Samfylkingar , hv. þingmanns Sjálfstæðisflokks Forseti hringir . og vissulega okkar pírata og því vona ég að þetta starf muni bara halda áfram og styrkjast .
Morphology
The form / lemma ratio of PROPN
is 1.291572 (the average of all parts of speech is 1.738114).
The 1st highest number of forms (7) was observed with the lemma “ólympíuleikar”: Ólympíuleika, Ólympíuleikana, Ólympíuleikanna, Ólympíuleikar, Ólympíuleikarnir, Ólympíuleikum, Ólympíuleikunum.
The 2nd highest number of forms (6) was observed with the lemma “evrópumót”: Evrópumót, Evrópumóti, Evrópumótinu, Evrópumótið, Evrópumóts, Evrópumótsins.
The 3rd highest number of forms (6) was observed with the lemma “framsóknarflokkur”: Framsóknarflokki, Framsóknarflokkinn, Framsóknarflokknum, Framsóknarflokksins, Framsóknarflokkur, Framsóknarflokkurinn.
PROPN
occurs with 5 features: Case (2036; 74% instances), Definite (2036; 74% instances), Number (2036; 74% instances), Gender (1994; 73% instances), Foreign (88; 3% instances)
PROPN
occurs with 12 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Definite=Def
, Definite=Ind
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
PROPN
occurs with 50 feature combinations.
The most frequent feature combination is _
(619 tokens).
Examples: þm., RÚV, EM, H., HM, London, Collins, KSÍ, United, KR
Relations
PROPN
nodes are attached to their parents using 18 different relations: obl (694; 25% instances), nsubj (516; 19% instances), flat:name (481; 18% instances), dep (325; 12% instances), nmod:poss (313; 11% instances), conj (149; 5% instances), appos (94; 3% instances), obj (75; 3% instances), root (29; 1% instances), iobj (27; 1% instances), xcomp (17; 1% instances), advcl (6; 0% instances), acl:relcl (5; 0% instances), amod (5; 0% instances), ccomp (3; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), discourse (1; 0% instances)
Parents of PROPN
nodes belong to 12 different parts of speech: PROPN (1030; 38% instances), VERB (865; 32% instances), NOUN (668; 24% instances), ADJ (56; 2% instances), PRON (40; 1% instances), (29; 1% instances), ADV (22; 1% instances), AUX (13; 0% instances), DET (8; 0% instances), ADP (6; 0% instances), NUM (3; 0% instances), PART (3; 0% instances)
1175 (43%) PROPN
nodes are leaves.
759 (28%) PROPN
nodes have one child.
471 (17%) PROPN
nodes have two children.
338 (12%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 18.
Children of PROPN
nodes are attached using 28 different relations: case (699; 23% instances), flat:name (481; 16% instances), punct (426; 14% instances), dep (322; 11% instances), obl (272; 9% instances), conj (152; 5% instances), cc (134; 4% instances), amod (133; 4% instances), nmod:poss (68; 2% instances), acl:relcl (65; 2% instances), appos (48; 2% instances), cop (46; 2% instances), advmod (44; 1% instances), nsubj (22; 1% instances), nummod (15; 1% instances), mark (13; 0% instances), compound:prt (12; 0% instances), det (10; 0% instances), acl (7; 0% instances), xcomp (7; 0% instances), obj (4; 0% instances), nmod (3; 0% instances), parataxis (3; 0% instances), aux (2; 0% instances), expl (2; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances)
Children of PROPN
nodes belong to 16 different parts of speech: PROPN (1030; 34% instances), ADP (712; 24% instances), PUNCT (426; 14% instances), NOUN (217; 7% instances), ADJ (142; 5% instances), CCONJ (134; 4% instances), NUM (92; 3% instances), VERB (67; 2% instances), AUX (53; 2% instances), ADV (48; 2% instances), PRON (32; 1% instances), DET (23; 1% instances), SCONJ (12; 0% instances), X (3; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)