Statistics of PROPN in UD_Norwegian-Bokmaal

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: `PROPN`

There are 4640 PROPN lemmas (19%), 5008 PROPN types (15%) and 18260 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Norge, Regjeringen, Obama, USA, Oslo, Jan, Cathrine, Stortinget, Svalbard, Den

The 10 most frequent PROPN types: Norge, Obama, Regjeringen, Jan, Oslo, USA, Den, Svalbard, Mayen, Stortinget

The 10 most frequent ambiguous lemmas: Oslo (PROPN 153, X 2), Den (PROPN 116, DET 1, X 1), The (PROPN 53, X 1), Fashanu (PROPN 44, X 2), Det (PROPN 27, X 3), Annan (PROPN 25, X 1), norsk (ADJ 498, NOUN 18, PROPN 1), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), Haram (PROPN 16, X 1), ©NTB (PROPN 14, X 1)

The 10 most frequent ambiguous types: Regjeringen (PROPN 168, NOUN 38), Oslo (PROPN 147, X 2), Den (DET 222, PROPN 116, PRON 82, X 1), The (PROPN 53, X 1), Regjeringens (PROPN 44, NOUN 5), Fashanu (PROPN 36, X 2), Arbeiderpartiet (PROPN 34, NOUN 1), Mitt (PROPN 28, PRON 9), Det (PRON 1652, DET 159, PROPN 27, X 3), Annan (PROPN 25, X 1)

Regjeringen
- PROPN 168: Regjeringen kan også smuldre opp innenfra .
- NOUN 38: Regjeringen har fastslått at røyking er farlig .
Oslo
- PROPN 147: Det var til og med langt fra Oslo .
- X 2: I love Oslo .
Den
- DET 222: Den albanske ambassador Aleksander Sallabanda fulgte opp
- PROPN 116: Og hun var medlem av Den Norske Nobelkomite i tolv år .
- PRON 82: Den som hun tok av seg da vi kom hjem .
- X 1: « Den underbara doften av sportaffär och syntet . »
The
- PROPN 53: Låten heter « The spirit carries on » .
- X 1: « The uniform says a lot about you , it can say that you’ve made it , or that you’re on your way . But where you end up is entirely up to you » .
Regjeringens
- PROPN 44: Bjarne Håkon Hanssen er en av Regjeringens største politiske begavelser .
- NOUN 5: Regjeringens skolereform Kunnskapsløftet skal gjennomføres fra 2006 .
Fashanu
- PROPN 36: På nettet lever Justin Fashanu videre .
- X 2: Ipswich-fansen , rivalene til Fashanus gamle klubb Norwich , synger den lite hyggelige supportersangen « He’s gay , he’s dead , he’s hanging in the shed , Fashanu , Fashanu » .
Arbeiderpartiet
- PROPN 34: Arbeiderpartiet har gått fra 200000 medlemmer etter krigen til 51000 i dag .
- NOUN 1: Arbeiderpartiet har vært på offensiven på de fleste målinger etter terrorangrepet 22. juli .
Mitt
- PROPN 28: Barack Obama leder med drøyt 50.000 flere enn Mitt Romney .
- PRON 9: - Mitt utgangspunkt er at vi skal jobbe med sikte på å bli bedre .
Det
- PRON 1652: Det ble rene ord :
- DET 159: Det beste , de viktigste lærdommene , lærer vi ikke noe av .
- PROPN 27: Det store vi er i dag lite kjent utenfor pressens vegger .
- X 3: « Det breinn !
Annan
- PROPN 25: Helt siden han tiltrådte , har Kofi Annan talt de fattiges sak .
- X 1: - President Annan of the world , erklærte Jean til klampetrapp .

Morphology

The form / lemma ratio of PROPN is 1.079310 (the average of all parts of speech is 1.381699).

The 1st highest number of forms (3) was observed with the lemma “Demokratene”: Demokratene, demokratenes, demokretane.

The 2nd highest number of forms (3) was observed with the lemma “EU”: EU, EU’s, EUs.

The 3rd highest number of forms (3) was observed with the lemma “FN”: FN, FN’s, FNs.

PROPN occurs with 3 features: Gender (2689; 15% instances), Case (1214; 7% instances), Abbr (653; 4% instances)

PROPN occurs with 5 feature-value pairs: Abbr=Yes, Case=Gen, Gender=Fem, Gender=Masc, Gender=Neut

PROPN occurs with 10 feature combinations. The most frequent feature combination is _ (13916 tokens). Examples: Norge, Obama, Regjeringen, Oslo, Den, Svalbard, Mayen, Cathrine, Bertelsen, Bergen

Relations

PROPN nodes are attached to their parents using 16 different relations: flat:name (4828; 26% instances), nsubj (4161; 23% instances), nmod (3894; 21% instances), obl (2096; 11% instances), conj (1069; 6% instances), appos (943; 5% instances), obj (488; 3% instances), root (419; 2% instances), parataxis (117; 1% instances), compound (110; 1% instances), xcomp (58; 0% instances), iobj (47; 0% instances), ccomp (14; 0% instances), dislocated (10; 0% instances), flat (5; 0% instances), csubj (1; 0% instances)

Parents of PROPN nodes belong to 13 different parts of speech: NOUN (6035; 33% instances), VERB (5969; 33% instances), PROPN (5119; 28% instances), ADJ (429; 2% instances), (419; 2% instances), PRON (75; 0% instances), ADV (62; 0% instances), DET (54; 0% instances), NUM (48; 0% instances), ADP (33; 0% instances), INTJ (8; 0% instances), X (5; 0% instances), SYM (4; 0% instances)

9193 (50%) PROPN nodes are leaves.

4917 (27%) PROPN nodes have one child.

2349 (13%) PROPN nodes have two children.

1801 (10%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 18.

Children of PROPN nodes are attached using 26 different relations: case (4914; 30% instances), flat:name (4700; 28% instances), punct (2515; 15% instances), conj (1181; 7% instances), nmod (801; 5% instances), cc (795; 5% instances), acl:relcl (272; 2% instances), advmod (250; 2% instances), amod (244; 1% instances), appos (229; 1% instances), det (165; 1% instances), cop (144; 1% instances), obl (128; 1% instances), nsubj (86; 1% instances), expl (51; 0% instances), xcomp (34; 0% instances), advcl (24; 0% instances), mark (20; 0% instances), nummod (20; 0% instances), parataxis (20; 0% instances), aux (9; 0% instances), compound (4; 0% instances), acl (3; 0% instances), csubj (2; 0% instances), discourse (1; 0% instances), reparandum (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: PROPN (5119; 31% instances), ADP (5003; 30% instances), PUNCT (2515; 15% instances), NOUN (1220; 7% instances), CCONJ (819; 5% instances), ADJ (485; 3% instances), VERB (331; 2% instances), NUM (228; 1% instances), DET (199; 1% instances), ADV (194; 1% instances), X (156; 1% instances), AUX (153; 1% instances), PRON (125; 1% instances), SYM (27; 0% instances), PART (20; 0% instances), SCONJ (18; 0% instances), INTJ (1; 0% instances)

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: PROPN

Morphology

Relations

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: `PROPN`