Treebank Statistics: UD_Swedish-Talbanken: POS Tags: PROPN
There are 529 PROPN
lemmas (5%), 571 PROPN
types (4%) and 1360 PROPN
tokens (1%).
Out of 16 observed tags, the rank of PROPN
is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.
The 10 most frequent PROPN
lemmas: Sverige, EEC, Stockholm, Gud, USA, ATP, Göteborg, Horn, Danmark, Indien
The 10 most frequent PROPN
types: Sverige, EEC, Stockholm, USA, ATP, Sveriges, EEC:s, Gud, Göteborg, Horn
The 10 most frequent ambiguous lemmas: Björn (PROPN 6, NOUN 1), DDT (PROPN 2, NOUN 1), de (PRON 588, DET 10, PROPN 2), s (NOUN 2, PROPN 1), vi (PRON 552, PROPN 2, NOUN 1), väckarklocka (NOUN 2, PROPN 2), Children (NOUN 1, PROPN 1), I (ADP 1, PROPN 1), SKB (NOUN 1, PROPN 1), backa (PROPN 1, VERB 1)
The 10 most frequent ambiguous types: Björn (PROPN 6, NOUN 1), DDT (PROPN 2, NOUN 1), Vi (PRON 80, PROPN 2), de (DET 512, PRON 292, PROPN 2), s (NOUN 2, PROPN 1), Children (NOUN 1, PROPN 1), Handelsbanken (NOUN 1, PROPN 1), I (ADP 268, NOUN 1, NUM 1, PROPN 1), SKB (NOUN 1, PROPN 1), Ännu (ADV 5, PROPN 1)
- Björn
- DDT
- Vi
- de
- s
- Children
- Handelsbanken
- I
- ADP 268: I och med att kvinnan axlar mer krävande uppgifter får hon högre lön .
- NOUN 1: På I 12 i Eksjö tog en kompanichef för ett pansarvärnskompani saken i egna händer .
- NUM 1: De åldersgrupper , som lekcirkeln i första hand är tänkt för är - förutom babybarnen - småbarn ( 1-3 år ) , förskolebarn ( 4-6 år ) , skolbarn I ( 7-9 år ) och skolbarn II ( 10-12 år ) .
- PROPN 1: De nationalitetsbeteckningar som bör användas i förbindelse med postnummer till utlandet är följande : Belgien B Danmark DK Finland SF Frankrike F Italien I Liechtenstein FL Norge N Schweiz CH Västtyskland D Österrike A
- SKB
- Ännu
Morphology
The form / lemma ratio of PROPN
is 1.079395 (the average of all parts of speech is 1.422803).
The 1st highest number of forms (3) was observed with the lemma “FN”: FN, FN:s, FNs.
The 2nd highest number of forms (3) was observed with the lemma “Göteborg”: Göteborg, Göteborgs, Göteborgs-.
The 3rd highest number of forms (2) was observed with the lemma “Afrika”: Afrika, Afrikas.
PROPN
occurs with 1 features: Case (1336; 98% instances)
PROPN
occurs with 2 feature-value pairs: Case=Gen
, Case=Nom
PROPN
occurs with 3 feature combinations.
The most frequent feature combination is Case=Nom
(1183 tokens).
Examples: Sverige, EEC, Stockholm, USA, ATP, Gud, Göteborg, Horn, Danmark, Indien
Relations
PROPN
nodes are attached to their parents using 20 different relations: conj (271; 20% instances), nsubj (234; 17% instances), obl (215; 16% instances), nmod (186; 14% instances), flat:name (156; 11% instances), nmod:poss (134; 10% instances), appos (43; 3% instances), root (35; 3% instances), obj (34; 3% instances), obl:agent (18; 1% instances), nsubj:pass (7; 1% instances), acl (6; 0% instances), compound (6; 0% instances), parataxis (6; 0% instances), advcl (2; 0% instances), iobj (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), csubj:pass (1; 0% instances)
Parents of PROPN
nodes belong to 9 different parts of speech: PROPN (428; 31% instances), VERB (426; 31% instances), NOUN (400; 29% instances), ADJ (49; 4% instances), (35; 3% instances), ADV (14; 1% instances), ADP (3; 0% instances), PRON (3; 0% instances), NUM (2; 0% instances)
432 (32%) PROPN
nodes are leaves.
585 (43%) PROPN
nodes have one child.
163 (12%) PROPN
nodes have two children.
180 (13%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 33.
Children of PROPN
nodes are attached using 25 different relations: case (405; 24% instances), punct (267; 16% instances), conj (266; 16% instances), nmod (203; 12% instances), flat:name (157; 9% instances), cc (132; 8% instances), advmod (36; 2% instances), amod (36; 2% instances), obl (27; 2% instances), nummod (24; 1% instances), appos (22; 1% instances), acl:relcl (19; 1% instances), mark (13; 1% instances), det (9; 1% instances), parataxis (9; 1% instances), cop (7; 0% instances), orphan (5; 0% instances), acl:cleft (4; 0% instances), expl (4; 0% instances), acl (3; 0% instances), nsubj (3; 0% instances), xcomp (3; 0% instances), nmod:poss (2; 0% instances), advcl (1; 0% instances), dislocated (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PROPN (428; 26% instances), ADP (409; 25% instances), PUNCT (267; 16% instances), NOUN (233; 14% instances), CCONJ (118; 7% instances), ADV (47; 3% instances), NUM (45; 3% instances), ADJ (42; 3% instances), VERB (22; 1% instances), SCONJ (12; 1% instances), SYM (10; 1% instances), DET (9; 1% instances), PRON (8; 0% instances), AUX (7; 0% instances), PART (1; 0% instances)