Treebank Statistics: UD_Swedish-Talbanken: POS Tags: NUM
There are 483 NUM
lemmas (4%), 485 NUM
types (3%) and 1740 NUM
tokens (2%).
Out of 17 observed tags, the rank of NUM
is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: två, en, tre, 1, 20, 2, 1970, 3, 10, 1971
The 10 most frequent NUM
types: två, tre, 1, en, 20, 2, ett, 1970, 3, 10
The 10 most frequent ambiguous lemmas: två (NUM 110, PRON 1), en (DET 2090, NUM 91, PRON 24), sex (NUM 16, NOUN 1), tio (NUM 16, ADJ 1), i (ADP 2513, NUM 1)
The 10 most frequent ambiguous types: en (DET 1287, NUM 55, PRON 43), ett (DET 599, NUM 31, PRON 4), sex (NUM 15, NOUN 1), I (ADP 268, NOUN 1, NUM 1, PROPN 1)
- en
- ett
- sex
- I
- ADP 268: I och med att kvinnan axlar mer krävande uppgifter får hon högre lön .
- NOUN 1: På I 12 i Eksjö tog en kompanichef för ett pansarvärnskompani saken i egna händer .
- NUM 1: De åldersgrupper , som lekcirkeln i första hand är tänkt för är - förutom babybarnen - småbarn ( 1-3 år ) , förskolebarn ( 4-6 år ) , skolbarn I ( 7-9 år ) och skolbarn II ( 10-12 år ) .
- PROPN 1: De nationalitetsbeteckningar som bör användas i förbindelse med postnummer till utlandet är följande : Belgien B Danmark DK Finland SF Frankrike F Italien I Liechtenstein FL Norge N Schweiz CH Västtyskland D Österrike A
Morphology
The form / lemma ratio of NUM
is 1.004141 (the average of all parts of speech is 1.430604).
The 1st highest number of forms (2) was observed with the lemma “1”: 1, 1).
The 2nd highest number of forms (2) was observed with the lemma “en”: en, ett.
The 3rd highest number of forms (2) was observed with the lemma “tre”: tre, tre-.
NUM
occurs with 5 features: NumType (1740; 100% instances), Case (1732; 100% instances), Definite (91; 5% instances), Gender (91; 5% instances), Number (91; 5% instances)
NUM
occurs with 6 feature-value pairs: Case=Nom
, Definite=Ind
, Gender=Com
, Gender=Neut
, NumType=Card
, Number=Sing
NUM
occurs with 4 feature combinations.
The most frequent feature combination is Case=Nom|NumType=Card
(1641 tokens).
Examples: två, tre, 1, 20, 2, 1970, 3, 10, 1971, 7
Relations
NUM
nodes are attached to their parents using 17 different relations: nummod (1339; 77% instances), obl (163; 9% instances), nmod (95; 5% instances), conj (37; 2% instances), nsubj (21; 1% instances), appos (20; 1% instances), obj (19; 1% instances), root (17; 1% instances), orphan (11; 1% instances), list (4; 0% instances), advcl (3; 0% instances), nsubj:pass (3; 0% instances), xcomp (3; 0% instances), acl (2; 0% instances), ccomp (1; 0% instances), dislocated (1; 0% instances), parataxis (1; 0% instances)
Parents of NUM
nodes belong to 10 different parts of speech: NOUN (1271; 73% instances), VERB (229; 13% instances), NUM (111; 6% instances), ADJ (51; 3% instances), PROPN (45; 3% instances), (17; 1% instances), ADV (9; 1% instances), DET (3; 0% instances), PRON (3; 0% instances), ADP (1; 0% instances)
1069 (61%) NUM
nodes are leaves.
445 (26%) NUM
nodes have one child.
137 (8%) NUM
nodes have two children.
89 (5%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 8.
Children of NUM
nodes are attached using 22 different relations: advmod (268; 26% instances), nmod (229; 22% instances), case (188; 18% instances), punct (83; 8% instances), conj (64; 6% instances), nummod (54; 5% instances), cc (52; 5% instances), obl (39; 4% instances), det (15; 1% instances), nsubj (12; 1% instances), cop (11; 1% instances), appos (8; 1% instances), acl:relcl (5; 0% instances), mark (5; 0% instances), amod (4; 0% instances), nmod:poss (3; 0% instances), acl (2; 0% instances), advcl (2; 0% instances), acl:cleft (1; 0% instances), expl (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)
Children of NUM
nodes belong to 15 different parts of speech: ADV (289; 28% instances), NOUN (268; 26% instances), ADP (167; 16% instances), NUM (111; 11% instances), PUNCT (83; 8% instances), CCONJ (43; 4% instances), ADJ (20; 2% instances), SYM (19; 2% instances), DET (15; 1% instances), AUX (11; 1% instances), VERB (10; 1% instances), PRON (5; 0% instances), PART (3; 0% instances), PROPN (2; 0% instances), SCONJ (2; 0% instances)