home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: VERB

There are 2058 VERB lemmas (8%), 4456 VERB types (13%) and 33351 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: ha, si, bli, få, komme, ta, være, gjøre, gå, se

The 10 most frequent VERB types: har, sier, er, blir, kommer, går, ha, få, bli, ta

The 10 most frequent ambiguous lemmas: ha (AUX 2853, VERB 1796, X 2), si (VERB 1387, ADJ 3), bli (AUX 1106, VERB 1042), (VERB 949, AUX 250, ADJ 98), komme (VERB 782, ADJ 9), ta (VERB 782, ADJ 2, X 1), være (AUX 8101, VERB 766, ADJ 1), gjøre (VERB 763, ADJ 3), (VERB 735, ADJ 3, X 1), se (VERB 669, ADJ 7)

The 10 most frequent ambiguous types: har (AUX 2219, VERB 1104, X 1), er (AUX 5494, VERB 496, X 4, DET 2), blir (VERB 342, AUX 226, X 2), går (VERB 296, NOUN 56), ha (VERB 283, AUX 218, X 2), (VERB 292, AUX 80, ADJ 69), bli (VERB 280, AUX 154), ta (VERB 271, X 1), ble (AUX 578, VERB 264), får (VERB 254, AUX 86)

Morphology

The form / lemma ratio of VERB is 2.165209 (the average of all parts of speech is 1.381699).

The 1st highest number of forms (7) was observed with the lemma “bygge”: bygd, bygde, byge, bygge, bygger, bygges, bygget.

The 2nd highest number of forms (7) was observed with the lemma “fortelle”: Fortell, Fotelle, fortalt, fortalte, fortelle, forteller, fortelles.

The 3rd highest number of forms (7) was observed with the lemma “kalle”: Kall, kalle, kaller, kalles, kallet, kalt, kalte.

VERB occurs with 8 features: VerbForm (33351; 100% instances), Mood (19379; 58% instances), Tense (19149; 57% instances), Voice (1147; 3% instances), Abbr (19; 0% instances), Definite (1; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)

VERB occurs with 13 feature-value pairs: Abbr=Yes, Definite=Ind, Gender=Fem,Masc, Mood=Imp, Mood=Ind, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Fin,Part, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 11 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (13057 tokens). Examples: har, sier, er, blir, kommer, går, mener, får, ser, gjør

Relations

VERB nodes are attached to their parents using 16 different relations: root (14167; 42% instances), advcl (4831; 14% instances), conj (4161; 12% instances), acl:relcl (4069; 12% instances), xcomp (1957; 6% instances), ccomp (1913; 6% instances), acl (1201; 4% instances), csubj (937; 3% instances), parataxis (42; 0% instances), dislocated (34; 0% instances), flat:name (14; 0% instances), reparandum (11; 0% instances), compound (10; 0% instances), csubj:outer (2; 0% instances), iobj (1; 0% instances), nmod (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: (14167; 42% instances), VERB (11002; 33% instances), NOUN (4590; 14% instances), ADJ (1831; 5% instances), PRON (867; 3% instances), ADV (364; 1% instances), PROPN (331; 1% instances), DET (82; 0% instances), ADP (66; 0% instances), NUM (41; 0% instances), INTJ (5; 0% instances), PART (2; 0% instances), X (2; 0% instances), AUX (1; 0% instances)

79 (0%) VERB nodes are leaves.

1103 (3%) VERB nodes have one child.

3595 (11%) VERB nodes have two children.

28574 (86%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 29 different relations: nsubj (22649; 17% instances), punct (21275; 16% instances), obl (15572; 12% instances), obj (13533; 10% instances), advmod (12576; 9% instances), mark (12315; 9% instances), aux (7270; 5% instances), xcomp (4619; 3% instances), cc (4185; 3% instances), conj (4019; 3% instances), advcl (3670; 3% instances), case (3538; 3% instances), ccomp (2933; 2% instances), expl (1879; 1% instances), cop (1241; 1% instances), aux:pass (1106; 1% instances), iobj (596; 0% instances), csubj (260; 0% instances), nsubj:outer (185; 0% instances), dislocated (146; 0% instances), parataxis (130; 0% instances), discourse (76; 0% instances), reparandum (35; 0% instances), nummod (15; 0% instances), flat (9; 0% instances), csubj:outer (2; 0% instances), det (2; 0% instances), compound (1; 0% instances), nmod (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (32935; 25% instances), PUNCT (21275; 16% instances), PRON (15989; 12% instances), VERB (11002; 8% instances), AUX (9620; 7% instances), SCONJ (8210; 6% instances), ADV (7797; 6% instances), ADJ (6168; 5% instances), PROPN (5969; 4% instances), PART (5956; 4% instances), CCONJ (4186; 3% instances), ADP (3643; 3% instances), NUM (573; 0% instances), DET (404; 0% instances), INTJ (80; 0% instances), X (29; 0% instances), SYM (2; 0% instances)