home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-GC: POS Tags: ADV

There are 957 ADV lemmas (6%), 989 ADV types (5%) and 10299 ADV tokens (10%). Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: ekki, þar, í, þá, á, fram, dagur, svo, upp, til

The 10 most frequent ADV types: ekki, þar, þá, í, á, fram, svo, upp, dag, til

The 10 most frequent ambiguous lemmas: ekki (ADV 747, NOUN 2), þar (ADV 381, PRON 1, SCONJ 1), í (ADP 3479, ADV 269, X 4, ADJ 1, NOUN 1, PROPN 1), þá (ADV 260, PRON 2, SCONJ 2, NOUN 1), á (ADP 2096, ADV 244, NOUN 18, SCONJ 12, VERB 7, X 3, PROPN 2), fram (ADV 241, X 1), dagur (ADV 207, NOUN 61, PROPN 2), svo (ADV 197, SCONJ 26), upp (ADV 179, ADP 15), til (ADP 848, ADV 166, SCONJ 17)

The 10 most frequent ambiguous types: ekki (ADV 717, NOUN 2), þar (ADV 327, SCONJ 1), þá (ADV 181, PRON 75, SCONJ 2), í (ADP 3313, ADV 255, X 4, PROPN 1), á (ADP 2211, ADV 238, VERB 85, SCONJ 12, NOUN 6, X 3, PROPN 2, PART 1), fram (ADV 234, X 1), svo (ADV 168, SCONJ 24), upp (ADV 180, ADP 13), dag (ADV 167, NOUN 24, PROPN 1), til (ADP 838, ADV 161, SCONJ 17)

Morphology

The form / lemma ratio of ADV is 1.033438 (the average of all parts of speech is 1.434754).

The 1st highest number of forms (8) was observed with the lemma “vika”: vika, viku, vikum, vikuna, vikunni, vikunum, vikur, vikurnar.

The 2nd highest number of forms (8) was observed with the lemma “ár”: ár, ára, ári, árin, árinu, árið, árum, árunum.

The 3rd highest number of forms (7) was observed with the lemma “dagur”: dag, daga, dagana, daginn, degi, deginum, dögum.

ADV occurs with 12 features: Case (983; 10% instances), Number (934; 9% instances), Gender (933; 9% instances), Degree (547; 5% instances), Definite (167; 2% instances), Person (19; 0% instances), PronType (18; 0% instances), VerbForm (9; 0% instances), Voice (6; 0% instances), NumType (4; 0% instances), Mood (1; 0% instances), Tense (1; 0% instances)

ADV occurs with 22 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Ind, NumType=Card, Number=Plur, Number=Sing, Person=3, PronType=Prs, Tense=Pres, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act

ADV occurs with 58 feature combinations. The most frequent feature combination is _ (8779 tokens). Examples: ekki, þar, þá, í, fram, á, svo, upp, til, út

Relations

ADV nodes are attached to their parents using 19 different relations: advmod (8613; 84% instances), fixed (986; 10% instances), compound:prt (408; 4% instances), obl (73; 1% instances), conj (59; 1% instances), obj (40; 0% instances), root (28; 0% instances), case (19; 0% instances), nsubj (14; 0% instances), iobj (11; 0% instances), mark (11; 0% instances), acl:relcl (8; 0% instances), advcl (7; 0% instances), nmod:poss (7; 0% instances), dep (5; 0% instances), xcomp (4; 0% instances), ccomp (3; 0% instances), cc (2; 0% instances), parataxis (1; 0% instances)

Parents of ADV nodes belong to 12 different parts of speech: VERB (5998; 58% instances), ADV (1533; 15% instances), NOUN (1396; 14% instances), ADJ (660; 6% instances), CCONJ (197; 2% instances), SCONJ (175; 2% instances), PRON (134; 1% instances), PROPN (101; 1% instances), NUM (70; 1% instances), (28; 0% instances), X (4; 0% instances), ADP (3; 0% instances)

7252 (70%) ADV nodes are leaves.

2050 (20%) ADV nodes have one child.

720 (7%) ADV nodes have two children.

277 (3%) ADV nodes have three or more children.

The highest child degree of a ADV node is 11.

Children of ADV nodes are attached using 23 different relations: fixed (1006; 23% instances), case (890; 20% instances), advmod (520; 12% instances), obl (297; 7% instances), punct (293; 7% instances), amod (273; 6% instances), flat (265; 6% instances), advcl (242; 5% instances), nummod (182; 4% instances), nmod (113; 3% instances), cc (89; 2% instances), nmod:poss (61; 1% instances), conj (57; 1% instances), ccomp (44; 1% instances), xcomp (30; 1% instances), nsubj (29; 1% instances), cop (19; 0% instances), dep (19; 0% instances), mark (17; 0% instances), obj (8; 0% instances), acl:relcl (6; 0% instances), aux (3; 0% instances), discourse (3; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: ADV (1533; 34% instances), ADP (907; 20% instances), NOUN (448; 10% instances), NUM (353; 8% instances), ADJ (299; 7% instances), VERB (296; 7% instances), PUNCT (293; 7% instances), PRON (152; 3% instances), CCONJ (103; 2% instances), PROPN (26; 1% instances), SCONJ (23; 1% instances), AUX (22; 0% instances), PART (5; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)