
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: PRON

There are 44 PRON lemmas (0%), 66 PRON types (0%) and 19366 PRON tokens (6%). Out of 17 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: det, dei, han, eg, vi, seg, sin, ein, dette, ho

The 10 most frequent PRON types: det, dei, han, eg, vi, seg, ein, dette, ho, me

The 10 most frequent ambiguous lemmas: det (PRON 5532, DET 1337, X 16), dei (PRON 1668, DET 1515), han (PRON 1632, X 5), vi (PRON 1588, X 7), seg (PRON 1166, X 3), ein (DET 5926, PRON 824), dette (PRON 721, DET 192, VERB 3, X 2), du (PRON 366, X 1), kva (PRON 301, DET 106), noko (PRON 283, NUM 24)

The 10 most frequent ambiguous types: det (PRON 4104, DET 1165, X 16, ADV 1), dei (PRON 1439, DET 1346), han (PRON 1223, X 5), vi (PRON 987, X 7, AUX 1), seg (PRON 1166, X 3, VERB 1), ein (DET 2404, PRON 728, ADP 1), dette (PRON 472, DET 161, X 2), me (PRON 393, ADV 1), kva (PRON 240, DET 93), noko (PRON 274, DET 183, NUM 23)

Morphology

The form / lemma ratio of PRON is 1.500000 (the average of all parts of speech is 1.346455).
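As a quick check, the ratio above can be reproduced from the counts reported on this page. This is a minimal sketch that assumes the form / lemma ratio is simply the number of PRON types divided by the number of PRON lemmas, which is consistent with 66 types and 44 lemmas; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Assumed definition: distinct word forms (types) per distinct lemma."""
    return n_types / n_lemmas


# Counts reported on this page for PRON: 66 types, 44 lemmas.
pron_ratio = form_lemma_ratio(66, 44)
print(f"{pron_ratio:.6f}")  # 1.500000
```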

The three top-ranked lemmas each have 4 forms: “det” (dei, det, dte, dét), “din” (di, din, dine, ditt) and “min” (mi, min, mine, mitt).

PRON occurs with 9 features: PronType (19338; 100% instances), Person (15382; 79% instances), Gender (10383; 54% instances), Case (9301; 48% instances), Animacy (6774; 35% instances), Number (4609; 24% instances), Poss (1550; 8% instances), Abbr (4; 0% instances), Polarity (1; 0% instances)

PRON occurs with 21 feature-value pairs: Abbr=Yes, Animacy=Hum, Case=Acc, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, PronType=Art,Prs, PronType=Ind,Prs, PronType=Int, PronType=Prs, PronType=Prs,Tot, PronType=Rcp

PRON occurs with 40 feature combinations. The most frequent feature combination is Gender=Neut|Person=3|PronType=Prs (6719 tokens). Examples: det, dette, noko, alt, slikt, dét, detta, dei, dte, sånt
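A feature combination such as the one above is written in the CoNLL-U FEATS notation: feature-value pairs joined by `|`, with `_` standing for no features. The following sketch shows one way such a string could be split into pairs; the helper name `parse_feats` is hypothetical, and multi-valued entries such as `Gender=Fem,Masc` are deliberately kept as a single value, matching how they are listed on this page.

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats == "_":  # "_" marks an empty feature set in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = parse_feats("Gender=Neut|Person=3|PronType=Prs")
print(most_frequent)
# {'Gender': 'Neut', 'Person': '3', 'PronType': 'Prs'}
```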

Relations

PRON nodes are attached to their parents using 20 different relations: nsubj (9672; 50% instances), expl (3462; 18% instances), obj (2112; 11% instances), nmod:poss (1526; 8% instances), obl (1133; 6% instances), nmod (452; 2% instances), iobj (306; 2% instances), root (152; 1% instances), nsubj:pass (149; 1% instances), conj (117; 1% instances), dislocated (66; 0% instances), appos (56; 0% instances), xcomp (46; 0% instances), nsubj:outer (37; 0% instances), ccomp (36; 0% instances), det (20; 0% instances), flat:name (14; 0% instances), compound (4; 0% instances), parataxis (3; 0% instances), reparandum (3; 0% instances)

Parents of PRON nodes belong to 11 different parts of speech: VERB (13051; 67% instances), NOUN (3203; 17% instances), ADJ (2304; 12% instances), PRON (171; 1% instances), ADV (164; 1% instances), no parent, i.e. the PRON node is the root (152; 1% instances), PROPN (127; 1% instances), DET (85; 0% instances), ADP (55; 0% instances), NUM (49; 0% instances), AUX (5; 0% instances)
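Counts of this kind can be derived from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below is not the script that generated this page; it is an illustrative reimplementation under stated assumptions. The sample sentence is hand-written (not taken from the treebank), the function name `pron_attachment_counts` is hypothetical, and the label `ROOT` for nodes whose head is 0 is an assumption.

```python
from collections import Counter

# Hand-written toy sentence in CoNLL-U format (NOT from the treebank).
SAMPLE = "\n".join([
    "1\tEg\teg\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tser\tsjå\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tdet\tdet\tPRON\t_\t_\t2\tobj\t_\t_",
    "4\t.\t$.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])


def pron_attachment_counts(conllu: str) -> tuple[Counter, Counter]:
    """Count relations and parent UPOS tags of PRON nodes."""
    relations: Counter = Counter()
    parent_pos: Counter = Counter()
    for block in conllu.strip().split("\n\n"):  # one block per sentence
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        upos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] != "PRON":
                continue
            relations[r[7]] += 1
            # Head "0" has no entry in upos_by_id, so it falls back to "ROOT".
            parent_pos[upos_by_id.get(r[6], "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = pron_attachment_counts(SAMPLE)
print(relations)   # Counter({'nsubj': 1, 'obj': 1})
print(parent_pos)  # Counter({'VERB': 2})
```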

16456 (85%) PRON nodes are leaves.

1811 (9%) PRON nodes have one child.

721 (4%) PRON nodes have two children.

378 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.
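The child-degree figures above can be cross-checked with simple arithmetic: the four groups should add up to the 19366 PRON tokens, and each percentage is the group size divided by that total, rounded to a whole number. A small sketch using only the numbers reported on this page (the variable names are illustrative):

```python
total_pron = 19366  # PRON tokens reported on this page

# Number of PRON nodes by number of children, as reported above.
by_children = {"0": 16456, "1": 1811, "2": 721, "3+": 378}

# The four groups partition all PRON tokens.
assert sum(by_children.values()) == total_pron

shares = {k: round(100 * v / total_pron) for k, v in by_children.items()}
print(shares)  # {'0': 85, '1': 9, '2': 4, '3+': 2}
```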

Children of PRON nodes are attached using 28 different relations: case (1626; 34% instances), acl:relcl (792; 16% instances), punct (541; 11% instances), obl (308; 6% instances), nmod (225; 5% instances), cop (216; 4% instances), advmod (207; 4% instances), det (182; 4% instances), nsubj (135; 3% instances), cc (134; 3% instances), conj (118; 2% instances), acl (104; 2% instances), expl (71; 1% instances), appos (59; 1% instances), mark (31; 1% instances), advcl (20; 0% instances), aux (17; 0% instances), amod (12; 0% instances), discourse (9; 0% instances), parataxis (7; 0% instances), xcomp (5; 0% instances), csubj (4; 0% instances), dislocated (2; 0% instances), flat:name (2; 0% instances), ccomp (1; 0% instances), flat (1; 0% instances), nummod (1; 0% instances), obj (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1637; 34% instances), VERB (827; 17% instances), PUNCT (541; 11% instances), NOUN (460; 10% instances), AUX (233; 5% instances), PROPN (224; 5% instances), DET (201; 4% instances), ADJ (188; 4% instances), PRON (171; 4% instances), ADV (133; 3% instances), CCONJ (133; 3% instances), PART (38; 1% instances), SCONJ (30; 1% instances), INTJ (9; 0% instances), NUM (6; 0% instances)