home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Polish-PDB: POS Tags: NOUN

There are 11292 NOUN lemmas (36%), 23361 NOUN types (38%) and 88634 NOUN tokens (25%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: rok, człowiek, mężczyzna, pan, osoba, kobieta, czas, sprawa, dziecko, dzień

The 10 most frequent NOUN types: mężczyzna, roku, pan, kobieta, lat, r, człowiek, pracy, chłopiec, osób

The 10 most frequent ambiguous lemmas: czas (NOUN 490, VERB 2), prawo (NOUN 258, ADV 6), sposób (NOUN 209, VERB 6), złoty (NOUN 170, ADJ 17), milion (NOUN 116, NUM 2), siła (NOUN 94, ADV 1), brak (NOUN 75, VERB 11), potrzeba (NOUN 70, VERB 5), stosować (NOUN 67, VERB 59, ADJ 33), prawda (NOUN 65, INTJ 2)

The 10 most frequent ambiguous types: r (NOUN 260, PROPN 2), sposób (NOUN 170, VERB 6), czas (NOUN 121, VERB 1), prawa (NOUN 86, ADJ 2), prawo (NOUN 77, ADV 6), lata (NOUN 73, VERB 2), proc (NOUN 75, ADJ 2), spraw (NOUN 58, VERB 1), danych (NOUN 60, ADJ 1), razem (NOUN 61, ADV 46)

Morphology

The form / lemma ratio of NOUN is 2.068810 (the average of all parts of speech is 1.966055).

The 1st highest number of forms (15) was observed with the lemma “rząd”: rząd, rządami, rządem, rządowi, rządu, rządy, rządzie, rządów, rzędach, rzędami, rzędem, rzędu, rzędy, rzędzie, rzędów.

The 2nd highest number of forms (11) was observed with the lemma “człowiek”: cztowiek, człowiek, człowieka, człowiekiem, człowiekowi, człowieku, ludzi, ludziach, ludzie, ludziom, ludźmi.

The 3rd highest number of forms (11) was observed with the lemma “oko”: oczach, oczami, oczom, oczu, oczy, oczyma, oczów, oka, okiem, oko, oku.

NOUN occurs with 10 features: Case (87150; 98% instances), Gender (87150; 98% instances), Number (87150; 98% instances), Animacy (38821; 44% instances), Aspect (3913; 4% instances), Polarity (3913; 4% instances), VerbForm (3913; 4% instances), Abbr (1484; 2% instances), NumType (725; 1% instances), Polite (12; 0% instances)

NOUN occurs with 24 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Inan, Animacy=Nhum, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Sets, Number=Plur, Number=Ptan, Number=Sing, Polarity=Neg, Polarity=Pos, Polite=Depr, VerbForm=Vnoun

NOUN occurs with 129 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing (7847 tokens). Examples: pracy, wody, rady, komisji, strony, unii, góry, ochrony, polityki, pomocy

Relations

NOUN nodes are attached to their parents using 40 different relations: obl (15614; 18% instances), nsubj (14031; 16% instances), nmod (12569; 14% instances), obj (12174; 14% instances), nmod:arg (7537; 9% instances), conj (6275; 7% instances), obl:arg (6068; 7% instances), iobj (4490; 5% instances), fixed (2346; 3% instances), root (1549; 2% instances), nmod:flat (969; 1% instances), appos (866; 1% instances), nsubj:pass (848; 1% instances), obl:agent (672; 1% instances), obl:cmpr (645; 1% instances), nmod:poss (262; 0% instances), flat (252; 0% instances), xcomp:pred (241; 0% instances), parataxis:obj (235; 0% instances), parataxis:insert (213; 0% instances), vocative (166; 0% instances), ccomp (151; 0% instances), acl:relcl (112; 0% instances), advcl (89; 0% instances), ccomp:obj (77; 0% instances), xcomp (58; 0% instances), orphan (51; 0% instances), csubj (18; 0% instances), ccomp:cleft (11; 0% instances), flat:foreign (9; 0% instances), list (7; 0% instances), acl (6; 0% instances), nmod:pred (5; 0% instances), advcl:relcl (4; 0% instances), case (4; 0% instances), advcl:cmpr (3; 0% instances), advmod:emph (3; 0% instances), xcomp:cleft (2; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (43973; 50% instances), NOUN (31157; 35% instances), ADJ (7636; 9% instances), ADP (2305; 3% instances), (1549; 2% instances), PROPN (717; 1% instances), ADV (407; 0% instances), DET (301; 0% instances), NUM (282; 0% instances), PRON (240; 0% instances), PART (31; 0% instances), INTJ (19; 0% instances), X (17; 0% instances)

18263 (21%) NOUN nodes are leaves.

33363 (38%) NOUN nodes have one child.

23702 (27%) NOUN nodes have two children.

13306 (15%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.

Children of NOUN nodes are attached using 59 different relations: case (29795; 23% instances), amod (22463; 17% instances), nmod (15695; 12% instances), punct (10539; 8% instances), nmod:arg (8165; 6% instances), conj (6176; 5% instances), acl (4635; 4% instances), cc (4136; 3% instances), det (3978; 3% instances), obj (2547; 2% instances), acl:relcl (2102; 2% instances), appos (1783; 1% instances), det:poss (1669; 1% instances), cop (1634; 1% instances), advmod:emph (1534; 1% instances), amod:flat (1452; 1% instances), mark (1404; 1% instances), nummod (1302; 1% instances), nummod:gov (1029; 1% instances), nsubj (913; 1% instances), nmod:flat (699; 1% instances), det:numgov (565; 0% instances), nmod:poss (564; 0% instances), advmod (438; 0% instances), obl:arg (404; 0% instances), ccomp (324; 0% instances), obl:agent (322; 0% instances), obl (289; 0% instances), aux (267; 0% instances), parataxis:insert (235; 0% instances), det:nummod (231; 0% instances), iobj (183; 0% instances), flat (152; 0% instances), expl:pv (136; 0% instances), cc:preconj (135; 0% instances), parataxis:obj (134; 0% instances), advmod:neg (123; 0% instances), advcl (119; 0% instances), list (85; 0% instances), orphan (64; 0% instances), obl:cmpr (56; 0% instances), xcomp (49; 0% instances), fixed (31; 0% instances), discourse:intj (24; 0% instances), csubj (20; 0% instances), ccomp:obj (18; 0% instances), vocative (18; 0% instances), aux:clitic (17; 0% instances), aux:cnd (15; 0% instances), nmod:pred (12; 0% instances), xcomp:subj (10; 0% instances), flat:foreign (9; 0% instances), advcl:cmpr (6; 0% instances), aux:imp (3; 0% instances), dep (3; 0% instances), xcomp:pred (3; 0% instances), discourse (2; 0% instances), nummod:flat (2; 0% instances), advmod:arg (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (31157; 24% instances), ADP (29729; 23% instances), ADJ (28667; 22% instances), PUNCT (10539; 8% instances), DET (6632; 5% instances), PROPN (4541; 4% instances), CCONJ (4181; 3% instances), VERB (2789; 2% instances), NUM (2389; 2% instances), AUX (1936; 2% instances), PRON (1771; 1% instances), PART (1551; 1% instances), SCONJ (1398; 1% instances), ADV (901; 1% instances), X (505; 0% instances), INTJ (24; 0% instances), SYM (9; 0% instances)