Statistics of NOUN in UD


This page pertains to UD version 2.


Treebank Statistics: UD_Polish-PDB: POS Tags: `NOUN`

There are 11292 NOUN lemmas (36%), 23361 NOUN types (38%) and 88634 NOUN tokens (25%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
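Counts of this kind can be sketched from a CoNLL-U file as follows. This is a minimal illustration, not the script that generated this page: the helper names `row` and `pos_stats`, the toy sentences and their annotation are hypothetical, and lowercasing word forms before counting types is an assumption.

```python
def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

# Hypothetical toy annotation, not copied from UD_Polish-PDB.
SAMPLE = "\n".join([
    "# text = On interpretuje prawo .",
    row(1, "On", "on", "PRON", 2, "nsubj"),
    row(2, "interpretuje", "interpretować", "VERB", 0, "root"),
    row(3, "prawo", "prawo", "NOUN", 2, "obj"),
    row(4, ".", ".", "PUNCT", 2, "punct"),
    "",
    "# text = Zmiana prawa pomaga .",
    row(1, "Zmiana", "zmiana", "NOUN", 3, "nsubj"),
    row(2, "prawa", "prawo", "NOUN", 1, "nmod"),
    row(3, "pomaga", "pomagać", "VERB", 0, "root"),
    row(4, ".", ".", "PUNCT", 3, "punct"),
])

def pos_stats(conllu_text, upos="NOUN"):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges ("3-4") and empty nodes ("5.1")
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1].lower())  # assumption: types are case-folded
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens

print(pos_stats(SAMPLE))
```

On the toy sample, the two forms "prawo" and "prawa" share the lemma "prawo", so the number of types exceeds the number of lemmas.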

The 10 most frequent NOUN lemmas: rok, człowiek, mężczyzna, pan, osoba, kobieta, czas, sprawa, dziecko, dzień

The 10 most frequent NOUN types: mężczyzna, roku, pan, kobieta, lat, r, człowiek, pracy, chłopiec, osób

The 10 most frequent ambiguous lemmas: czas (NOUN 490, VERB 2), prawo (NOUN 258, ADV 6), sposób (NOUN 209, VERB 6), złoty (NOUN 170, ADJ 17), milion (NOUN 116, NUM 2), siła (NOUN 94, ADV 1), brak (NOUN 75, VERB 11), potrzeba (NOUN 70, VERB 5), stosować (NOUN 67, VERB 59, ADJ 33), prawda (NOUN 65, INTJ 2)

The 10 most frequent ambiguous types: r (NOUN 260, PROPN 2), sposób (NOUN 170, VERB 6), czas (NOUN 121, VERB 1), prawa (NOUN 86, ADJ 2), prawo (NOUN 77, ADV 6), lata (NOUN 73, VERB 2), proc (NOUN 75, ADJ 2), spraw (NOUN 58, VERB 1), danych (NOUN 60, ADJ 1), razem (NOUN 61, ADV 46)

r
- NOUN 260: Data ważności mleka na kartonie - 2 marzec 1998 r .
- PROPN 2: Podczas wyborów prezydenckich w 1995 r . Kwaśniewski kłamał jak z nut , że jest magistrem .
sposób
- NOUN 170: W ten sposób to było by zbudowane powoli i starannie .
- VERB 6: Nie sposób nie żywić uczucia podziwu dla odwagi pierwszych żeglarzy .
czas
- NOUN 121: Tymczasem w Słupsku cały czas służby pozostają w pogotowiu .
- VERB 1: Dajcie spokój , czas iść do domu .
prawa
- NOUN 86: Zdaniem specjalistów może pomóc zmiana prawa budowlanego .
- ADJ 2: Klaskał mu prezydent Kwaśniewski oraz zgodnie prawa i lewa strona .
prawo
- NOUN 77: On interpretuje prawo na swój sposób .
- ADV 6: – bo zamiast w prawo na trzy siostry skręcił em tego dnia na te . .
lata
- NOUN 73: Jak Polacy wykorzystali ostatnie cztery lata ?
- VERB 2: Połyskujący niebieski owad lata obok rośliny z kwiatkami .
proc
- NOUN 75: Resztę Howell będzie płacił w ratach - co roku 5 proc .
- ADJ 2: Planowany wzrost przychodów najludniejszych gmin w br . będzie o 2 pkt . proc . wyższy od prognozowanej na br . stopy średniorocznej inflacji ( 15 proc . ) , natomiast wzrost wydatków przewyższa o 12 pkt . proc . stopę inflacji .
spraw
- NOUN 58: To okres na zamykanie , kończenie różnych spraw .
- VERB 1: Przy dziewiątym spośród dziesięciu nieskomplikowanych życiowych zaleceń : codziennie spraw sobie jakąś nagrodę , kolega przerwał : Tyle piwa nie dam rady wypić .
danych
- NOUN 60: Jaki jest cel zbierania danych na temat infrastruktury energetycznej ?
- ADJ 1: Jacek Ambroziak powiedział , że celem tej zmiany jest także to , aby czas prywatyzacji danego przedsiębiorstwa miał znaczenie dla wartości akcji danych pracownikom , tzn . im szybciej przedsiębiorstwo będzie sprywatyzowane , tym większa będzie wartość otrzymywanych przez pracowników bezpłatnych akcji .
razem
- NOUN 61: Tym razem nie zdążyli : skoczył głową w dół .
- ADV 46: Już niedługo będziemy się razem bawić .
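A list of ambiguous types like the one above could be derived by grouping tag counts per word form and keeping the forms that occur with more than one tag. The sketch below assumes the tokens are already available as `(form, upos)` pairs; the function name `ambiguous_types` and the toy input are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_types(tokens):
    """Map each word form seen with more than one UPOS tag to its tag counts."""
    tags_by_form = defaultdict(Counter)
    for form, upos in tokens:
        tags_by_form[form.lower()][upos] += 1
    return {form: tags for form, tags in tags_by_form.items() if len(tags) > 1}

# Hypothetical toy input, not the real treebank counts.
TOY_TOKENS = [
    ("razem", "NOUN"),
    ("razem", "ADV"),
    ("razem", "NOUN"),
    ("lata", "NOUN"),
    ("czas", "NOUN"),
]

for form, tags in ambiguous_types(TOY_TOKENS).items():
    # most_common() lists tags from the most to the least frequent
    print(form, tags.most_common())
```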

Morphology

The form / lemma ratio of NOUN is 2.068810 (the average of all parts of speech is 1.966055).
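The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given earlier on this page. That reading is an inference from the numbers, not a documented definition; the variable names below are illustrative.

```python
# Counts taken from the summary at the top of this page.
noun_types = 23361
noun_lemmas = 11292

# Assumption: "form / lemma ratio" means distinct word forms (types) per lemma.
form_lemma_ratio = noun_types / noun_lemmas
print(f"{form_lemma_ratio:.6f}")
```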

The highest number of forms (15) was observed with the lemma “rząd”: rząd, rządami, rządem, rządowi, rządu, rządy, rządzie, rządów, rzędach, rzędami, rzędem, rzędu, rzędy, rzędzie, rzędów.

The 2nd highest number of forms (11) was observed with the lemma “człowiek”: cztowiek, człowiek, człowieka, człowiekiem, człowiekowi, człowieku, ludzi, ludziach, ludzie, ludziom, ludźmi.

The 3rd highest number of forms (11) was observed with the lemma “oko”: oczach, oczami, oczom, oczu, oczy, oczyma, oczów, oka, okiem, oko, oku.

NOUN occurs with 10 features: Case (87150; 98% instances), Gender (87150; 98% instances), Number (87150; 98% instances), Animacy (38821; 44% instances), Aspect (3913; 4% instances), Polarity (3913; 4% instances), VerbForm (3913; 4% instances), Abbr (1484; 2% instances), NumType (725; 1% instances), Polite (12; 0% instances)

NOUN occurs with 24 feature-value pairs: Abbr=Yes, Animacy=Hum, Animacy=Inan, Animacy=Nhum, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Sets, Number=Plur, Number=Ptan, Number=Sing, Polarity=Neg, Polarity=Pos, Polite=Depr, VerbForm=Vnoun

NOUN occurs with 129 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing (7847 tokens). Examples: pracy, wody, rady, komisji, strony, unii, góry, ochrony, polityki, pomocy

Relations

NOUN nodes are attached to their parents using 40 different relations: obl (15614; 18% instances), nsubj (14031; 16% instances), nmod (12569; 14% instances), obj (12174; 14% instances), nmod:arg (7537; 9% instances), conj (6275; 7% instances), obl:arg (6068; 7% instances), iobj (4490; 5% instances), fixed (2346; 3% instances), root (1549; 2% instances), nmod:flat (969; 1% instances), appos (866; 1% instances), nsubj:pass (848; 1% instances), obl:agent (672; 1% instances), obl:cmpr (645; 1% instances), nmod:poss (262; 0% instances), flat (252; 0% instances), xcomp:pred (241; 0% instances), parataxis:obj (235; 0% instances), parataxis:insert (213; 0% instances), vocative (166; 0% instances), ccomp (151; 0% instances), acl:relcl (112; 0% instances), advcl (89; 0% instances), ccomp:obj (77; 0% instances), xcomp (58; 0% instances), orphan (51; 0% instances), csubj (18; 0% instances), ccomp:cleft (11; 0% instances), flat:foreign (9; 0% instances), list (7; 0% instances), acl (6; 0% instances), nmod:pred (5; 0% instances), advcl:relcl (4; 0% instances), case (4; 0% instances), advcl:cmpr (3; 0% instances), advmod:emph (3; 0% instances), xcomp:cleft (2; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances)

Parents of NOUN nodes fall into 13 different categories (12 parts of speech plus the artificial root, which has no tag): VERB (43973; 50% instances), NOUN (31157; 35% instances), ADJ (7636; 9% instances), ADP (2305; 3% instances), root, no tag (1549; 2% instances), PROPN (717; 1% instances), ADV (407; 0% instances), DET (301; 0% instances), NUM (282; 0% instances), PRON (240; 0% instances), PART (31; 0% instances), INTJ (19; 0% instances), X (17; 0% instances)

18263 (21%) NOUN nodes are leaves.

33363 (38%) NOUN nodes have one child.

23702 (27%) NOUN nodes have two children.

13306 (15%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.
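The child-degree breakdown above (leaves, one child, two children, three or more) can be sketched by counting how often each node ID appears as a head. This is an illustration under stated assumptions: the function name `child_degree_histogram`, the `(id, head, upos)` tuple layout and the toy tree are hypothetical.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="NOUN"):
    """Count nodes with the given UPOS that have 0, 1, 2, or 3+ children.

    Each sentence is a list of (id, head, upos) tuples; head 0 marks the root.
    """
    histogram = Counter()
    for sentence in sentences:
        # Number of children per node = number of times its ID is used as a head.
        n_children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag == upos:
                degree = n_children[node_id]
                histogram["3+" if degree >= 3 else str(degree)] += 1
    return histogram

# Hypothetical toy tree: node 3 (NOUN) has two children, node 5 (NOUN) is a leaf.
TOY_SENTENCES = [[
    (1, 3, "ADP"),
    (2, 3, "DET"),
    (3, 4, "NOUN"),
    (4, 0, "VERB"),
    (5, 4, "NOUN"),
    (6, 4, "PUNCT"),
]]

print(child_degree_histogram(TOY_SENTENCES))
```

Dividing each bucket by the total number of NOUN tokens gives the percentages; with the figures on this page, the four buckets sum to 88634, the NOUN token count.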

Children of NOUN nodes are attached using 59 different relations: case (29795; 23% instances), amod (22463; 17% instances), nmod (15695; 12% instances), punct (10539; 8% instances), nmod:arg (8165; 6% instances), conj (6176; 5% instances), acl (4635; 4% instances), cc (4136; 3% instances), det (3978; 3% instances), obj (2547; 2% instances), acl:relcl (2102; 2% instances), appos (1783; 1% instances), det:poss (1669; 1% instances), cop (1634; 1% instances), advmod:emph (1534; 1% instances), amod:flat (1452; 1% instances), mark (1404; 1% instances), nummod (1302; 1% instances), nummod:gov (1029; 1% instances), nsubj (913; 1% instances), nmod:flat (699; 1% instances), det:numgov (565; 0% instances), nmod:poss (564; 0% instances), advmod (438; 0% instances), obl:arg (404; 0% instances), ccomp (324; 0% instances), obl:agent (322; 0% instances), obl (289; 0% instances), aux (267; 0% instances), parataxis:insert (235; 0% instances), det:nummod (231; 0% instances), iobj (183; 0% instances), flat (152; 0% instances), expl:pv (136; 0% instances), cc:preconj (135; 0% instances), parataxis:obj (134; 0% instances), advmod:neg (123; 0% instances), advcl (119; 0% instances), list (85; 0% instances), orphan (64; 0% instances), obl:cmpr (56; 0% instances), xcomp (49; 0% instances), fixed (31; 0% instances), discourse:intj (24; 0% instances), csubj (20; 0% instances), ccomp:obj (18; 0% instances), vocative (18; 0% instances), aux:clitic (17; 0% instances), aux:cnd (15; 0% instances), nmod:pred (12; 0% instances), xcomp:subj (10; 0% instances), flat:foreign (9; 0% instances), advcl:cmpr (6; 0% instances), aux:imp (3; 0% instances), dep (3; 0% instances), xcomp:pred (3; 0% instances), discourse (2; 0% instances), nummod:flat (2; 0% instances), advmod:arg (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (31157; 24% instances), ADP (29729; 23% instances), ADJ (28667; 22% instances), PUNCT (10539; 8% instances), DET (6632; 5% instances), PROPN (4541; 4% instances), CCONJ (4181; 3% instances), VERB (2789; 2% instances), NUM (2389; 2% instances), AUX (1936; 2% instances), PRON (1771; 1% instances), PART (1551; 1% instances), SCONJ (1398; 1% instances), ADV (901; 1% instances), X (505; 0% instances), INTJ (24; 0% instances), SYM (9; 0% instances)
