Treebank Statistics: UD_Polish-PDB: POS Tags: NOUN
There are 11292 NOUN
lemmas (36%), 23361 NOUN
types (38%) and 88634 NOUN
tokens (25%).
Out of 17 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: rok, człowiek, mężczyzna, pan, osoba, kobieta, czas, sprawa, dziecko, dzień
The 10 most frequent NOUN
types: mężczyzna, roku, pan, kobieta, lat, r, człowiek, pracy, chłopiec, osób
The 10 most frequent ambiguous lemmas: czas (NOUN 490, VERB 2), prawo (NOUN 258, ADV 6), sposób (NOUN 209, VERB 6), złoty (NOUN 170, ADJ 17), milion (NOUN 116, NUM 2), siła (NOUN 94, ADV 1), brak (NOUN 75, VERB 11), potrzeba (NOUN 70, VERB 5), stosować (NOUN 67, VERB 59, ADJ 33), prawda (NOUN 65, INTJ 2)
The 10 most frequent ambiguous types: r (NOUN 260, PROPN 2), sposób (NOUN 170, VERB 6), czas (NOUN 121, VERB 1), prawa (NOUN 86, ADJ 2), prawo (NOUN 77, ADV 6), lata (NOUN 73, VERB 2), proc (NOUN 75, ADJ 2), spraw (NOUN 58, VERB 1), danych (NOUN 60, ADJ 1), razem (NOUN 61, ADV 46)
- r
- sposób
- czas
- prawa
- prawo
- lata
- proc
- NOUN 75: Resztę Howell będzie płacił w ratach - co roku 5 proc .
- ADJ 2: Planowany wzrost przychodów najludniejszych gmin w br . będzie o 2 pkt . proc . wyższy od prognozowanej na br . stopy średniorocznej inflacji ( 15 proc . ) , natomiast wzrost wydatków przewyższa o 12 pkt . proc . stopę inflacji .
- spraw
- danych
- NOUN 60: Jaki jest cel zbierania danych na temat infrastruktury energetycznej ?
- ADJ 1: Jacek Ambroziak powiedział , że celem tej zmiany jest także to , aby czas prywatyzacji danego przedsiębiorstwa miał znaczenie dla wartości akcji danych pracownikom , tzn . im szybciej przedsiębiorstwo będzie sprywatyzowane , tym większa będzie wartość otrzymywanych przez pracowników bezpłatnych akcji .
- razem
Morphology
The form / lemma ratio of NOUN
is 2.068810 (the average of all parts of speech is 1.966055).
The 1st highest number of forms (15) was observed with the lemma “rząd”: rząd, rządami, rządem, rządowi, rządu, rządy, rządzie, rządów, rzędach, rzędami, rzędem, rzędu, rzędy, rzędzie, rzędów.
The 2nd highest number of forms (11) was observed with the lemma “człowiek”: cztowiek, człowiek, człowieka, człowiekiem, człowiekowi, człowieku, ludzi, ludziach, ludzie, ludziom, ludźmi.
The 3rd highest number of forms (11) was observed with the lemma “oko”: oczach, oczami, oczom, oczu, oczy, oczyma, oczów, oka, okiem, oko, oku.
NOUN
occurs with 10 features: Case (87150; 98% instances), Gender (87150; 98% instances), Number (87150; 98% instances), Animacy (38821; 44% instances), Aspect (3913; 4% instances), Polarity (3913; 4% instances), VerbForm (3913; 4% instances), Abbr (1484; 2% instances), NumType (725; 1% instances), Polite (12; 0% instances)
NOUN
occurs with 24 feature-value pairs: Abbr=Yes
, Animacy=Hum
, Animacy=Inan
, Animacy=Nhum
, Aspect=Imp
, Aspect=Perf
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Masc
, Gender=Neut
, NumType=Sets
, Number=Plur
, Number=Ptan
, Number=Sing
, Polarity=Neg
, Polarity=Pos
, Polite=Depr
, VerbForm=Vnoun
NOUN
occurs with 129 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing
(7847 tokens).
Examples: pracy, wody, rady, komisji, strony, unii, góry, ochrony, polityki, pomocy
Relations
NOUN
nodes are attached to their parents using 40 different relations: obl (15614; 18% instances), nsubj (14031; 16% instances), nmod (12569; 14% instances), obj (12174; 14% instances), nmod:arg (7537; 9% instances), conj (6275; 7% instances), obl:arg (6068; 7% instances), iobj (4490; 5% instances), fixed (2346; 3% instances), root (1549; 2% instances), nmod:flat (969; 1% instances), appos (866; 1% instances), nsubj:pass (848; 1% instances), obl:agent (672; 1% instances), obl:cmpr (645; 1% instances), nmod:poss (262; 0% instances), flat (252; 0% instances), xcomp:pred (241; 0% instances), parataxis:obj (235; 0% instances), parataxis:insert (213; 0% instances), vocative (166; 0% instances), ccomp (151; 0% instances), acl:relcl (112; 0% instances), advcl (89; 0% instances), ccomp:obj (77; 0% instances), xcomp (58; 0% instances), orphan (51; 0% instances), csubj (18; 0% instances), ccomp:cleft (11; 0% instances), flat:foreign (9; 0% instances), list (7; 0% instances), acl (6; 0% instances), nmod:pred (5; 0% instances), advcl:relcl (4; 0% instances), case (4; 0% instances), advcl:cmpr (3; 0% instances), advmod:emph (3; 0% instances), xcomp:cleft (2; 0% instances), csubj:pass (1; 0% instances), dep (1; 0% instances)
Parents of NOUN
nodes belong to 13 different parts of speech: VERB (43973; 50% instances), NOUN (31157; 35% instances), ADJ (7636; 9% instances), ADP (2305; 3% instances), (1549; 2% instances), PROPN (717; 1% instances), ADV (407; 0% instances), DET (301; 0% instances), NUM (282; 0% instances), PRON (240; 0% instances), PART (31; 0% instances), INTJ (19; 0% instances), X (17; 0% instances)
18263 (21%) NOUN
nodes are leaves.
33363 (38%) NOUN
nodes have one child.
23702 (27%) NOUN
nodes have two children.
13306 (15%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 13.
Children of NOUN
nodes are attached using 59 different relations: case (29795; 23% instances), amod (22463; 17% instances), nmod (15695; 12% instances), punct (10539; 8% instances), nmod:arg (8165; 6% instances), conj (6176; 5% instances), acl (4635; 4% instances), cc (4136; 3% instances), det (3978; 3% instances), obj (2547; 2% instances), acl:relcl (2102; 2% instances), appos (1783; 1% instances), det:poss (1669; 1% instances), cop (1634; 1% instances), advmod:emph (1534; 1% instances), amod:flat (1452; 1% instances), mark (1404; 1% instances), nummod (1302; 1% instances), nummod:gov (1029; 1% instances), nsubj (913; 1% instances), nmod:flat (699; 1% instances), det:numgov (565; 0% instances), nmod:poss (564; 0% instances), advmod (438; 0% instances), obl:arg (404; 0% instances), ccomp (324; 0% instances), obl:agent (322; 0% instances), obl (289; 0% instances), aux (267; 0% instances), parataxis:insert (235; 0% instances), det:nummod (231; 0% instances), iobj (183; 0% instances), flat (152; 0% instances), expl:pv (136; 0% instances), cc:preconj (135; 0% instances), parataxis:obj (134; 0% instances), advmod:neg (123; 0% instances), advcl (119; 0% instances), list (85; 0% instances), orphan (64; 0% instances), obl:cmpr (56; 0% instances), xcomp (49; 0% instances), fixed (31; 0% instances), discourse:intj (24; 0% instances), csubj (20; 0% instances), ccomp:obj (18; 0% instances), vocative (18; 0% instances), aux:clitic (17; 0% instances), aux:cnd (15; 0% instances), nmod:pred (12; 0% instances), xcomp:subj (10; 0% instances), flat:foreign (9; 0% instances), advcl:cmpr (6; 0% instances), aux:imp (3; 0% instances), dep (3; 0% instances), xcomp:pred (3; 0% instances), discourse (2; 0% instances), nummod:flat (2; 0% instances), advmod:arg (1; 0% instances)
Children of NOUN
nodes belong to 17 different parts of speech: NOUN (31157; 24% instances), ADP (29729; 23% instances), ADJ (28667; 22% instances), PUNCT (10539; 8% instances), DET (6632; 5% instances), PROPN (4541; 4% instances), CCONJ (4181; 3% instances), VERB (2789; 2% instances), NUM (2389; 2% instances), AUX (1936; 2% instances), PRON (1771; 1% instances), PART (1551; 1% instances), SCONJ (1398; 1% instances), ADV (901; 1% instances), X (505; 0% instances), INTJ (24; 0% instances), SYM (9; 0% instances)