Treebank Statistics: UD_Livvi-KKPP: POS Tags: NOUN
There are 227 NOUN
lemmas (39%), 305 NOUN
types (39%) and 431 NOUN
tokens (26%).
Out of 14 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: saari, arbaitus, vojennoi, virsta, päivy, tverinkarjalaine, festivuali, kieli, kodi, sarvi
The 10 most frequent NOUN
types: vojennoit, saari, virstaa, bobuli-briha, briha, piduhuttu, saaraa, taatto, arbaituksii, festivualin
The 10 most frequent ambiguous lemmas: tverinkarjalaine (NOUN 2, ADJ 1), piduhus (NOUN 5, X 1), karjalaine (ADJ 4, NOUN 4), nuori (NOUN 4, ADJ 1), bohattu (NOUN 3, ADJ 2), Karjal (PROPN 6, NOUN 2), pučči (NOUN 2, X 1), piirde (ADJ 1, NOUN 1), semmoine (ADJ 3, NOUN 1)
The 10 most frequent ambiguous types: piduhuttu (NOUN 5, X 1), tverinkarjalazien (ADJ 1, NOUN 1), bohattu (NOUN 3, ADJ 2), puččii (NOUN 2, X 1), d’engaa (NOUN 1, X 1), piirdehii (ADJ 1, NOUN 1), semmostu (ADJ 1, NOUN 1)
- piduhuttu
- NOUN 5: Sit oli häkki : seičče virstaa oli sarvet piduhuttu ; paimoi istui sarvel , toine toizel ; torvittih , ga toine toizen ei kuulluh torvindaa .
- X 1: ” Kuunelkaa nygöi , sanon saaraa , – häi sanoo , – konzu minun taatto oli bohattu , kolme virstaa oli kodi piduhuttu , a virstaa levevytty ; kezäkse kraassi koin mustal kraaskal , a talvekse valgiel .
- tverinkarjalazien
- bohattu
- puččii
- d’engaa
- piirdehii
- semmostu
Morphology
The form / lemma ratio of NOUN
is 1.343612 (the average of all parts of speech is 1.335034).
The 1st highest number of forms (7) was observed with the lemma “arbaitus”: Arbaituksen, arbaitukses, arbaitukset, arbaituksien, arbaituksii, arbaituksis, arbaitustu.
The 2nd highest number of forms (6) was observed with the lemma “kieli”: kieldy, kieleh, kielen, kieles, kieli, kielil.
The 3rd highest number of forms (6) was observed with the lemma “saari”: saaraa, saaran, saari, saaril, saarilloo, saaris.
NOUN
occurs with 10 features: Case (429; 100% instances), Number (429; 100% instances), Mood (1; 0% instances), Number[psor] (1; 0% instances), Person (1; 0% instances), Person[psor] (1; 0% instances), Tense (1; 0% instances), Typo (1; 0% instances), VerbForm (1; 0% instances), Voice (1; 0% instances)
NOUN
occurs with 25 feature-value pairs: Case=Abl
, Case=Acc
, Case=Ade
, Case=All
, Case=Com
, Case=Ela
, Case=Ess
, Case=Gen
, Case=Ill
, Case=Ine
, Case=Ins
, Case=Nom
, Case=Par
, Case=Ter
, Case=Tra
, Mood=Ind
, Number=Plur
, Number=Sing
, Number[psor]=Sing
, Person=3
, Person[psor]=3
, Tense=Pres
, Typo=Yes
, VerbForm=Fin
, Voice=Act
NOUN
occurs with 29 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing
(81 tokens).
Examples: saari, briha, taatto, bobuli-briha, bohattu, häkki, kodi, paimoi, -liitto, akku
Relations
NOUN
nodes are attached to their parents using 21 different relations: obl (120; 28% instances), nsubj (77; 18% instances), obj (76; 18% instances), nmod:poss (47; 11% instances), conj (35; 8% instances), nsubj:cop (20; 5% instances), root (12; 3% instances), orphan (9; 2% instances), nmod (8; 2% instances), parataxis (6; 1% instances), appos (4; 1% instances), advcl (3; 1% instances), acl (2; 0% instances), amod (2; 0% instances), compound (2; 0% instances), compound:nn (2; 0% instances), flat:name (2; 0% instances), acl:relcl (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), fixed (1; 0% instances)
Parents of NOUN
nodes belong to 9 different parts of speech: VERB (262; 61% instances), NOUN (119; 28% instances), ADJ (20; 5% instances), (12; 3% instances), PROPN (8; 2% instances), PRON (3; 1% instances), X (3; 1% instances), ADV (2; 0% instances), NUM (2; 0% instances)
173 (40%) NOUN
nodes are leaves.
174 (40%) NOUN
nodes have one child.
43 (10%) NOUN
nodes have two children.
41 (10%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 7.
Children of NOUN
nodes are attached using 29 different relations: nmod:poss (83; 19% instances), amod (74; 17% instances), punct (67; 15% instances), conj (41; 9% instances), cc (21; 5% instances), cop (18; 4% instances), nummod (17; 4% instances), nsubj:cop (15; 3% instances), case (13; 3% instances), det (12; 3% instances), obl (11; 3% instances), parataxis (10; 2% instances), acl:relcl (7; 2% instances), obj (7; 2% instances), nmod (6; 1% instances), appos (5; 1% instances), mark (5; 1% instances), orphan (5; 1% instances), advmod (4; 1% instances), compound:nn (3; 1% instances), goeswith (3; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), cop:own (1; 0% instances), csubj:cop (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), xcomp (1; 0% instances)
Children of NOUN
nodes belong to 14 different parts of speech: NOUN (119; 27% instances), ADJ (76; 17% instances), PUNCT (67; 15% instances), PRON (32; 7% instances), PROPN (31; 7% instances), VERB (26; 6% instances), CCONJ (21; 5% instances), AUX (19; 4% instances), NUM (17; 4% instances), ADP (11; 3% instances), ADV (6; 1% instances), SCONJ (5; 1% instances), X (4; 1% instances), INTJ (1; 0% instances)