home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-Cairo: POS Tags: NOUN

There are 24 NOUN lemmas (22%), 25 NOUN types (21%) and 26 NOUN tokens (15%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: mašīna, vēstule, bronza, brālis, draudzene, dzeršana, galvaspilsēta, iemesls, istaba, jausma

The 10 most frequent NOUN types: mašīnu, Meitene, bronzu, brālis, draudzenei, dzeršanu, galvaspilsētā, iemesla, istabu, jausmas

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.041667 (the average of all parts of speech is 1.093458).

The 1st highest number of forms (2) was observed with the lemma “vēstule”: vēstule, vēstuli.

The 2nd highest number of forms (1) was observed with the lemma “bronza”: bronzu.

The 3rd highest number of forms (1) was observed with the lemma “brālis”: brālis.

NOUN occurs with 3 features: Case (26; 100% instances), Gender (26; 100% instances), Number (26; 100% instances)

NOUN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Coll, Number=Plur, Number=Sing

NOUN occurs with 12 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (7 tokens). Examples: mašīnu, bronzu, dzeršanu, istabu, smēķēšanu, vēstuli

Relations

NOUN nodes are attached to their parents using 8 different relations: obj (9; 35% instances), nsubj (6; 23% instances), orphan (3; 12% instances), conj (2; 8% instances), iobj (2; 8% instances), obl (2; 8% instances), appos (1; 4% instances), root (1; 4% instances)

Parents of NOUN nodes belong to 5 different parts of speech: VERB (18; 69% instances), PROPN (4; 15% instances), NOUN (2; 8% instances), ADJ (1; 4% instances), (1; 4% instances)

12 (46%) NOUN nodes are leaves.

9 (35%) NOUN nodes have one child.

3 (12%) NOUN nodes have two children.

2 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 14 different relations: det (5; 21% instances), nmod (3; 13% instances), punct (3; 13% instances), amod (2; 8% instances), cc (2; 8% instances), acl (1; 4% instances), advmod:emph (1; 4% instances), advmod:neg (1; 4% instances), case (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), discourse (1; 4% instances), nsubj (1; 4% instances), orphan (1; 4% instances)

Children of NOUN nodes belong to 11 different parts of speech: DET (5; 21% instances), PART (3; 13% instances), PROPN (3; 13% instances), PUNCT (3; 13% instances), ADJ (2; 8% instances), CCONJ (2; 8% instances), NOUN (2; 8% instances), ADP (1; 4% instances), AUX (1; 4% instances), PRON (1; 4% instances), VERB (1; 4% instances)