
This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: PART

There are 79 PART lemmas (0%), 85 PART types (0%) and 6747 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 9 in number of lemmas, 10 in number of types and 13 in number of tokens.
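Counts of this kind can be reproduced from a treebank's CoNLL-U file. The sketch below is a minimal illustration, not the script that generated this page. The names `iter_words`, `upos_counts` and `row` are hypothetical, the rows are toy data rather than treebank sentences, and it assumes that types are counted as case-sensitive word forms.

```python
def iter_words(conllu_text):
    """Yield the 10 CoNLL-U columns of every syntactic word."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments, blank lines, multiword-token ranges (1-2) and
        # empty nodes (1.1) all fail this check and are skipped.
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def upos_counts(conllu_text, upos):
    """Return (lemma count, type count, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in iter_words(conllu_text):
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos):
    """Build one toy CoNLL-U line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Toy rows for illustration only, not taken from the treebank.
toy = "\n".join([
    row(1, "arī", "arī", "PART"),
    row(2, "ari", "arī", "PART"),
    row(3, "tikai", "tikai", "PART"),
    row(4, "jau", "jau", "ADV"),
    row(5, "arī", "arī", "PART"),
])

print(upos_counts(toy, "PART"))  # (2, 3, 4)
```

Run over the full treebank text, `upos_counts(text, "PART")` would be expected to return the three figures quoted above, provided the counting conventions match.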

The 10 most frequent PART lemmas: arī, tikai, pat, vai, ne, gan, kaut, vien, jau, tieši

The 10 most frequent PART types: arī, tikai, pat, vai, ne, gan, kaut, vien, jau, tieši

The 10 most frequent ambiguous lemmas: arī (PART 1840, CCONJ 282), vai (CCONJ 598, SCONJ 500, PART 301, INTJ 2), ne (PART 288, CCONJ 280), gan (CCONJ 487, PART 282, SCONJ 1), kaut (PART 275, SCONJ 40), jau (ADV 706, PART 203), tieši (PART 196, ADV 23), nu (PART 179, ADV 103, INTJ 5), tāpat (PART 98, ADV 80), (SCONJ 958, ADV 603, CCONJ 247, PART 94)

The 10 most frequent ambiguous types: arī (PART 1685, CCONJ 280), pat (PART 343, X 2, ADP 1), vai (CCONJ 566, SCONJ 496, PART 119, INTJ 1), ne (CCONJ 272, PART 262, PRON 1), gan (CCONJ 469, PART 278, SCONJ 1), kaut (PART 256, SCONJ 23), jau (ADV 636, PART 203), tieši (PART 168, ADV 16, ADJ 1), nu (PART 111, ADV 80, INTJ 1), tāpat (ADV 49, PART 33)

Morphology

The form / lemma ratio of PART is 1.075949 (the average of all parts of speech is 2.339090).
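The reported ratio agrees with dividing the number of PART types by the number of PART lemmas given earlier on this page. The short check below uses only those two figures; the variable names are illustrative.

```python
# Figures quoted on this page for PART in UD_Latvian-LVTB.
part_types = 85    # distinct word forms
part_lemmas = 79   # distinct lemmas

ratio = part_types / part_lemmas
print(f"{ratio:.6f}")  # 1.075949
```

A value this close to 1 means that almost every PART lemma appears in a single form, which is expected for an uninflected word class.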

The highest number of forms (4) was observed with the lemma “arī”: ar, ar’, ari, arī.

The second highest number of forms (2) was observed with the lemma “ar”: ar, ar’.

The third highest number of forms (2) was observed with the lemma “nu”: no, nu.

PART occurs with 2 features: Polarity (442; 7% instances), Typo (14; 0% instances)

PART occurs with 3 feature-value pairs: Polarity=Neg, Polarity=Pos, Typo=Yes

PART occurs with 5 feature combinations. The most frequent feature combination is _ (6292 tokens). Examples: arī, tikai, pat, vai, gan, kaut, vien, jau, tieši, nu

Relations

PART nodes are attached to their parents using 19 different relations: discourse (3363; 50% instances), advmod:emph (2348; 35% instances), fixed (678; 10% instances), advmod:neg (167; 2% instances), mark (43; 1% instances), root (43; 1% instances), dep (35; 1% instances), conj (20; 0% instances), ccomp (17; 0% instances), cc (14; 0% instances), flat (6; 0% instances), flat:name (3; 0% instances), det (2; 0% instances), iobj (2; 0% instances), parataxis (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances)
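A relation breakdown like the one above can be sketched by tallying the DEPREL column for every word with a given UPOS tag. This is an illustrative sketch only: `deprel_shares` is a hypothetical helper, and the percentages use Python's built-in `round`, which may not match the rounding used to produce this page.

```python
from collections import Counter


def deprel_shares(conllu_text, upos):
    """Return [(relation, count, rounded percent)] for words tagged `upos`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and an integer ID.
        if len(cols) == 10 and cols[0].isdigit() and cols[3] == upos:
            counts[cols[7]] += 1  # DEPREL is the 8th column
    total = sum(counts.values())
    # most_common() sorts by descending count, as the list above does.
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]
```

Calling `deprel_shares(text, "PART")` on the treebank text would yield one tuple per relation, most frequent first.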

Parents of PART nodes belong to 17 different parts of speech: VERB (2315; 34% instances), NOUN (1859; 28% instances), ADV (684; 10% instances), CCONJ (300; 4% instances), DET (292; 4% instances), PRON (284; 4% instances), ADJ (270; 4% instances), PART (233; 3% instances), SCONJ (175; 3% instances), PROPN (169; 3% instances), NUM (98; 1% instances), no parent, i.e. PART is the sentence root (43; 1% instances), X (11; 0% instances), SYM (7; 0% instances), AUX (3; 0% instances), INTJ (3; 0% instances), ADP (1; 0% instances)

6048 (90%) PART nodes are leaves.

522 (8%) PART nodes have one child.

142 (2%) PART nodes have two children.

35 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 7.
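The child-degree figures above can be sketched as a histogram over the HEAD column. Because word IDs restart in every sentence, the tally has to be done sentence by sentence. The helper names `sentences` and `child_degree_histogram` are hypothetical, and this is a minimal sketch rather than the method used to build this page.

```python
from collections import Counter


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column word rows."""
    sent = []
    for line in conllu_text.splitlines():
        if not line.strip():          # a blank line ends a sentence
            if sent:
                yield sent
            sent = []
            continue
        cols = line.split("\t")
        # Skip comments, multiword-token ranges and empty nodes.
        if len(cols) == 10 and cols[0].isdigit():
            sent.append(cols)
    if sent:
        yield sent


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes tagged `upos`."""
    hist = Counter()
    for sent in sentences(conllu_text):
        # HEAD (7th column) holds the parent's ID, so counting HEAD
        # values gives the number of children of each node.
        children = Counter(cols[6] for cols in sent)
        for cols in sent:
            if cols[3] == upos:
                hist[children[cols[0]]] += 1
    return hist
```

In the resulting histogram, key 0 counts the leaves and the largest key is the highest child degree.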

Children of PART nodes are attached using 20 different relations: punct (584; 63% instances), fixed (251; 27% instances), cc (27; 3% instances), discourse (24; 3% instances), conj (8; 1% instances), advcl (5; 1% instances), flat (5; 1% instances), advmod (4; 0% instances), dep (4; 0% instances), mark (3; 0% instances), orphan (2; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), cop (1; 0% instances), flat:name (1; 0% instances), goeswith (1; 0% instances), iobj (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances)

Children of PART nodes belong to 15 different parts of speech: PUNCT (584; 63% instances), PART (233; 25% instances), SCONJ (44; 5% instances), CCONJ (27; 3% instances), VERB (8; 1% instances), ADV (7; 1% instances), NOUN (6; 1% instances), PRON (6; 1% instances), INTJ (5; 1% instances), DET (2; 0% instances), ADJ (1; 0% instances), AUX (1; 0% instances), PROPN (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)