This page pertains to UD version 2.

Treebank Statistics: UD_Czech-FicTree: POS Tags: AUX

There are 2 AUX lemmas (0%), 59 AUX types (0%) and 7534 AUX tokens (5%). Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat

The 10 most frequent AUX types: jsem, je, by, byl, byla, bylo, bych, jsme, bude, jsou

The 10 most frequent ambiguous lemmas: (none observed)

The 10 most frequent ambiguous types: je (AUX 863, PRON 228), si (PRON 1351, AUX 4), buď (CCONJ 6, AUX 1)

Morphology

The form / lemma ratio of AUX is 29.500000 (the average of all parts of speech is 1.968999).
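The reported ratio is consistent with dividing the number of distinct AUX types by the number of distinct AUX lemmas (59 / 2 = 29.5). A minimal sketch of that computation follows; the function name `form_lemma_ratio` and the toy token list are assumptions made for illustration and are not taken from the treebank or from the tool that generated this page.

```python
def form_lemma_ratio(tokens, upos="AUX"):
    """Return (#distinct forms) / (#distinct lemmas) for one part of speech.

    tokens: iterable of (form, lemma, upos) triples.
    Forms are compared case-sensitively, as the form lists on this page suggest.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    return len(forms) / len(lemmas) if lemmas else 0.0


# Hypothetical toy sample, not real treebank data.
sample = [
    ("jsem", "být", "AUX"),
    ("je", "být", "AUX"),
    ("je", "on", "PRON"),      # same form, different tag: not counted for AUX
    ("bývá", "bývat", "AUX"),
    ("jsem", "být", "AUX"),    # repeated form counts once
]

print(form_lemma_ratio(sample))  # 3 distinct forms / 2 lemmas = 1.5
```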

The highest number of forms (48) was observed with the lemma “být”: Buďme, Nebuď, bude, budeme, budete, budeš, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, bysme, byste, být, je, jsem, jsi, jsme, jsou, jsouc, jste, nebude, nebudeme, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejseš, nejsi, nejsme, nejsou, nejste, není, si.

The second-highest number of forms (11) was observed with the lemma “bývat”: Nebývají, bývají, býval, bývala, bývali, bývalo, bývá, bývám, nebýval, nebývala, nebývalo.

AUX occurs with 11 features: Aspect (7534; 100% instances), VerbForm (7534; 100% instances), Number (6573; 87% instances), Polarity (6258; 83% instances), Tense (6098; 81% instances), Voice (6098; 81% instances), Mood (5957; 79% instances), Person (5150; 68% instances), Gender (1423; 19% instances), Animacy (624; 8% instances), Style (7; 0% instances)

AUX occurs with 25 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 54 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (2338 tokens). Examples: jsem, bývám
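Feature combinations such as the one above use the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. The sketch below splits such a string into a dictionary; `parse_feats` is a hypothetical helper written for this illustration, not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    An underscore (or an empty string) means the token has no features.
    """
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent AUX feature combination quoted above.
combo = ("Aspect=Imp|Mood=Ind|Number=Sing|Person=1|"
         "Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act")

feats = parse_feats(combo)
print(len(feats))        # number of features in the combination
print(feats["Person"])   # value of the Person feature
```

Counting such dictionaries (for example as sorted tuples of pairs) over all AUX tokens would give per-feature and per-combination tallies of the kind listed in this section.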

Relations

AUX nodes are attached to their parents using 14 different relations: aux (4216; 56% instances), cop (2955; 39% instances), aux:pass (138; 2% instances), root (86; 1% instances), conj (36; 0% instances), advcl (35; 0% instances), ccomp (28; 0% instances), xcomp (17; 0% instances), acl (10; 0% instances), dep (6; 0% instances), acl:relcl (4; 0% instances), csubj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)
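A distribution like this can be tallied from a CoNLL-U file by reading the UPOS column (4th) and the DEPREL column (8th) of each token line. The following is a minimal sketch under that assumption; `deprel_counts` and the three-word sample sentence with its annotation are hypothetical and only illustrate the file layout.

```python
from collections import Counter


def deprel_counts(conllu_text, upos="AUX"):
    """Count DEPREL values of all tokens with the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only regular tokens: 10 columns and a plain integer ID
        # (this skips multiword ranges like "1-2" and empty nodes like "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts


# Hypothetical sample sentence; the annotation is illustrative only.
rows = [
    ["1", "Byl", "být", "AUX", "_", "_", "3", "cop", "_", "_"],
    ["2", "jsem", "být", "AUX", "_", "_", "3", "aux", "_", "_"],
    ["3", "doma", "doma", "ADV", "_", "_", "0", "root", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"],
]
sample_conllu = "# text = Byl jsem doma.\n" + "\n".join("\t".join(r) for r in rows)

counts = deprel_counts(sample_conllu)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {100 * n / total:.0f}% instances)")
```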

Parents of AUX nodes belong to 13 different parts of speech: VERB (4068; 54% instances), ADJ (1452; 19% instances), NOUN (1119; 15% instances), ADV (340; 5% instances), PRON (215; 3% instances), DET (127; 2% instances), [no parent: root] (86; 1% instances), PART (39; 1% instances), PROPN (39; 1% instances), NUM (29; 0% instances), AUX (18; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

7327 (97%) AUX nodes are leaves.

11 (0%) AUX nodes have one child.

56 (1%) AUX nodes have two children.

140 (2%) AUX nodes have three or more children.

The highest child degree of an AUX node is 9.
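The child-degree figures above can be reproduced by counting, for every AUX node, how many tokens in the same sentence name it as their head. The sketch below assumes each sentence is given as a list of `(id, upos, head)` triples; the function name `child_degree_histogram` and the two toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="AUX"):
    """Map child degree -> number of nodes with that tag having that degree.

    sentences: iterable of sentences, each a list of (id, upos, head) triples
    with integer ids and heads (head 0 denotes the root).
    """
    hist = Counter()
    for sent in sentences:
        # How many tokens point at each head id within this sentence.
        n_children = Counter(head for _id, _tag, head in sent)
        for node_id, tag, _head in sent:
            if tag == upos:
                hist[n_children[node_id]] += 1  # missing key counts as 0
    return hist


# Hypothetical toy sentences, not real treebank data.
sentences = [
    [(1, "AUX", 3), (2, "AUX", 3), (3, "ADV", 0), (4, "PUNCT", 3)],
    [(1, "AUX", 0), (2, "PUNCT", 1)],
]

hist = child_degree_histogram(sentences)
print(hist[0])     # AUX leaves
print(max(hist))   # highest child degree of an AUX node
```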

Children of AUX nodes are attached using 18 different relations: punct (248; 38% instances), nsubj (132; 20% instances), mark (59; 9% instances), conj (39; 6% instances), cc (35; 5% instances), advcl (25; 4% instances), advmod (22; 3% instances), ccomp (21; 3% instances), dep (16; 2% instances), csubj (14; 2% instances), aux (12; 2% instances), xcomp (8; 1% instances), obl (7; 1% instances), discourse (5; 1% instances), appos (3; 0% instances), parataxis (3; 0% instances), obj (1; 0% instances), obl:arg (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: PUNCT (248; 38% instances), NOUN (109; 17% instances), VERB (81; 12% instances), SCONJ (60; 9% instances), CCONJ (32; 5% instances), PRON (30; 5% instances), ADV (27; 4% instances), DET (26; 4% instances), AUX (18; 3% instances), ADJ (7; 1% instances), PART (7; 1% instances), NUM (4; 1% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)