home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: POS Tags: AUX

There are 7 AUX lemmas (0%), 62 AUX types (1%) and 3046 AUX tokens (6%). Out of 15 observed tags, the rank of AUX is: 13 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent AUX lemmas: yn, bod, wedi, ar, am, newydd, heb

The 10 most frequent AUX types: yn, ‘n, wedi, mae, yw, oedd, bod, fod, oes, ar

The 10 most frequent ambiguous lemmas: yn (AUX 1367, PART 1106, ADP 1046), bod (VERB 1782, AUX 1087, NOUN 308), wedi (AUX 482, ADP 6, SCONJ 5), ar (ADP 747, AUX 39), am (ADP 331, AUX 27), newydd (ADJ 82, AUX 26), heb (ADP 33, AUX 18)

The 10 most frequent ambiguous types: yn (AUX 882, PART 743, ADP 732), ‘n (AUX 478, PART 322, PRON 31, ADP 11), wedi (AUX 465, SCONJ 5, ADP 2), mae (VERB 257, AUX 91, NOUN 2), yw (AUX 176, VERB 39), oedd (AUX 147, VERB 134), bod (NOUN 195, AUX 91), fod (NOUN 96, AUX 76), oes (AUX 42, VERB 21, NOUN 7), ar (ADP 708, AUX 39)

Morphology

The form / lemma ratio of AUX is 8.857143 (the average of all parts of speech is 1.452021).

The 1st highest number of forms (53) was observed with the lemma “bod”: Buodd, Byddaf, Byddan, Byddech, Byddi, Bûm, Maen, Oeddet, Ydach, baech, baent, baswn, bawn, bod, bu, buoch, bydd, bydda, byddai, byddant, byddwch, bysa, dw, dwi, fo, fod, fu, fydd, fyddai, ma’, mae, mod, oedd, oeddech, oeddem, oeddwn, oes, s’, sy, sydd, wy, wyf, wyt, ydi, ydoedd, ydw, ydy, ydych, ydym, ydyn, ydynt, ydyw, yw.

The 2nd highest number of forms (3) was observed with the lemma “wedi”: ‘di, di, wedi.

The 3rd highest number of forms (2) was observed with the lemma “yn”: ‘n, yn.

AUX occurs with 6 features: Number (1087; 36% instances), VerbForm (1087; 36% instances), Person (917; 30% instances), Tense (916; 30% instances), Mood (909; 30% instances), Mutation (125; 4% instances)

AUX occurs with 18 feature-value pairs: Mood=Cnd, Mood=Ind, Mood=Sub, Mutation=NM, Mutation=SM, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, VerbForm=Fin, VerbForm=FinRel, VerbForm=Vnoun

AUX occurs with 36 feature combinations. The most frequent feature combination is _ (1959 tokens). Examples: yn, ‘n, wedi, ar, am, newydd, heb, ‘di, di

Relations

AUX nodes are attached to their parents using 10 different relations: aux (1966; 65% instances), cop (996; 33% instances), root (42; 1% instances), conj (18; 1% instances), acl:relcl (10; 0% instances), ccomp (6; 0% instances), advcl (4; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: NOUN (2507; 82% instances), ADJ (441; 14% instances), (42; 1% instances), NUM (21; 1% instances), VERB (17; 1% instances), PROPN (7; 0% instances), PRON (6; 0% instances), ADV (3; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)

2956 (97%) AUX nodes are leaves.

11 (0%) AUX nodes have one child.

17 (1%) AUX nodes have two children.

62 (2%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 14 different relations: punct (75; 25% instances), nsubj (59; 20% instances), advmod (54; 18% instances), xcomp (48; 16% instances), cc (16; 5% instances), obl (16; 5% instances), mark (8; 3% instances), advcl (6; 2% instances), fixed (6; 2% instances), conj (3; 1% instances), appos (1; 0% instances), case (1; 0% instances), csubj (1; 0% instances), det (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: NOUN (104; 35% instances), PUNCT (75; 25% instances), PART (32; 11% instances), PRON (22; 7% instances), ADV (19; 6% instances), CCONJ (16; 5% instances), PROPN (9; 3% instances), SCONJ (7; 2% instances), ADJ (3; 1% instances), VERB (3; 1% instances), ADP (2; 1% instances), AUX (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)