home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: AUX

There are 14 AUX lemmas (0%), 51 AUX types (0%) and 11355 AUX tokens (5%). Out of 17 observed tags, the rank of AUX is: 16 in number of lemmas, 11 in number of types and 9 in number of tokens.

The 10 most frequent AUX lemmas: be, have, do, can, will, would, should, could, may, might

The 10 most frequent AUX types: is, was, are, be, ‘s, can, do, will, have, would

The 10 most frequent ambiguous lemmas: be (AUX 7080, VERB 409), have (AUX 1082, VERB 869), do (AUX 828, VERB 468, NOUN 4, PROPN 3), can (AUX 644, NOUN 11), will (AUX 562, NOUN 18, VERB 2), get (VERB 448, AUX 27)

The 10 most frequent ambiguous types: is (AUX 1832, VERB 85, X 1), was (AUX 1189, VERB 53), are (AUX 867, VERB 103), be (AUX 805, VERB 31), ’s (AUX 725, PART 444, VERB 81, PRON 43), can (AUX 556, NOUN 3), do (AUX 388, VERB 251, NOUN 3, PROPN 3, PART 1), will (AUX 403, NOUN 18, VERB 2), have (VERB 513, AUX 385), were (AUX 351, VERB 17)

Morphology

The form / lemma ratio of AUX is 3.642857 (the average of all parts of speech is 1.237215).

The 1st highest number of forms (18) was observed with the lemma “be”: ‘m, ‘re, ‘s, R, S’, am, are, be, been, being, is, s, was, were, where, ’m, ’re, ’s.

The 2nd highest number of forms (11) was observed with the lemma “have”: ‘d, ‘s, ‘ve, a, had, has, have, having, ’d, ’s, ’ve.

The 3rd highest number of forms (6) was observed with the lemma “do”: ‘d, did, do, does, doing, done.

AUX occurs with 6 features: VerbForm (11354; 100% instances), Tense (8019; 71% instances), Mood (7626; 67% instances), Person (7626; 67% instances), Number (7603; 67% instances), Typo (26; 0% instances)

AUX occurs with 15 feature-value pairs: Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

AUX occurs with 29 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (3174 tokens). Examples: is, ‘s, has, ’s, does, S’, gets

Relations

AUX nodes are attached to their parents using 20 different relations: aux (5045; 44% instances), cop (4263; 38% instances), aux:pass (1583; 14% instances), root (109; 1% instances), reparandum (96; 1% instances), advcl (72; 1% instances), acl:relcl (59; 1% instances), advcl:relcl (30; 0% instances), conj (29; 0% instances), parataxis (22; 0% instances), ccomp (20; 0% instances), fixed (10; 0% instances), acl (5; 0% instances), xcomp (4; 0% instances), compound (2; 0% instances), orphan (2; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), dep (1; 0% instances), discourse (1; 0% instances)

Parents of AUX nodes belong to 15 different parts of speech: VERB (6507; 57% instances), ADJ (1953; 17% instances), NOUN (1848; 16% instances), ADV (272; 2% instances), PRON (244; 2% instances), PROPN (190; 2% instances), (109; 1% instances), NUM (96; 1% instances), AUX (65; 1% instances), DET (24; 0% instances), ADP (16; 0% instances), INTJ (12; 0% instances), PART (9; 0% instances), X (6; 0% instances), SYM (4; 0% instances)

10824 (95%) AUX nodes are leaves.

183 (2%) AUX nodes have one child.

136 (1%) AUX nodes have two children.

212 (2%) AUX nodes have three or more children.

The highest child degree of a AUX node is 11.

Children of AUX nodes are attached using 26 different relations: nsubj (397; 32% instances), punct (333; 27% instances), advmod (135; 11% instances), mark (82; 7% instances), discourse (48; 4% instances), aux (45; 4% instances), obl (44; 4% instances), cc (42; 3% instances), conj (26; 2% instances), reparandum (21; 2% instances), advcl (17; 1% instances), parataxis (14; 1% instances), obl:unmarked (7; 1% instances), obj (6; 0% instances), xcomp (6; 0% instances), csubj (4; 0% instances), nsubj:pass (3; 0% instances), orphan (3; 0% instances), ccomp (2; 0% instances), compound (1; 0% instances), compound:prt (1; 0% instances), dep (1; 0% instances), dislocated (1; 0% instances), expl (1; 0% instances), nsubj:outer (1; 0% instances), vocative (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: PRON (339; 27% instances), PUNCT (333; 27% instances), NOUN (101; 8% instances), ADV (90; 7% instances), SCONJ (76; 6% instances), AUX (65; 5% instances), PART (57; 5% instances), INTJ (51; 4% instances), CCONJ (42; 3% instances), VERB (37; 3% instances), PROPN (29; 2% instances), ADJ (7; 1% instances), ADP (5; 0% instances), DET (5; 0% instances), NUM (4; 0% instances), SYM (1; 0% instances)