Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: AUX
There are 8 AUX lemmas (0%), 89 AUX types (1%) and 1332 AUX tokens (2%).
Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 9 in number of types and 13 in number of tokens.
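Counts and ranks of this kind could be derived along the following lines. This is a minimal sketch, not the script that produced this page: the helper names `pos_counts` and `rank`, the case-folding of forms into types, and the toy token list are all assumptions made for illustration. In practice the (form, lemma, tag) triples would come from the FORM, LEMMA and UPOS columns of the treebank's CoNLL-U files.

```python
from collections import defaultdict


def pos_counts(tokens):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for (form, lemma, upos) triples."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    totals = defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form.lower())  # assumption: a "type" is a case-folded form
        totals[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), totals[t]) for t in totals}


def rank(counts, upos, index):
    """1-based rank of `upos` when tags are sorted by counts[tag][index], descending."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(upos) + 1


# Hypothetical toy data, not taken from the treebank.
toy = [
    ("vai", "ir", "AUX"),
    ("é", "ser", "AUX"),
    ("É", "ser", "AUX"),
    ("subir", "subir", "VERB"),
    ("cair", "cair", "VERB"),
    ("caiu", "cair", "VERB"),
    ("subiu", "subir", "VERB"),
]
counts = pos_counts(toy)
print(counts["AUX"])            # (lemmas, types, tokens) for AUX
print(rank(counts, "AUX", 2))   # rank of AUX by token count
```

Ties are broken by first occurrence in this sketch; the page does not say how ties are handled.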
All 8 AUX lemmas, most frequent first: ser, estar, ir, ter, poder, dever, haver, vir
The 10 most frequent AUX types: é, vai, está, tá, foi, foram, será, ser, estão, ta
The 10 most frequent ambiguous lemmas: ser (AUX 683, VERB 24, NOUN 1), estar (AUX 386, VERB 3), ir (AUX 202, VERB 93), ter (VERB 211, AUX 45), poder (VERB 74, AUX 7, NOUN 3), dever (VERB 42, AUX 3), haver (VERB 26, AUX 3), vir (VERB 54, AUX 3)
The 10 most frequent ambiguous types: é (AUX 301, VERB 5), vai (AUX 97, VERB 36), está (AUX 92, VERB 1), tá (AUX 62, INTJ 1), foi (AUX 68, VERB 13), foram (AUX 48, VERB 2), ser (AUX 42, VERB 3, NOUN 1), vamos (AUX 21, VERB 2), vou (AUX 27, VERB 2), era (AUX 19, NOUN 1)
- é
- vai
- está
- tá
- foi
- foram
- ser
  - AUX 42: #vale5 se acreditar que corrige 27,45 pode ser um bom pto …
  - VERB 3: @CaciqueInvest Pode ser mas bateu em o Obj eu arrisco … petr4 a 12,50 foi o mesmo , quem segue sabe …
  - NOUN 1: Jean Willas tá esperneando em o TT por causa de a debandada de o PMDB em a votação de a Petr4 … Espera q vem mais , ser nojento ! !
- vamos
- vou
- era
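The ambiguity lists above pair each lemma or type with the tags it was observed under. A minimal sketch of that tally follows; the function name `ambiguous` and the toy observations are hypothetical, and the page's own selection and ordering rules are not documented here, so this only illustrates the general idea.

```python
from collections import Counter, defaultdict


def ambiguous(items, target="AUX"):
    """items: iterable of (key, upos) pairs, where key is a lemma or a type.

    Returns {key: Counter of tags} for keys seen with `target`
    and with at least one other tag.
    """
    by_key = defaultdict(Counter)
    for key, upos in items:
        by_key[key][upos] += 1
    return {k: c for k, c in by_key.items() if target in c and len(c) > 1}


# Hypothetical toy observations, not the treebank's real counts.
toy = (
    [("ser", "AUX")] * 3
    + [("ser", "VERB")]
    + [("estar", "AUX")] * 2
    + [("ir", "VERB")]
)
result = ambiguous(toy)
for key, tags in result.items():
    print(key, tags.most_common())
```

Only "ser" is reported for the toy data, because "estar" is seen with AUX alone and "ir" is never seen with AUX.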
Morphology
The form / lemma ratio of AUX is 11.125000 (the average of all parts of speech is 1.238049).
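The reported ratio is consistent with dividing the number of AUX types by the number of AUX lemmas given at the top of the page (89 and 8). The check below assumes that this is how the ratio is defined; the function name `form_lemma_ratio` is made up for the example.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma (assumed definition)."""
    return n_types / n_lemmas


# 89 AUX types and 8 AUX lemmas, as reported at the top of this page.
aux_ratio = form_lemma_ratio(89, 8)
print(f"{aux_ratio:.6f}")  # 11.125000
```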
The 1st highest number of forms (31) was observed with the lemma “ser”: =, e, eh, era, eram, f, fo, foi, for, foram, fosse, fossem, fui, sao, se, seja, sejam, sendo, ser, sera, serem, seria, seriam, sermos, será, serão, sido, sou, são, sãos, é.
The 2nd highest number of forms (26) was observed with the lemma “estar”: eat, esta, estamos, estao, estar, estarei, estaremos, estaria, estava, estavam, esteja, esteve, estiver, estivesse, estou, está, estão, restou, ta, tais, tao, tava, to, tou, tá, tô.
The 3rd highest number of forms (11) was observed with the lemma “ir”: foram, fui, ia, iria, irá, irão, vai, vamos, vou, vá, vão.
AUX occurs with 8 features: VerbForm (1332; 100% instances), Number (1248; 94% instances), Person (1243; 93% instances), Mood (1239; 93% instances), Tense (1207; 91% instances), Abbr (149; 11% instances), Typo (94; 7% instances), Gender (5; 0% instances)
AUX occurs with 20 feature-value pairs: Abbr=Yes, Gender=Masc, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part
AUX occurs with 45 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (569 tokens).
Examples: é, vai, está, =, tem, vamos, deve, vem, vão, foram
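Features, feature-value pairs and feature combinations can all be tallied from the FEATS column, where pairs are joined with `|`. The sketch below assumes that input format; the function name `feature_stats` and the toy FEATS strings are hypothetical and do not reproduce the treebank's real distribution.

```python
from collections import Counter


def feature_stats(feats_column):
    """Count feature names, feature=value pairs and whole combinations.

    feats_column: iterable of FEATS strings such as
    "Mood=Ind|Number=Sing" or "_" for tokens without features.
    """
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name = pair.split("=", 1)[0]
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Hypothetical toy FEATS values for four AUX tokens.
finite = "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"
toy = [finite, finite, "VerbForm=Inf", "Abbr=Yes|VerbForm=Inf"]
features, pairs, combos = feature_stats(toy)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```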
Relations
AUX nodes are attached to their parents using 12 different relations: cop (724; 54% instances), aux (414; 31% instances), aux:pass (101; 8% instances), discourse (52; 4% instances), parataxis (10; 1% instances), fixed (8; 1% instances), root (7; 1% instances), conj (5; 0% instances), xcomp (5; 0% instances), ccomp (3; 0% instances), acl (2; 0% instances), acl:relcl (1; 0% instances)
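The percentages in this list are each relation's share of all AUX tokens, rounded to a whole number, which is why relations with only a handful of instances show as 0%. The short check below recomputes them from the counts given above; the variable names are chosen for the example only.

```python
# Relation counts for AUX nodes, copied from the list above.
relations = {
    "cop": 724, "aux": 414, "aux:pass": 101, "discourse": 52,
    "parataxis": 10, "fixed": 8, "root": 7, "conj": 5,
    "xcomp": 5, "ccomp": 3, "acl": 2, "acl:relcl": 1,
}

total = sum(relations.values())
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

The counts sum to 1332, the total number of AUX tokens, so every AUX node is accounted for by exactly one incoming relation.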
Parents of AUX nodes belong to 11 different parts of speech: VERB (626; 47% instances), NOUN (291; 22% instances), ADJ (197; 15% instances), ADV (59; 4% instances), PROPN (49; 4% instances), PRON (38; 3% instances), NUM (36; 3% instances), SYM (20; 2% instances), no parent, i.e. the AUX node is the root (7; 1% instances), CCONJ (6; 0% instances), X (3; 0% instances)
1293 (97%) AUX nodes are leaves.
14 (1%) AUX nodes have one child.
11 (1%) AUX nodes have two children.
14 (1%) AUX nodes have three or more children.
The highest child degree of an AUX node is 8.
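The child degree of a node is the number of tokens whose HEAD points to it, so the leaf and child counts above can be obtained by counting HEAD values per sentence. The following is a minimal sketch under that assumption; `child_degrees` and the two toy sentences are hypothetical and not drawn from the treebank.

```python
from collections import Counter


def child_degrees(sentence, upos="AUX"):
    """sentence: list of (token_id, head_id, upos) triples for one sentence.

    Returns the number of children of every node tagged `upos`.
    """
    n_children = Counter(head for _, head, _ in sentence)
    return [n_children[tid] for tid, _, tag in sentence if tag == upos]


# Hypothetical toy sentences: in the first the AUX is a leaf (a copula
# attached to an adjective), in the second the AUX is the root.
sent1 = [(1, 3, "PRON"), (2, 3, "AUX"), (3, 0, "ADJ"), (4, 3, "PUNCT")]
sent2 = [(1, 2, "PRON"), (2, 0, "AUX"), (3, 2, "PUNCT")]

degrees = child_degrees(sent1) + child_degrees(sent2)
leaves = sum(1 for d in degrees if d == 0)
print(degrees, leaves, max(degrees))
```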
Children of AUX nodes are attached using 14 different relations: punct (29; 31% instances), parataxis (14; 15% instances), nsubj (11; 12% instances), advmod (9; 10% instances), obl (6; 6% instances), conj (4; 4% instances), obj (4; 4% instances), xcomp (4; 4% instances), cc (3; 3% instances), mark (3; 3% instances), vocative (3; 3% instances), discourse (2; 2% instances), advcl (1; 1% instances), dep (1; 1% instances)
Children of AUX nodes belong to 13 different parts of speech: PUNCT (29; 31% instances), VERB (15; 16% instances), NOUN (13; 14% instances), PROPN (9; 10% instances), ADV (8; 9% instances), PRON (5; 5% instances), SYM (4; 4% instances), CCONJ (3; 3% instances), SCONJ (3; 3% instances), X (2; 2% instances), ADJ (1; 1% instances), ADP (1; 1% instances), NUM (1; 1% instances)