Treebank Statistics: UD_Serbian-SET: POS Tags: AUX
There are 2 AUX
lemmas (0%), 36 AUX
types (0%) and 6203 AUX
tokens (6%).
Out of 17 observed tags, the rank of AUX
is: 16 in number of lemmas, 11 in number of types and 7 in number of tokens.
The 10 most frequent AUX
lemmas: biti, hteti
The 10 most frequent AUX
types: je, su, će, bi, nije, biti, bio, bude, smo, bilo
The 10 most frequent ambiguous lemmas: biti (AUX 5637, VERB 8), hteti (AUX 566, VERB 5)
The 10 most frequent ambiguous types: je (AUX 3191, PRON 12), bilo (AUX 68, PART 34, CCONJ 2, VERB 1), sam (AUX 45, ADJ 9), biće (AUX 41, VERB 5), jeste (AUX 14, VERB 1)
- je
- bilo
- AUX 68: Osam od ovih četrdeset bilo je na univerzitetima Evrope i Amerike .
- PART 34: U Bulgartabaku su odbili bilo kakav komentar .
- CCONJ 2: Tu je i etnička komponenta , s obzirom da pojedinci iz različitih zajednica mogu da kažu da je diskriminacija – bilo sada ili ranije – uticala na njihovu mogućnost ostvarivanja koristi od privatizacije .
- VERB 1: Na Divljem zapadu nekad bilo .
- sam
- biće
- jeste
Morphology
The form / lemma ratio of AUX
is 18.000000 (the average of all parts of speech is 1.845258).
The 1st highest number of forms (29) was observed with the lemma “biti”: Bićemo, bi, bih, bila, bile, bili, bilo, bio, bismo, biti, bivaju, biće, bude, budemo, budi, budu, je, jesam, jest, jeste, jesu, nije, nisam, nismo, nisu, sam, smo, ste, su.
The 2nd highest number of forms (7) was observed with the lemma “hteti”: neće, nećemo, neću, će, ćemo, ćete, ću.
AUX
occurs with 8 features: VerbForm (6203; 100% instances), Number (6054; 98% instances), Tense (6054; 98% instances), Mood (5750; 93% instances), Person (5750; 93% instances), Gender (304; 5% instances), Voice (304; 5% instances), Polarity (286; 5% instances)
AUX
occurs with 18 feature-value pairs: Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Tense=Fut
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, Voice=Act
AUX
occurs with 23 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin
(3734 tokens).
Examples: je, će, bude, Nije, jeste, Neće, biće, jest, ću
Relations
AUX
nodes are attached to their parents using 12 different relations: aux (4479; 72% instances), cop (1556; 25% instances), root (60; 1% instances), acl (31; 0% instances), xcomp (25; 0% instances), advcl (15; 0% instances), conj (15; 0% instances), ccomp (14; 0% instances), obj (3; 0% instances), parataxis (3; 0% instances), csubj (1; 0% instances), fixed (1; 0% instances)
Parents of AUX
nodes belong to 12 different parts of speech: VERB (3490; 56% instances), ADJ (1491; 24% instances), NOUN (933; 15% instances), ADV (101; 2% instances), (60; 1% instances), AUX (50; 1% instances), DET (35; 1% instances), PROPN (18; 0% instances), PRON (17; 0% instances), NUM (4; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances)
6038 (97%) AUX
nodes are leaves.
20 (0%) AUX
nodes have one child.
28 (0%) AUX
nodes have two children.
117 (2%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 8.
Children of AUX
nodes are attached using 16 different relations: punct (156; 26% instances), nsubj (102; 17% instances), obl (61; 10% instances), mark (55; 9% instances), aux (46; 8% instances), advmod (30; 5% instances), ccomp (28; 5% instances), conj (27; 5% instances), parataxis (26; 4% instances), xcomp (23; 4% instances), discourse (12; 2% instances), advcl (11; 2% instances), obj (11; 2% instances), cc (9; 2% instances), amod (1; 0% instances), case (1; 0% instances)
Children of AUX
nodes belong to 14 different parts of speech: PUNCT (156; 26% instances), NOUN (135; 23% instances), VERB (81; 14% instances), SCONJ (52; 9% instances), AUX (50; 8% instances), ADV (30; 5% instances), ADJ (21; 4% instances), PROPN (20; 3% instances), CCONJ (16; 3% instances), DET (16; 3% instances), PRON (11; 2% instances), PART (8; 1% instances), ADP (2; 0% instances), NUM (1; 0% instances)