Treebank Statistics: UD_Mbya_Guarani-Dooley: POS Tags: NOUN
There are 27 NOUN
lemmas (22%), 28 NOUN
types (21%) and 1278 NOUN
tokens (11%).
Out of 16 observed tags, the rank of NOUN
is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent NOUN
lemmas: _, ava, me, yke’y, a’yxy, akã, bicicleta, couve, enda, jeruxi
The 10 most frequent NOUN
types: _, ava, tyke’y, Neme, Vexa’i, Xakã, bicicleta, couve, ime, jeruxi
The 10 most frequent ambiguous lemmas: _ (PART 2380, VERB 2363, PUNCT 1843, NOUN 1248, SCONJ 1129, PRON 1094, ADP 924, ADV 242, NUM 104, PROPN 99, ADJ 41, CCONJ 32, INTJ 15, DET 11, X 8, AUX 3)
The 10 most frequent ambiguous types: _ (PART 2380, VERB 2363, PUNCT 1843, NOUN 1248, SCONJ 1129, PRON 1094, ADP 924, ADV 242, NUM 104, PROPN 99, ADJ 41, CCONJ 32, INTJ 15, DET 11, X 8, AUX 3)
- _
- PART 2380: _ _ _ _ _ _
- VERB 2363: _ _ _ _ _ _
- PUNCT 1843: _ _ _ _ _ _
- NOUN 1248: _ _ _ _ _ _
- SCONJ 1129: _ _ _ _ _ _ _ _ _ _ _ _ he’i _
- PRON 1094: _ _ _ _ _ _ _ _ _ _
- ADP 924: _ _ _ _ _ _ _ _ _ _
- ADV 242: _ _ _ _ _ _
- NUM 104: _ _ _ _ _ _
- PROPN 99: _ _ _ _ _ _ _ _
- ADJ 41: _ _ _ _ _ _ _ _ _ _
- CCONJ 32: _ _ _ _ _ _ _ _ _ _
- INTJ 15: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 11: _ _ _ _ _
- X 8: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 3: _ Neme _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of NOUN
is 1.037037 (the average of all parts of speech is 1.056000).
The 1st highest number of forms (2) was observed with the lemma “me”: Neme, ime.
The 2nd highest number of forms (1) was observed with the lemma “_”: _.
The 3rd highest number of forms (1) was observed with the lemma “a’yxy”: ta’yxy.
NOUN
occurs with 3 features: Number[psor] (71; 6% instances), Number (17; 1% instances), Clusivity[psor] (12; 1% instances)
NOUN
occurs with 5 feature-value pairs: Clusivity[psor]=Ex
, Clusivity[psor]=In
, Number=Plur
, Number[psor]=Plur
, Number[psor]=Sing
NOUN
occurs with 6 feature combinations.
The most frequent feature combination is _
(1190 tokens).
Examples: _, ava, tyke’y, Vexa’i, Xakã, bicicleta, couve, ime, jeruxi, ka’aguy
Relations
NOUN
nodes are attached to their parents using 15 different relations: obl (430; 34% instances), nsubj (391; 31% instances), obj (243; 19% instances), nmod (136; 11% instances), conj (20; 2% instances), parataxis:rep (15; 1% instances), compound (14; 1% instances), appos (9; 1% instances), vocative (7; 1% instances), acl (3; 0% instances), dislocated (3; 0% instances), parataxis (3; 0% instances), obl:sentcon (2; 0% instances), ccomp (1; 0% instances), root (1; 0% instances)
Parents of NOUN
nodes belong to 5 different parts of speech: VERB (1089; 85% instances), NOUN (184; 14% instances), PRON (3; 0% instances), ADV (1; 0% instances), (1; 0% instances)
484 (38%) NOUN
nodes are leaves.
494 (39%) NOUN
nodes have one child.
204 (16%) NOUN
nodes have two children.
96 (8%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 8.
Children of NOUN
nodes are attached using 22 different relations: case (394; 32% instances), dep:mod (283; 23% instances), nmod (164; 13% instances), nummod (85; 7% instances), mark (56; 4% instances), acl (55; 4% instances), punct (54; 4% instances), amod (30; 2% instances), det (28; 2% instances), compound (22; 2% instances), conj (18; 1% instances), appos (17; 1% instances), cc (13; 1% instances), parataxis (7; 1% instances), advcl (4; 0% instances), flat (4; 0% instances), nsubj (4; 0% instances), obl (3; 0% instances), obl:sentcon (2; 0% instances), advmod (1; 0% instances), aux (1; 0% instances), compound:svc (1; 0% instances)
Children of NOUN
nodes belong to 15 different parts of speech: ADP (394; 32% instances), PART (283; 23% instances), NOUN (184; 15% instances), NUM (86; 7% instances), VERB (71; 6% instances), SCONJ (56; 4% instances), PUNCT (54; 4% instances), PRON (46; 4% instances), ADJ (30; 2% instances), CCONJ (13; 1% instances), DET (13; 1% instances), PROPN (10; 1% instances), INTJ (4; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)