home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Xavante-XDT: POS Tags: X

There are 10 X lemmas (3%), 11 X types (2%) and 63 X tokens (4%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.

The 10 most frequent X lemmas: di, ‘wa, ni, wamhã, zaʔra, hö, na, u, we, ʔre

The 10 most frequent X types: di, wa’wa, ni, wamhã, zaʔra, ‘wa, hö, na, u, wei

The 10 most frequent ambiguous lemmas: di (X 27, AUX 6, PART 2), ‘wa (X 15, PRON 2), ni (X 12, PRON 2), wamhã (SCONJ 4, X 2), zaʔra (PART 3, X 2), (NOUN 1, X 1), na (ADP 32, NOUN 8, X 1), u (ADP 11, X 1), we (ADV 7, X 1), ʔre (PART 2, ADP 1, X 1)

The 10 most frequent ambiguous types: di (X 27, AUX 6, PART 2), ni (X 12, PRON 2), wamhã (SCONJ 4, X 2), zaʔra (PART 3, X 2), ‘wa (PRON 2, X 1), (NOUN 1, X 1), na (ADP 30, X 1), u (ADP 11, X 1), wei (ADV 5, X 1), ʔre (ADP 1, PART 1, X 1)

Morphology

The form / lemma ratio of X is 1.100000 (the average of all parts of speech is 1.290598).

The 1st highest number of forms (2) was observed with the lemma “’wa”: ‘wa, wa’wa.

The 2nd highest number of forms (1) was observed with the lemma “di”: di.

The 3rd highest number of forms (1) was observed with the lemma “hö”: .

X occurs with 2 features: Number (25; 40% instances), Mood (1; 2% instances)

X occurs with 2 feature-value pairs: Mood=Des, Number=Dual

X occurs with 3 feature combinations. The most frequent feature combination is _ (37 tokens). Examples: di, ni, wamhã, zaʔra, na, u, wei, ʔre

Relations

X nodes are attached to their parents using 3 different relations: dep (61; 97% instances), mark (1; 2% instances), root (1; 2% instances)

Parents of X nodes belong to 6 different parts of speech: VERB (40; 63% instances), NOUN (17; 27% instances), ADV (2; 3% instances), PRON (2; 3% instances), PART (1; 2% instances), (1; 2% instances)

62 (98%) X nodes are leaves.

0 (0%) X nodes have one child.

0 (0%) X nodes have two children.

1 (2%) X nodes have three or more children.

The highest child degree of a X node is 4.

Children of X nodes are attached using 4 different relations: advcl (1; 25% instances), dep (1; 25% instances), obl (1; 25% instances), punct (1; 25% instances)

Children of X nodes belong to 4 different parts of speech: ADP (1; 25% instances), PART (1; 25% instances), PUNCT (1; 25% instances), VERB (1; 25% instances)