home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GENTLE: POS Tags: X

There are 111 X lemmas (3%), 115 X types (3%) and 300 X tokens (2%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.

The 10 most frequent X types: 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.

The 10 most frequent ambiguous lemmas: schole (X 2, NOUN 1), school (NOUN 34, VERB 6, X 1), next (ADJ 38, ADV 6, X 1)

The 10 most frequent ambiguous types: schole (X 2, NOUN 1), school (NOUN 29, VERB 1, X 1), ever (ADV 9, X 1), next (ADJ 36, ADV 6, X 1), one (NUM 25, NOUN 6, PRON 3, X 1), right (ADJ 14, ADV 5, NOUN 5, INTJ 1, X 1)

Morphology

The form / lemma ratio of X is 1.036036 (the average of all parts of speech is 1.147634).

The 1st highest number of forms (5) was observed with the lemma “_”: ever, one, right, self, ware.

The 2nd highest number of forms (1) was observed with the lemma “(1)”: (1).

The 3rd highest number of forms (1) was observed with the lemma “(2)”: (2).

X occurs with 1 features: Abbr (1; 0% instances)

X occurs with 1 feature-value pairs: Abbr=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (299 tokens). Examples: 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.

Relations

X nodes are attached to their parents using 11 different relations: discourse (231; 77% instances), conj (27; 9% instances), nmod (16; 5% instances), root (8; 3% instances), goeswith (5; 2% instances), obl (5; 2% instances), appos (4; 1% instances), obj (1; 0% instances), obl:agent (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: NOUN (134; 45% instances), VERB (55; 18% instances), PROPN (45; 15% instances), X (40; 13% instances), ADJ (9; 3% instances), (8; 3% instances), PRON (4; 1% instances), ADV (3; 1% instances), DET (1; 0% instances), NUM (1; 0% instances)

235 (78%) X nodes are leaves.

15 (5%) X nodes have one child.

4 (1%) X nodes have two children.

46 (15%) X nodes have three or more children.

The highest child degree of a X node is 9.

Children of X nodes are attached using 10 different relations: punct (56; 26% instances), appos (38; 17% instances), compound (30; 14% instances), conj (28; 13% instances), case (25; 11% instances), parataxis (14; 6% instances), amod (13; 6% instances), nmod (13; 6% instances), acl (1; 0% instances), det (1; 0% instances)

Children of X nodes belong to 8 different parts of speech: PUNCT (56; 26% instances), X (40; 18% instances), NOUN (34; 16% instances), PROPN (33; 15% instances), ADJ (28; 13% instances), ADP (25; 11% instances), VERB (2; 1% instances), DET (1; 0% instances)