home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: PART

There are 3 PART lemmas (0%), 15 PART types (0%) and 5113 PART tokens (2%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s

The 10 most frequent PART types: to, not, n’t, ‘s, ’s, n’t, na, ‘, ’, ta

The 10 most frequent ambiguous lemmas: to (PART 2914, ADP 1696, SCONJ 36, X 1)

The 10 most frequent ambiguous types: to (PART 2722, ADP 1671, SCONJ 36, DET 1, VERB 1, X 1), ’s (AUX 725, PART 444, VERB 81, PRON 43), ’s (PART 196, AUX 169, PRON 13, VERB 9), na (PART 125, INTJ 4, X 1), (PUNCT 160, PART 46, NOUN 2), (PART 28, PUNCT 24), ta (PART 10, ADP 1), s (PART 3, AUX 1, NOUN 1, VERB 1, X 1), a (DET 3654, ADP 2, PART 2, SCONJ 2, AUX 1, NOUN 1, PROPN 1), do (AUX 388, VERB 251, NOUN 3, PROPN 3, PART 1)

Morphology

The form / lemma ratio of PART is 5.000000 (the average of all parts of speech is 1.237215).

The 1st highest number of forms (6) was observed with the lemma “to”: a, do, na, ta, the, to.

The 2nd highest number of forms (5) was observed with the lemma “’s”: ’, ‘s, s, ’, ’s.

The 3rd highest number of forms (4) was observed with the lemma “not”: n’t, n`t, not, n’t.

PART occurs with 3 features: Polarity (1480; 29% instances), Style (10; 0% instances), Typo (7; 0% instances)

PART occurs with 3 feature-value pairs: Polarity=Neg, Style=Coll, Typo=Yes

PART occurs with 5 feature combinations. The most frequent feature combination is _ (3617 tokens). Examples: to, ‘s, ’s, na, ‘, ’, a

Relations

PART nodes are attached to their parents using 12 different relations: mark (2869; 56% instances), advmod (1451; 28% instances), case (719; 14% instances), xcomp (28; 1% instances), conj (12; 0% instances), root (10; 0% instances), acl (8; 0% instances), reparandum (6; 0% instances), advcl (4; 0% instances), parataxis (3; 0% instances), orphan (2; 0% instances), nmod (1; 0% instances)

Parents of PART nodes belong to 14 different parts of speech: VERB (3755; 73% instances), NOUN (485; 9% instances), PROPN (438; 9% instances), ADJ (243; 5% instances), AUX (57; 1% instances), ADV (54; 1% instances), PRON (39; 1% instances), NUM (14; 0% instances), (10; 0% instances), DET (6; 0% instances), INTJ (5; 0% instances), ADP (3; 0% instances), PART (2; 0% instances), X (2; 0% instances)

5062 (99%) PART nodes are leaves.

32 (1%) PART nodes have one child.

7 (0%) PART nodes have two children.

12 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 9.

Children of PART nodes are attached using 13 different relations: punct (42; 42% instances), advmod (12; 12% instances), cc (12; 12% instances), cop (9; 9% instances), nsubj (8; 8% instances), mark (5; 5% instances), discourse (4; 4% instances), obl (3; 3% instances), case (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), nsubj:outer (1; 1% instances), parataxis (1; 1% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (42; 42% instances), CCONJ (12; 12% instances), ADV (10; 10% instances), AUX (9; 9% instances), PRON (8; 8% instances), SCONJ (5; 5% instances), DET (4; 4% instances), INTJ (4; 4% instances), PART (2; 2% instances), ADJ (1; 1% instances), ADP (1; 1% instances), NOUN (1; 1% instances), VERB (1; 1% instances)