home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Coptic-Scriptorium: POS Tags: PART

There are 37 PART lemmas (1%), 42 PART types (1%) and 2037 PART tokens (4%). Out of 15 observed tags, the rank of PART is: 8 in number of lemmas, 9 in number of types and 9 in number of tokens.

The 10 most frequent PART lemmas: ⲇⲉ, ⲉ, ⲉⲣⲉ, ⲅⲁⲣ, ⲛϭⲓ, ϭⲉ, ⲱ, ⲉⲓⲥ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ

The 10 most frequent PART types: ⲉ, ⲇⲉ, ⲅⲁⲣ, ⲛϭⲓ, ⲛⲧ, ϭⲉ, ⲱ, ⲉⲓⲥ, ⲉⲣⲉ, ⲙⲉⲛ

The 10 most frequent ambiguous lemmas: ⲉ (ADP 1146, PART 393, SCONJ 4, X 1), ⲉⲣⲉ (SCONJ 865, PART 333, AUX 10, ADP 2), ⲅⲁⲣ (PART 207, SCONJ 1), ϭⲉ (PART 53, DET 7, ADV 3), ⲙⲉⲛ (PART 30, CCONJ 2), ⲟⲩⲛ (VERB 33, AUX 15, PART 14), ⲙⲛⲛⲥⲁ (ADP 29, PART 13), ϩⲏⲏⲧⲉ (PART 12, NOUN 2), ⲟⲩⲟⲓ (PART 10, NOUN 9), ⲉⲧⲃⲉ (ADP 178, PART 8)

The 10 most frequent ambiguous types: ⲉ (SCONJ 840, ADP 786, PART 636, PRON 50, AUX 7, NOUN 1, X 1), ⲅⲁⲣ (PART 207, SCONJ 1), ⲛⲧ (SCONJ 104, PART 56, VERB 9), ϭⲉ (PART 53, DET 10, ADV 3), ⲉⲣⲉ (SCONJ 53, PART 34, AUX 10, PRON 6), ⲙⲉⲛ (PART 30, CCONJ 2), ⲟⲩⲛ (VERB 24, PART 14, AUX 11), ⲙⲛⲛⲥⲁ (ADP 15, PART 13), ϩⲏⲏⲧⲉ (PART 12, NOUN 2), ⲉⲧⲃⲉ (ADP 156, PART 8)

Morphology

The form / lemma ratio of PART is 1.135135 (the average of all parts of speech is 1.137647).

The 1st highest number of forms (4) was observed with the lemma “ⲉⲣⲉ”: ⲉ, ⲉⲛⲧ, ⲉⲣⲉ, ⲛⲧ.

The 2nd highest number of forms (3) was observed with the lemma “ⲛ”: ⲙ, ⲙⲙⲟ, ⲛ.

The 3rd highest number of forms (2) was observed with the lemma “ⲉⲛⲉ”: ⲉⲛⲉ, ⲛⲉ.

PART occurs with 2 features: Foreign (915; 45% instances), Polarity (10; 0% instances)

PART occurs with 2 feature-value pairs: Foreign=Yes, Polarity=Neg

PART occurs with 4 feature combinations. The most frequent feature combination is _ (1113 tokens). Examples: ⲉ, ⲛϭⲓ, ⲛⲧ, ϭⲉ, ⲉⲓⲥ, ⲉⲣⲉ, ⲙⲛⲛⲥⲁ, ϩⲏⲏⲧⲉ, ⲉⲧⲃⲉ, ⲟⲩⲟⲓ

Relations

PART nodes are attached to their parents using 12 different relations: advmod (956; 47% instances), mark (765; 38% instances), case (207; 10% instances), discourse (67; 3% instances), fixed (14; 1% instances), ccomp (10; 0% instances), root (8; 0% instances), advcl (6; 0% instances), cc (1; 0% instances), conj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes belong to 10 different parts of speech: VERB (1558; 76% instances), NOUN (312; 15% instances), DET (47; 2% instances), PROPN (45; 2% instances), PRON (37; 2% instances), PART (14; 1% instances), (8; 0% instances), ADV (7; 0% instances), NUM (7; 0% instances), CCONJ (2; 0% instances)

1986 (97%) PART nodes are leaves.

33 (2%) PART nodes have one child.

5 (0%) PART nodes have two children.

13 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 13 different relations: fixed (23; 27% instances), mark (14; 16% instances), vocative (14; 16% instances), punct (12; 14% instances), obl (11; 13% instances), parataxis (3; 3% instances), advcl (2; 2% instances), ccomp (2; 2% instances), advmod (1; 1% instances), cc (1; 1% instances), dislocated (1; 1% instances), nmod (1; 1% instances), nsubj (1; 1% instances)

Children of PART nodes belong to 9 different parts of speech: NOUN (17; 20% instances), PART (14; 16% instances), SCONJ (14; 16% instances), PUNCT (12; 14% instances), PRON (11; 13% instances), CCONJ (7; 8% instances), VERB (5; 6% instances), DET (4; 5% instances), PROPN (2; 2% instances)