home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Pesh-ChibErgIS: POS Tags: PART

There are 46 PART lemmas (9%), 48 PART types (6%) and 317 PART tokens (13%). Out of 15 observed tags, the rank of PART is: 4 in number of lemmas, 4 in number of types and 3 in number of tokens.

The 10 most frequent PART lemmas: =ma, =kan, =na, =pe, =hã, ãma, =ras, =sa, =mã, nĩhã

The 10 most frequent PART types: =ma, =kan, =na, =pe, =hã, ãma, =ras, =sa, =mã, =pra

The 10 most frequent ambiguous lemmas: =ma (PART 57, SCONJ 26, ADP 14), =kan (PART 47, ADP 9, SCONJ 4), =na (PART 23, DET 1), ãma (PART 14, INTJ 1), =ras (PART 13, SCONJ 13), =mã (SCONJ 14, PART 12, ADP 3), =ra (ADP 12, PART 6, SCONJ 3), =ri (ADP 7, PART 4, CCONJ 1), nẽʔ (PART 4, VERB 3), =ken (ADP 7, PART 3, SCONJ 3)

The 10 most frequent ambiguous types: =ma (PART 57, SCONJ 26, ADP 14), =kan (PART 45, ADP 8, SCONJ 4), =na (PART 23, DET 1), ãma (PART 14, INTJ 1), =ras (PART 13, SCONJ 13), =mã (SCONJ 14, PART 12, ADP 3), =ra (ADP 12, PART 6, SCONJ 3), =ri (ADP 7, AUX 7, PART 4, CCONJ 1), nẽʔ (PART 4, VERB 3), =ken (ADP 7, PART 3, SCONJ 3)

Morphology

The form / lemma ratio of PART is 1.043478 (the average of all parts of speech is 1.578544).

The 1st highest number of forms (2) was observed with the lemma “=kan”: =kan, =kanka.

The 2nd highest number of forms (2) was observed with the lemma “nĩhã”: nĩhã, ũtanĩhã.

The 3rd highest number of forms (1) was observed with the lemma “=ha”: =ha.

PART occurs with 2 features: Clusivity (1; 0% instances), PronType (1; 0% instances)

PART occurs with 2 feature-value pairs: Clusivity=Ex, PronType=Int

PART occurs with 3 feature combinations. The most frequent feature combination is _ (315 tokens). Examples: =ma, =kan, =na, =pe, =hã, ãma, =ras, =sa, =mã, =pra

Relations

PART nodes are attached to their parents using 17 different relations: case (116; 37% instances), advmod (77; 24% instances), discourse (62; 20% instances), mark (28; 9% instances), compound (6; 2% instances), cc (5; 2% instances), reparandum (5; 2% instances), dep (3; 1% instances), root (3; 1% instances), ccomp (2; 1% instances), dislocated (2; 1% instances), obj (2; 1% instances), obl:arg (2; 1% instances), acl (1; 0% instances), appos (1; 0% instances), obl (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes belong to 9 different parts of speech: VERB (107; 34% instances), PRON (101; 32% instances), NOUN (73; 23% instances), PART (19; 6% instances), X (6; 2% instances), ADV (5; 2% instances), (3; 1% instances), NUM (2; 1% instances), AUX (1; 0% instances)

251 (79%) PART nodes are leaves.

41 (13%) PART nodes have one child.

12 (4%) PART nodes have two children.

13 (4%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 16 different relations: punct (31; 28% instances), dep (22; 20% instances), reparandum (10; 9% instances), discourse (9; 8% instances), det (8; 7% instances), compound (7; 6% instances), mark (6; 5% instances), advmod (5; 5% instances), cop (2; 2% instances), nsubj (2; 2% instances), obl:mod (2; 2% instances), orphan (2; 2% instances), advcl (1; 1% instances), case (1; 1% instances), ccomp (1; 1% instances), obl (1; 1% instances)

Children of PART nodes belong to 13 different parts of speech: PUNCT (31; 28% instances), PRON (20; 18% instances), PART (19; 17% instances), NOUN (12; 11% instances), ADV (7; 6% instances), X (6; 5% instances), ADP (5; 5% instances), AUX (2; 2% instances), DET (2; 2% instances), INTJ (2; 2% instances), VERB (2; 2% instances), CCONJ (1; 1% instances), NUM (1; 1% instances)