home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-NorthernAutogramm: POS Tags: PART

There are 21 PART lemmas (4%), 21 PART types (2%) and 181 PART tokens (4%). Out of 16 observed tags, the rank of PART is: 8 in number of lemmas, 10 in number of types and 7 in number of tokens.

The 10 most frequent PART lemmas: ta, gàː, naː, ba, baːbù, kòː, dai, àkwai, bâː, na

The 10 most frequent PART types: ta, gàː, nàː, ba, baːbù, kòː, dai, àkwai, bâː, bàː

The 10 most frequent ambiguous lemmas: ta (PART 26, PRON 11), bâː (PART 7, VERB 4), (ADP 60, SCONJ 36, CCONJ 24, PART 3)

The 10 most frequent ambiguous types: ta (PART 30, PRON 4), nàː (PART 22, AUX 1), bâː (PART 7, VERB 4), naː (AUX 14, PART 5), (ADP 60, SCONJ 36, CCONJ 24, PART 3)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.465753).

The 1st highest number of forms (2) was observed with the lemma “na”: na, ta.

The 2nd highest number of forms (2) was observed with the lemma “naː”: naː, nàː.

The 3rd highest number of forms (1) was observed with the lemma “ba”: ba.

PART occurs with 7 features: Polarity (50; 28% instances), Aspect (26; 14% instances), Definite (7; 4% instances), PartType (7; 4% instances), Gender (4; 2% instances), Number (4; 2% instances), ExtPos (3; 2% instances)

PART occurs with 7 feature-value pairs: Aspect=Iter, Definite=Cons, ExtPos=NOUN, Gender=Fem, Number=Sing, PartType=Int, Polarity=Neg

PART occurs with 9 feature combinations. The most frequent feature combination is _ (92 tokens). Examples: gàː, nàː, kòː, dai, àkwai, naː, wai, dà, hakàn, hwa

Relations

PART nodes are attached to their parents using 12 different relations: advmod (54; 30% instances), root (44; 24% instances), discourse (28; 15% instances), dep (18; 10% instances), parataxis (14; 8% instances), conj (9; 5% instances), case (8; 4% instances), xcomp (2; 1% instances), acl:relcl (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), reparandum (1; 1% instances)

Parents of PART nodes belong to 10 different parts of speech: VERB (77; 43% instances), (44; 24% instances), NOUN (21; 12% instances), PART (14; 8% instances), PRON (13; 7% instances), PROPN (5; 3% instances), NUM (3; 2% instances), ADV (2; 1% instances), AUX (1; 1% instances), DET (1; 1% instances)

88 (49%) PART nodes are leaves.

11 (6%) PART nodes have one child.

8 (4%) PART nodes have two children.

74 (41%) PART nodes have three or more children.

The highest child degree of a PART node is 10.

Children of PART nodes are attached using 22 different relations: punct (102; 32% instances), nsubj (64; 20% instances), xcomp (42; 13% instances), discourse (16; 5% instances), advcl:cleft (15; 5% instances), dislocated (14; 4% instances), advmod (11; 3% instances), obl (9; 3% instances), parataxis (9; 3% instances), conj (7; 2% instances), advcl (6; 2% instances), fixed (5; 2% instances), ccomp (3; 1% instances), vocative (3; 1% instances), cc (2; 1% instances), obl:arg (2; 1% instances), compound (1; 0% instances), cop (1; 0% instances), dep (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), reparandum (1; 0% instances)

Children of PART nodes belong to 15 different parts of speech: PUNCT (102; 32% instances), NOUN (77; 24% instances), VERB (41; 13% instances), ADV (28; 9% instances), PRON (24; 8% instances), PART (14; 4% instances), INTJ (10; 3% instances), CCONJ (5; 2% instances), AUX (4; 1% instances), PROPN (3; 1% instances), DET (2; 1% instances), NUM (2; 1% instances), X (2; 1% instances), ADJ (1; 0% instances), SCONJ (1; 0% instances)