home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: X

There are 115 X lemmas (3%), 322 X types (6%) and 40246 X tokens (29%). Out of 15 observed tags, the rank of X is: 6 in number of lemmas, 5 in number of types and 1 in number of tokens.

The 10 most frequent X lemmas: #, //, <, {, }, [, |c, ||, ], >+

The 10 most frequent X types: #, //, <, {, }, [, |c, ||, ], >+

The 10 most frequent ambiguous lemmas: X (X 411, INTJ 6, DET 3, NUM 1), ma (PART 28, X 3), per (ADP 6, X 3), cup (NOUN 3, X 2), dem (PRON 2220, PART 42, X 2), na (AUX 2025, X 2, ADP 1), di (DET 2816, X 1), gbogbo (ADV 1, X 1), husband (NOUN 33, X 1), ka (PRON 1, X 1)

The 10 most frequent ambiguous types: ma (PART 28, X 8), b~ (X 7, DET 1), ba (PART 15, X 5), e (PRON 1564, X 5, DET 1), a (DET 127, PRON 22, X 3, NOUN 1), o~ (X 3, NUM 1), per (ADP 6, X 3), wa (INTJ 12, X 3), da (X 2, DET 1), de (PRON 1224, X 2)

Morphology

The form / lemma ratio of X is 2.800000 (the average of all parts of speech is 1.162341).

The 1st highest number of forms (210) was observed with the lemma “X”: Adigo, Agwan~, Alap~, Ala~, Ea~, Fren~, Fr~, Had~, Kw~, Lil~, Max~, Om~, Oria~, RI~, STP~, X, a, abf, ab~, ak~, ala, almo~, al~, anyb~, ar~, avera~, aw~, a~, ba, ban~, ba~, be~, bi~, brin~, bro~, br~, bu~, b~, ca, ca~, chea~, checkli~, chi~, ch~, cle~, conne~, con~, coun~, co~, cr~, cu~, c~, da, de~, di~, do~, du~, d~, e, eh, en~, epurutepu, etin~, ev, everyti~, everyt~, ev~, exa~, e~, fa, fai~, fe~, fini~, fin~, fi~, fore~, for~, fo~, f~, ga~, gbu~, ge, gene~, ge~, ghet~, giti, gi~, gm~, gover~, go~, gu~, g~, hav~, hel~, hip~, ho~, hub~, huma~, h~, im~, inf~, ingred~, insi~, inst~, i~, kambia, k~, lafs~, la~, le~, lit~, li~, ma, mad~, mana~, ma~, med~, me~, mil~, mir~, mon~, mor~, mow~, mo~, mu~, m~, nai~, ne, nikan, norm~, not~, no~, nso, nu, num~, n~, ogbeni, origi~, ori~, oro~, over~, o~, pala~, pa~, pelu, peo~, pers~, pe~, pik~, pi~, pla~, pol~, post, pre~, profe~, pro~, pur~, pu~, p~, re, reach, repre~, res~, re~, r~, sab~, sa~, se~, shere, sh~, sin~, sis~, si~, sle~, sm~, som~, so~, spe~, spu~, sp~, st~, su~, swe~, sy~, s~, tawon, ta~, thirt~, thou~, ti~, traffi~, tre~, tri~, tu, t~, una, under~, un~, wa, wa~, wet~, we~, wit~, wi~, wom~, wor~, wo~, wu~, w~, zaga.

The 2nd highest number of forms (2) was observed with the lemma “//”: //, }.

The 3rd highest number of forms (2) was observed with the lemma “>”: <, >.

X occurs with 7 features: ExtPos (7; 0% instances), PronType (7; 0% instances), Case (6; 0% instances), Person (6; 0% instances), PartType (4; 0% instances), Number (3; 0% instances), NumType (1; 0% instances)

X occurs with 11 feature-value pairs: Case=Nom, ExtPos=ADV, ExtPos=PROPN, NumType=Card, Number=Plur, Number=Sing, PartType=Cop, Person=2, Person=3, PronType=Dem, PronType=Prs

X occurs with 9 feature combinations. The most frequent feature combination is _ (40227 tokens). Examples: #, //, <, {, }, [, |c, ||, ], >+

Relations

X nodes are attached to their parents using 25 different relations: dep (39713; 99% instances), reparandum (285; 1% instances), flat:foreign (97; 0% instances), root (41; 0% instances), obj (25; 0% instances), flat (15; 0% instances), nmod (9; 0% instances), obl:mod (8; 0% instances), discourse (7; 0% instances), compound (5; 0% instances), conj (5; 0% instances), fixed (4; 0% instances), nsubj (4; 0% instances), parataxis (4; 0% instances), xcomp (4; 0% instances), acl:relcl (3; 0% instances), compound:svc (3; 0% instances), obl:arg (3; 0% instances), advcl:cleft (2; 0% instances), compound:redup (2; 0% instances), parataxis:conj (2; 0% instances), parataxis:parenth (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), vocative (1; 0% instances)

Parents of X nodes belong to 16 different parts of speech: VERB (20551; 51% instances), NOUN (7267; 18% instances), ADV (2593; 6% instances), PRON (2569; 6% instances), ADJ (1565; 4% instances), AUX (1071; 3% instances), PROPN (1032; 3% instances), INTJ (931; 2% instances), X (851; 2% instances), NUM (457; 1% instances), DET (397; 1% instances), ADP (374; 1% instances), PART (294; 1% instances), SCONJ (183; 0% instances), CCONJ (70; 0% instances), (41; 0% instances)

39868 (99%) X nodes are leaves.

57 (0%) X nodes have one child.

131 (0%) X nodes have two children.

190 (0%) X nodes have three or more children.

The highest child degree of a X node is 26.

Children of X nodes are attached using 33 different relations: dep (757; 65% instances), flat:foreign (112; 10% instances), det (44; 4% instances), nsubj (43; 4% instances), discourse (28; 2% instances), aux (27; 2% instances), case (21; 2% instances), nmod (17; 1% instances), cop (14; 1% instances), flat (13; 1% instances), advmod (12; 1% instances), conj (12; 1% instances), amod (11; 1% instances), reparandum (9; 1% instances), cc (6; 1% instances), dislocated (6; 1% instances), acl (4; 0% instances), nmod:poss (4; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), acl:relcl (2; 0% instances), compound:redup (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), parataxis:conj (2; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), compound:svc (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances), obl:mod (1; 0% instances), parataxis:discourse (1; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (851; 73% instances), PRON (56; 5% instances), DET (45; 4% instances), AUX (41; 4% instances), NOUN (38; 3% instances), INTJ (21; 2% instances), VERB (19; 2% instances), ADP (18; 2% instances), ADJ (16; 1% instances), ADV (16; 1% instances), PART (13; 1% instances), SCONJ (13; 1% instances), CCONJ (8; 1% instances), PROPN (7; 1% instances), NUM (2; 0% instances)