Treebank Statistics: UD_Naija-NSC: POS Tags: X
There are 115 X
lemmas (3%), 322 X
types (6%) and 40246 X
tokens (29%).
Out of 15 observed tags, the rank of X
is: 6 in number of lemmas, 5 in number of types and 1 in number of tokens.
The 10 most frequent X
lemmas: #, //, <, {, }, [, |c, ||, ], >+
The 10 most frequent X
types: #, //, <, {, }, [, |c, ||, ], >+
The 10 most frequent ambiguous lemmas: X (X 411, INTJ 6, DET 3, NUM 1), ma (PART 28, X 3), per (ADP 6, X 3), cup (NOUN 3, X 2), dem (PRON 2220, PART 42, X 2), na (AUX 2025, X 2, ADP 1), di (DET 2816, X 1), gbogbo (ADV 1, X 1), husband (NOUN 33, X 1), ka (PRON 1, X 1)
The 10 most frequent ambiguous types: ma (PART 28, X 8), b~ (X 7, DET 1), ba (PART 15, X 5), e (PRON 1564, X 5, DET 1), a (DET 127, PRON 22, X 3, NOUN 1), o~ (X 3, NUM 1), per (ADP 6, X 3), wa (INTJ 12, X 3), da (X 2, DET 1), de (PRON 1224, X 2)
- ma
- b~
- ba
- e
- a
- o~
- per
- wa
- da
- de
Morphology
The form / lemma ratio of X
is 2.800000 (the average of all parts of speech is 1.162341).
The 1st highest number of forms (210) was observed with the lemma “X”: Adigo, Agwan~, Alap~, Ala~, Ea~, Fren~, Fr~, Had~, Kw~, Lil~, Max~, Om~, Oria~, RI~, STP~, X, a, abf, ab~, ak~, ala, almo~, al~, anyb~, ar~, avera~, aw~, a~, ba, ban~, ba~, be~, bi~, brin~, bro~, br~, bu~, b~, ca, ca~, chea~, checkli~, chi~, ch~, cle~, conne~, con~, coun~, co~, cr~, cu~, c~, da, de~, di~, do~, du~, d~, e, eh, en~, epurutepu, etin~, ev, everyti~, everyt~, ev~, exa~, e~, fa, fai~, fe~, fini~, fin~, fi~, fore~, for~, fo~, f~, ga~, gbu~, ge, gene~, ge~, ghet~, giti, gi~, gm~, gover~, go~, gu~, g~, hav~, hel~, hip~, ho~, hub~, huma~, h~, im~, inf~, ingred~, insi~, inst~, i~, kambia, k~, lafs~, la~, le~, lit~, li~, ma, mad~, mana~, ma~, med~, me~, mil~, mir~, mon~, mor~, mow~, mo~, mu~, m~, nai~, ne, nikan, norm~, not~, no~, nso, nu, num~, n~, ogbeni, origi~, ori~, oro~, over~, o~, pala~, pa~, pelu, peo~, pers~, pe~, pik~, pi~, pla~, pol~, post, pre~, profe~, pro~, pur~, pu~, p~, re, reach, repre~, res~, re~, r~, sab~, sa~, se~, shere, sh~, sin~, sis~, si~, sle~, sm~, som~, so~, spe~, spu~, sp~, st~, su~, swe~, sy~, s~, tawon, ta~, thirt~, thou~, ti~, traffi~, tre~, tri~, tu, t~, una, under~, un~, wa, wa~, wet~, we~, wit~, wi~, wom~, wor~, wo~, wu~, w~, zaga.
The 2nd highest number of forms (2) was observed with the lemma “//”: //, }.
The 3rd highest number of forms (2) was observed with the lemma “>”: <, >.
X
occurs with 7 features: ExtPos (7; 0% instances), PronType (7; 0% instances), Case (6; 0% instances), Person (6; 0% instances), PartType (4; 0% instances), Number (3; 0% instances), NumType (1; 0% instances)
X
occurs with 11 feature-value pairs: Case=Nom
, ExtPos=ADV
, ExtPos=PROPN
, NumType=Card
, Number=Plur
, Number=Sing
, PartType=Cop
, Person=2
, Person=3
, PronType=Dem
, PronType=Prs
X
occurs with 9 feature combinations.
The most frequent feature combination is _
(40227 tokens).
Examples: #, //, <, {, }, [, |c, ||, ], >+
Relations
X
nodes are attached to their parents using 25 different relations: dep (39713; 99% instances), reparandum (285; 1% instances), flat:foreign (97; 0% instances), root (41; 0% instances), obj (25; 0% instances), flat (15; 0% instances), nmod (9; 0% instances), obl:mod (8; 0% instances), discourse (7; 0% instances), compound (5; 0% instances), conj (5; 0% instances), fixed (4; 0% instances), nsubj (4; 0% instances), parataxis (4; 0% instances), xcomp (4; 0% instances), acl:relcl (3; 0% instances), compound:svc (3; 0% instances), obl:arg (3; 0% instances), advcl:cleft (2; 0% instances), compound:redup (2; 0% instances), parataxis:conj (2; 0% instances), parataxis:parenth (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), vocative (1; 0% instances)
Parents of X
nodes belong to 16 different parts of speech: VERB (20551; 51% instances), NOUN (7267; 18% instances), ADV (2593; 6% instances), PRON (2569; 6% instances), ADJ (1565; 4% instances), AUX (1071; 3% instances), PROPN (1032; 3% instances), INTJ (931; 2% instances), X (851; 2% instances), NUM (457; 1% instances), DET (397; 1% instances), ADP (374; 1% instances), PART (294; 1% instances), SCONJ (183; 0% instances), CCONJ (70; 0% instances), (41; 0% instances)
39868 (99%) X
nodes are leaves.
57 (0%) X
nodes have one child.
131 (0%) X
nodes have two children.
190 (0%) X
nodes have three or more children.
The highest child degree of a X
node is 26.
Children of X
nodes are attached using 33 different relations: dep (757; 65% instances), flat:foreign (112; 10% instances), det (44; 4% instances), nsubj (43; 4% instances), discourse (28; 2% instances), aux (27; 2% instances), case (21; 2% instances), nmod (17; 1% instances), cop (14; 1% instances), flat (13; 1% instances), advmod (12; 1% instances), conj (12; 1% instances), amod (11; 1% instances), reparandum (9; 1% instances), cc (6; 1% instances), dislocated (6; 1% instances), acl (4; 0% instances), nmod:poss (4; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), acl:relcl (2; 0% instances), compound:redup (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), parataxis:conj (2; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), compound:svc (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances), obl:mod (1; 0% instances), parataxis:discourse (1; 0% instances), xcomp (1; 0% instances)
Children of X
nodes belong to 15 different parts of speech: X (851; 73% instances), PRON (56; 5% instances), DET (45; 4% instances), AUX (41; 4% instances), NOUN (38; 3% instances), INTJ (21; 2% instances), VERB (19; 2% instances), ADP (18; 2% instances), ADJ (16; 1% instances), ADV (16; 1% instances), PART (13; 1% instances), SCONJ (13; 1% instances), CCONJ (8; 1% instances), PROPN (7; 1% instances), NUM (2; 0% instances)