Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: PRON
There are 191 PRON
lemmas (3%), 491 PRON
types (6%) and 1969 PRON
tokens (10%).
Out of 16 observed tags, the rank of PRON
is: 5 in number of lemmas, 5 in number of types and 4 in number of tokens.
The 10 most frequent PRON
lemmas: qui, je, avec_toi, nous, vous, moi, avec_vous, toi, ce, cette
The 10 most frequent PRON
types: li, m3ak, je, ana, hada, had, il, vous, on, m3akom
The 10 most frequent ambiguous lemmas: qui (PRON 133, ADV 9, ADP 4), vous (PRON 76, VERB 1), ce (PRON 48, DET 8, ADV 1, VERB 1), cette (PRON 46, DET 21), il (PRON 43, DET 1), on (PRON 43, PART 1), quoi (PRON 43, DET 5, INTJ 1), lui (PRON 38, CCONJ 1), tout (PRON 33, DET 19, ADJ 15, ADV 13, NOUN 1), ça (PRON 34, DET 2)
The 10 most frequent ambiguous types: li (PRON 98, ADP 27, DET 2, SCONJ 2), ana (PRON 50, SCONJ 4, ADP 1, X 1), had (PRON 40, DET 2, ADP 1), il (PRON 37, DET 7, ADP 1), on (PRON 37, ADP 1, ADV 1, AUX 1, CCONJ 1), hna (PRON 30, ADV 10), hadi (PRON 25, ADJ 1, DET 1), qui (PRON 24, ADP 1), wach (PRON 20, ADV 12, DET 3), tu (PRON 20, VERB 1)
- li
- PRON 98: kayen ghir morad megheni li nahssen 3awnah parce que rabnouh fi lasio
- ADP 27: 123 vive l’ algerie et le maroc inchalah li l3ab mazyan nssaf9o lo
- DET 2: si li jouj alah ghaleb , kan il baroud
- SCONJ 2: ahsan ma fa3alou li ana al maghrib yatawafar 3la al mala3ib w al fanadi9 w al bounya atahtiya machi kima hna
- ana
- PRON 50: W lakane hakmouni lia ana publicité
- SCONJ 4: kan motaw9a3 ana maroc radi tnadam lahkach 3anda biya tahtiya
- ADP 1: ahsan ma fa3alou li ana al maghrib yatawafar 3la al mala3ib w al fanadi9 w al bounya atahtiya machi kima hna
- X 1: le jour ou en a jouer a om dorman kona 3arfin bli nrbho li ana shaha tatata2aba avant que le match commance w tata2ob mina chaytan golo l sa3dan ki dir kaskita nakhasro
- had
- il
- PRON 37: 3andou wach shih hada Lamouchia ca cerai bien qi’ il retourne au bled
- DET 7: 3andou el hak makache football en algerie alors il a bien fait
- ADP 1: a tous les repsonsables vous allez payer inchallah au lieu de consacrer l’ arget de notre pays a construire des economies fortes et creer des emplois pour nos jeunes ils nous remenent de la racaille !!! allah yen3alkoum ya mesouline il youm e ddine
- on
- PRON 37: on va gagné inchaalah ya rabi kon m3ana
- ADP 1: salam 3likoum ta3alim on algerie makayan walou kolhaja balahabe haka w madabihe ben bouzid ina
- ADV 1: as salamo 3alikom je suis la pour passer 1n grande salut pour nos joueur s oublie sur tout pa matmoure 40 le mieure joueure pour mon opinion w nchallah narb7o zambie 2 ou 3 on fin nas l 7rakta kamel m3akom kom tout les algeriens far7ona nchallah w takabala allho siamana wa siyamakom w a salamo 3alaykom w ra7mato allahi wa barakatoho
- AUX 1: ou yahya walah ma nesmahlek 3and rabi car je travaille pas et G posée un dossier pour avoir une voiture par fasilité la voiture coute 40 milions les frée du dossier m on couté 14 milions et la taxe cout 7 milions sa fait le tous 21 milions pour avoire 40 milions c normale pour un jeune qui a posée son dossier a </em>
- CCONJ 1: il faut gagner le match et vous présentons l’ algerie on nchalah matadarbouche 3la al minha kima 3am li féte
- hna
- hadi
- qui
- wach
- tu
Morphology
The form / lemma ratio of PRON
is 2.570681 (the average of all parts of speech is 1.474223).
The 1st highest number of forms (25) was observed with the lemma “sur_vous”: 3alaicom, 3alaikom, 3alaikoum, 3alaycom;, 3alayekom, 3alayka, 3alaykom, 3alaykoum, 3aleikom, 3aleykoum, 3alihoum, 3alikom, 3alikome, 3alikoum, 3lihkomm, 3likom, 3likome, 3likoum, alaicom, alikom, alikoum, alykoum, fik, fikoum, fikoume.
The 2nd highest number of forms (17) was observed with the lemma “qui”: ali, chkon, chkoun, eli, elli, ki, l, les, lhaw, li, lli, lé, ma, man, mane, qu, qui.
The 3rd highest number of forms (17) was observed with the lemma “vous”: anetoma, antoma, antouma, atna, entoum, entouma, intouma, iyakom, netmana, nta, ntoma, ntouma, ous, t, v, vou, vous.
PRON
occurs with 7 features: PronType (383; 19% instances), Polarity (23; 1% instances), AdpType (17; 1% instances), Typo (3; 0% instances), Person (2; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)
PRON
occurs with 11 feature-value pairs: AdpType=Prep
, Gender=Masc
, Number=Plur
, Person=2
, Person=3
, Polarity=Neg
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Rel
, Typo=Yes
PRON
occurs with 12 feature combinations.
The most frequent feature combination is _
(1550 tokens).
Examples: m3ak, je, ana, il, vous, on, m3akom, c’, hna, nous
Relations
PRON
nodes are attached to their parents using 22 different relations: nsubj (919; 47% instances), obl (407; 21% instances), obj (148; 8% instances), parataxis (130; 7% instances), det (84; 4% instances), expl (58; 3% instances), root (52; 3% instances), nmod (44; 2% instances), iobj (35; 2% instances), dislocated (27; 1% instances), conj (16; 1% instances), appos (14; 1% instances), expl:pv (12; 1% instances), dep (5; 0% instances), discourse (4; 0% instances), fixed (4; 0% instances), acl:relcl (3; 0% instances), ccomp (3; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)
Parents of PRON
nodes belong to 14 different parts of speech: VERB (1318; 67% instances), NOUN (335; 17% instances), PROPN (90; 5% instances), PRON (71; 4% instances), ADJ (61; 3% instances), (52; 3% instances), INTJ (24; 1% instances), ADV (8; 0% instances), ADP (4; 0% instances), NUM (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)
1444 (73%) PRON
nodes are leaves.
329 (17%) PRON
nodes have one child.
129 (7%) PRON
nodes have two children.
67 (3%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 13.
Children of PRON
nodes are attached using 25 different relations: cc (108; 13% instances), nmod (108; 13% instances), case (107; 13% instances), nsubj (91; 11% instances), parataxis (79; 9% instances), advmod (75; 9% instances), amod (50; 6% instances), discourse (48; 6% instances), vocative (36; 4% instances), acl:relcl (31; 4% instances), appos (31; 4% instances), conj (18; 2% instances), obj (16; 2% instances), det (10; 1% instances), dislocated (10; 1% instances), punct (9; 1% instances), mark (6; 1% instances), advcl (5; 1% instances), obl (4; 0% instances), goeswith (3; 0% instances), acl (2; 0% instances), ccomp (2; 0% instances), cop (1; 0% instances), dep (1; 0% instances), reparandum (1; 0% instances)
Children of PRON
nodes belong to 15 different parts of speech: NOUN (161; 19% instances), PROPN (109; 13% instances), CCONJ (108; 13% instances), ADP (105; 12% instances), VERB (87; 10% instances), PRON (71; 8% instances), ADV (63; 7% instances), ADJ (52; 6% instances), INTJ (48; 6% instances), PART (19; 2% instances), DET (9; 1% instances), PUNCT (9; 1% instances), SCONJ (6; 1% instances), X (3; 0% instances), AUX (2; 0% instances)