Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: ADP
There are 85 ADP
lemmas (2%), 216 ADP
types (3%) and 1688 ADP
tokens (9%).
Out of 16 observed tags, the rank of ADP
is: 8 in number of lemmas, 8 in number of types and 6 in number of tokens.
The 10 most frequent ADP
lemmas: à, de, avec, pour, dans, en, sur, comme, au, contre
The 10 most frequent ADP
types: fi, de, m3a, f, b, pour, ta3, a, 3la, bi
The 10 most frequent ambiguous lemmas: à (ADP 325, DET 3, PRON 2, INTJ 1, PART 1), de (ADP 298, DET 6, PART 1, PRON 1, VERB 1), avec (ADP 270, PART 1, PRON 1), pour (ADP 137, ADV 2, INTJ 1), en (ADP 87, PRON 12, NOUN 1), sur (ADP 65, ADV 1), comme (ADP 62, SCONJ 17, ADV 11, PRON 1), contre (ADP 23, SCONJ 1), d’ (ADP 23, DET 1), du (ADP 22, DET 4, PROPN 1)
The 10 most frequent ambiguous types: fi (ADP 175, VERB 1), m3a (ADP 114, PART 1, PRON 1), pour (ADP 72, ADV 1), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), 3la (ADP 46, PART 1), bi (ADP 45, PRON 1), 3ala (ADP 39, NOUN 1), l (DET 303, ADP 37, PRON 8, PART 1), en (ADP 36, PRON 17, ADV 1, DET 1, NOUN 1), men (ADP 32, PRON 1)
- fi
- m3a
- pour
- a
- ADP 46: M3a boumediene jusqu’ a la mort
- DET 41: allahoma la tohasibna bima yaf3aloho a tafihin
- VERB 29: 3andou el hak makache football en algerie alors il a bien fait
- AUX 17: حجج حجج حجج حجج حجج حجج comme il a dit n
- INTJ 5: a 3ayeniya vive alg
- PROPN 1: salam 3likoum khawti les algeriens nchallah ya rabi les verts yfarhouna f l a frique du sud w nroho le 2ème tour b rabi nchallah ad3ou m3ana ya r abi amine
- X 1: ana dhad roj3 lamouchaia y3atik saha y a kader 02
- 3la
- bi
- 3ala
- l
- DET 303: m3ak ya sadaane le grand de oum dermane hata l mamat
- ADP 37: Men l’ enfance l l’ adolescence , koul khatra yzidou fe la doze
- PRON 8: ra7 ngoul les sujets l nass 3lihom dor
- PART 1: wallah wahed mahou intersted la bi karim wa la mourad wa l madjid dadou les jouer ta3a al chkara samir nasri mayoukhlesh kima ziani wallah ma yesthakou besh yellaabou fi les quipe national
- en
- ADP 36: 3andou el hak makache football en algerie alors il a bien fait
- PRON 17: inchallah en beynulhum beli keyen el balo 3adna 1 2 3 viva algerien
- ADV 1: allah y3awnkom nchalah tous en somble contre cette pourriture
- DET 1: saha en noum ya saadane ki tel3ab la defense kifmah rah tmarki les buts ? oula yenzel men e smaa
- NOUN 1: bouzid khir men belaid et medjani yest7a9 yel3ab m3a l’ en
- men
Morphology
The form / lemma ratio of ADP
is 2.541176 (the average of all parts of speech is 1.474223).
The 1st highest number of forms (40) was observed with the lemma “de”: 2, 3ad, 3al, 3ala, 3an, 3en, 3la, 3li, b, bah, be, bi, bin, d, d’, de, dial, du, dyal, d’, f, fi, had, m, man, mane, me, men, mena, mene, min, mina, mine, mn, nta3, ta3, ta3a, ta3e, te3, te3e.
The 2nd highest number of forms (38) was observed with the lemma “à”: 3a, 3al, 3ala, 3alla, 3ela, 3la, a, al, ala, au, b, bi, f, f’, fa, fe, fi, fu, fé, f’, il, ila, illa, l, la, le, li, l’, m, men, min, mina, mine, nta3, ta3, to, à, ب.
The 3rd highest number of forms (26) was observed with the lemma “pour”: 3a, 3achen, 3ala, 3la, al, alla, bach, bache, bah, bech, beh, besh, f, fe, fel, fi, khatar, l, la, li, m3a, por, pou, pour, pr, to.
ADP
occurs with 5 features: AdpType (1662; 98% instances), Polarity (27; 2% instances), Typo (2; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)
ADP
occurs with 5 feature-value pairs: AdpType=Prep
, Gender=Masc
, Number=Sing
, Polarity=Neg
, Typo=Yes
ADP
occurs with 5 feature combinations.
The most frequent feature combination is AdpType=Prep
(1633 tokens).
Examples: fi, de, m3a, f, b, pour, ta3, a, 3la, bi
Relations
ADP
nodes are attached to their parents using 14 different relations: case (1628; 96% instances), mark (28; 2% instances), fixed (7; 0% instances), advmod (5; 0% instances), dep (5; 0% instances), parataxis (4; 0% instances), nsubj (3; 0% instances), cc (2; 0% instances), acl:relcl (1; 0% instances), conj (1; 0% instances), det (1; 0% instances), iobj (1; 0% instances), nmod (1; 0% instances), root (1; 0% instances)
Parents of ADP
nodes belong to 13 different parts of speech: NOUN (970; 57% instances), PROPN (371; 22% instances), VERB (140; 8% instances), PRON (105; 6% instances), ADJ (45; 3% instances), NUM (23; 1% instances), ADV (18; 1% instances), ADP (6; 0% instances), DET (4; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances), (1; 0% instances), X (1; 0% instances)
1661 (98%) ADP
nodes are leaves.
18 (1%) ADP
nodes have one child.
6 (0%) ADP
nodes have two children.
3 (0%) ADP
nodes have three or more children.
The highest child degree of a ADP
node is 4.
Children of ADP
nodes are attached using 13 different relations: fixed (17; 43% instances), nsubj (5; 13% instances), cc (4; 10% instances), obl (3; 8% instances), advmod (2; 5% instances), goeswith (2; 5% instances), conj (1; 3% instances), dislocated (1; 3% instances), nmod (1; 3% instances), obj (1; 3% instances), parataxis (1; 3% instances), punct (1; 3% instances), xcomp (1; 3% instances)
Children of ADP
nodes belong to 10 different parts of speech: ADV (13; 33% instances), ADP (6; 15% instances), NOUN (5; 13% instances), CCONJ (4; 10% instances), PRON (4; 10% instances), PROPN (3; 8% instances), X (2; 5% instances), DET (1; 3% instances), PUNCT (1; 3% instances), VERB (1; 3% instances)