Treebank Statistics: UD_Maghrebi_Arabic_French-Arabizi: POS Tags: INTJ
There are 138 INTJ lemmas (3%), 234 INTJ types (3%) and 1018 INTJ tokens (5%).
Out of 16 observed tags, the rank of INTJ is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.
The 10 most frequent INTJ lemmas: oh, inchaAllah, ô, wallah, paix, si_Dieu_le_veut, merci, 123, ah, InchaAllah
The 10 most frequent INTJ types: ya, nchalah, inchallah, nchallah, salam, 123, wallah, inchalah, merci, walah
The 10 most frequent ambiguous lemmas: paix (INTJ 39, NOUN 11, PROPN 1), si_Dieu_le_veut (INTJ 37, VERB 1), merci (INTJ 27, NOUN 1), félicitation (INTJ 11, ADJ 1, NOUN 1), 1 (NUM 16, INTJ 8), 2 (NUM 15, INTJ 8, PRON 1), 3 (NUM 9, INTJ 8, DET 1), salut (INTJ 7, NOUN 3), non (INTJ 7, PART 4, ADV 2), dommage (INTJ 4, NOUN 4, ADV 1)
The 10 most frequent ambiguous types: ya (INTJ 296, CCONJ 2, SCONJ 1, VERB 1), salam (INTJ 28, NOUN 2, PROPN 1), 1 (NUM 17, INTJ 9, ADJ 1, DET 1, NOUN 1), 2 (NUM 13, INTJ 9, ADP 3, ADJ 1, PRON 1), 3 (INTJ 9, NUM 9, DET 3), ha (INTJ 7, PRON 1), y (PRON 12, INTJ 7, VERB 1), in (INTJ 6, SCONJ 3), a (ADP 46, DET 41, VERB 29, AUX 17, INTJ 5, PROPN 1, X 1), mabrouk (INTJ 4, NOUN 1)
- ya
  - INTJ 296: on va gagné inchaalah ya rabi kon m3ana
  - CCONJ 2: al hadra le 7 ha khoya nechalkahe narabhoukome b 20 hena 3andna jamhoure ya narabhoukome ya temotou
  - SCONJ 1: alihada a ssabab kanat touwakafou el bakhirat li moudatin tawila fi el mawani bach idjiboulna sebbat el bay3 hakada houm el djazairiyoun ki yessam3ou wahed 3erbi ikounou ta3matelhoum mouslim kana bi l imkan woudjoud houloul oukhra douna el bay3 ya dra loukan n’koul ana 3arbiya wach imedouli yakhi yakh
  - VERB 1: alhe yarhamha oi ya g3alha mina al moaminine oi mine ashabe al janhe
- salam
- 1
  - NUM 17: widadi 4 / 1 hade l 3ame tal3a
  - INTJ 9: 1 2 3 vive algerian ;3ak ya rezki bey
  - ADJ 1: l haja 1 rak bayen makch masri w ll haja 2 l brizil w tlayen dartou m3ahoum score w l marikan ?????????????????????
  - DET 1: si koi wache rakom tahkiw andha 3 ans meli walat 1 milyar machi 400 mil rabi yahdik ya journaliste
  - NOUN 1: rah 3andana amal kbir f taahol w ntmana nkono l motaahil 1 w maroc le milleur 2eme bach ntal3o 7na w yahom nchalah bonn chonce
- 2
  - NUM 13: 2 / hada les coper nhar wahd okher ywili 3ala 3la l’ algérie c 0 oui
  - INTJ 9: 1 2 3 vive algerian ;3ak ya rezki bey
  - ADP 3: 3andek hak ya milat ta3mik 2 hadi hadra makanech kima tazawji b hbib 9alb sans chorot
  - ADJ 1: l haja 1 rak bayen makch masri w ll haja 2 l brizil w tlayen dartou m3ahoum score w l marikan ?????????????????????
  - PRON 1: ana chefet la sseila b 3ayeniya fihom les 2 rabi yekamel mais gaouaoui yestahel yel3eb m3a les grands li yel3bo f el champions ligue
- 3
- ha
  - INTJ 7: al hadra le 7 ha khoya nechalkahe narabhoukome b 20 hena 3andna jamhoure ya narabhoukome ya temotou
  - PRON 1: elah yarhem dhahaya el irhab el hamadji ha dhou houma e rdjal fi beladi ahhhhhh b el ahra 3and boutef je pose un simple question est ce que un couchon comme ce la il peut vivre avec nous dans notre societe ????? welahi manamenkoume ila goltou oui par ce que la solution d’ extermine hadhou el mahkloukate
- y
- in
- a
  - ADP 46: M3a boumediene jusqu’ a la mort
  - DET 41: allahoma la tohasibna bima yaf3aloho a tafihin
  - VERB 29: 3andou el hak makache football en algerie alors il a bien fait
  - AUX 17: حجج حجج حجج حجج حجج حجج comme il a dit n
  - INTJ 5: a 3ayeniya vive alg
  - PROPN 1: salam 3likoum khawti les algeriens nchallah ya rabi les verts yfarhouna f l a frique du sud w nroho le 2ème tour b rabi nchallah ad3ou m3ana ya r abi amine
  - X 1: ana dhad roj3 lamouchaia y3atik saha y a kader 02
- mabrouk
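Per-type tag distributions like the ones listed above can be tallied from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the function names `tag_counts_by_form` and `ambiguous_forms` are hypothetical, the sample sentence is invented toy data, and forms are compared case-sensitively, which is an assumption.

```python
from collections import Counter, defaultdict


def tag_counts_by_form(conllu_text):
    """Map each word form to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token line
        tok_id, form, upos = cols[0], cols[1], cols[3]
        if "-" in tok_id or "." in tok_id:
            continue  # skip multiword token ranges and empty nodes
        counts[form][upos] += 1
    return counts


def ambiguous_forms(counts, tag="INTJ"):
    """Keep only forms tagged `tag` at least once and with more than one tag."""
    return {f: c for f, c in counts.items() if tag in c and len(c) > 1}


# Invented toy data (hypothetical, not taken from the treebank).
rows = [
    ["1", "ya", "oh", "INTJ", "_", "_", "2", "discourse", "_", "_"],
    ["2", "rabi", "rabi", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "ya", "ya", "CCONJ", "_", "_", "4", "cc", "_", "_"],
    ["4", "salam", "paix", "INTJ", "_", "_", "2", "conj", "_", "_"],
]
sample = "# text = toy example\n" + "\n".join("\t".join(r) for r in rows) + "\n"

counts = tag_counts_by_form(sample)
ambiguous = ambiguous_forms(counts)
print(sorted(ambiguous))
```

Replacing column 1 (the form) with column 2 (the lemma) in the same loop would give the corresponding per-lemma tally.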
Morphology
The form / lemma ratio of INTJ is 1.695652 (the average of all parts of speech is 1.474223).
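The reported ratio is consistent with dividing the number of INTJ types (234) by the number of INTJ lemmas (138), both given at the top of this page. A small sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the INTJ summary on this page.
intj_ratio = form_lemma_ratio(234, 138)
print(f"{intj_ratio:.6f}")
```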
The 1st highest number of forms (28) was observed with the lemma “inchaAllah”: anchalah, anchallah, anchlah, enchala, enchalah, in, incha, incha’Allah, inchaalah, inchalah, inchallah, inchallahe, inchalllah, inshallah, n’challah, nachallah, ncahalah, nchaalh, nchalah, nchalahe, nchaleh, nchallah, nchallh, nchlah, nchlh, nchllh, nshalah, nshallah.
The 2nd highest number of forms (12) was observed with the lemma “oh”: a, aw, iooo, y, y’a, ya, ya3ni, yaaaaaa, yah, yakh, yaw, ye.
The 3rd highest number of forms (10) was observed with the lemma “ah”: a, ah, awah, ih, ya, yak, yakh, yekhi, yewwwwwwwwwwwwwww, yék.
INTJ occurs with 1 feature: Typo (21; 2% instances)
INTJ occurs with 1 feature-value pair: Typo=Yes
INTJ occurs with 2 feature combinations.
The most frequent feature combination is _ (no features; 997 tokens).
Examples: ya, nchalah, inchallah, nchallah, salam, 123, wallah, inchalah, merci, walah
Relations
INTJ nodes are attached to their parents using 10 different relations: discourse (969; 95% instances), fixed (11; 1% instances), parataxis (10; 1% instances), root (7; 1% instances), vocative (6; 1% instances), conj (5; 0% instances), dep (4; 0% instances), obj (3; 0% instances), flat (2; 0% instances), nsubj (1; 0% instances)
Parents of INTJ nodes belong to 11 different parts of speech: VERB (404; 40% instances), PROPN (267; 26% instances), NOUN (229; 22% instances), PRON (48; 5% instances), ADJ (32; 3% instances), INTJ (19; 2% instances), (7; 1% instances), DET (4; 0% instances), NUM (4; 0% instances), ADV (2; 0% instances), PART (2; 0% instances)
905 (89%) INTJ nodes are leaves.
84 (8%) INTJ nodes have one child.
24 (2%) INTJ nodes have two children.
5 (0%) INTJ nodes have three or more children.
The highest child degree of an INTJ node is 4.
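The child-degree figures above can be derived by counting, for each INTJ node, how many tokens in the same sentence name it as their head. The sketch below assumes each sentence has already been reduced to `(id, upos, head)` tuples; the function name `child_degree_histogram` and the toy sentence are hypothetical. The final lines only re-derive the rounded percentages from the counts reported on this page.

```python
from collections import Counter


def child_degree_histogram(sentences, tag="INTJ"):
    """Count nodes with the given UPOS tag by their number of children.

    sentences: iterable of sentences, each a list of (id, upos, head) tuples.
    """
    hist = Counter()
    for sent in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, upos, _ in sent:
            if upos == tag:
                hist[n_children[tok_id]] += 1
    return hist


# Invented toy sentence: token 1 is an INTJ leaf, token 3 is an INTJ with one child.
toy = [[(1, "INTJ", 2), (2, "VERB", 0), (3, "INTJ", 2), (4, "CCONJ", 3)]]
hist = child_degree_histogram(toy)

# Percentages recomputed from the counts reported on this page (1018 INTJ tokens).
reported = {"leaves": 905, "one child": 84, "two children": 24, "three or more": 5}
shares = {k: round(100 * v / 1018) for k, v in reported.items()}
print(dict(hist), shares)
```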
Children of INTJ nodes are attached using 15 different relations: cc (38; 25% instances), obl (29; 19% instances), goeswith (22; 15% instances), det (13; 9% instances), fixed (11; 7% instances), conj (7; 5% instances), vocative (7; 5% instances), amod (6; 4% instances), ccomp (4; 3% instances), punct (4; 3% instances), parataxis (3; 2% instances), flat (2; 1% instances), nmod (2; 1% instances), advmod (1; 1% instances), discourse (1; 1% instances)
Children of INTJ nodes belong to 11 different parts of speech: CCONJ (38; 25% instances), PRON (24; 16% instances), X (22; 15% instances), INTJ (19; 13% instances), DET (13; 9% instances), NOUN (11; 7% instances), ADJ (6; 4% instances), PROPN (6; 4% instances), VERB (6; 4% instances), PUNCT (4; 3% instances), ADV (1; 1% instances)