
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: SCONJ

There are 4 SCONJ lemmas (0%), 3 SCONJ types (0%) and 5457 SCONJ tokens (2%). Out of 17 observed tags, the rank of SCONJ is: 15 in number of lemmas, 17 in number of types and 11 in number of tokens.
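
Counts of this kind can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_stats` and the English toy rows in `SAMPLE` are invented for illustration, and it assumes that a "type" is a distinct, case-sensitive word form.

```python
from collections import Counter

def upos_stats(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = Counter(), Counter(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges ("4-5") and empty nodes ("4.1"):
        # only plain integer IDs are syntactic words.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            forms[cols[1]] += 1   # column 2 is FORM
            lemmas[cols[2]] += 1  # column 3 is LEMMA
    return len(lemmas), len(forms), tokens

# Hypothetical toy data, not taken from UD_Arabic-PADT.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "said", "say", "VERB", "_", "_", "0", "root", "_", "_"],
    ["2", "that", "that", "SCONJ", "_", "_", "3", "mark", "_", "_"],
    ["3", "left", "leave", "VERB", "_", "_", "1", "ccomp", "_", "_"],
    ["4-5", "ifthat", "_", "_", "_", "_", "_", "_", "_", "_"],
    ["4", "if", "if", "SCONJ", "_", "_", "3", "mark", "_", "_"],
    ["5", "that", "that", "SCONJ", "_", "_", "3", "mark", "_", "_"],
])

print(upos_stats(SAMPLE, "SCONJ"))
```

Dividing each of the three numbers by the corresponding treebank-wide total gives the percentages shown in parentheses.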

The 4 SCONJ lemmas, most frequent first: أَنَّ، أَن، إِنَّ، إِن

The 3 SCONJ types, most frequent first: أن، ان، إن

The 2 ambiguous lemmas (lemmas that also occur with other tags): إِنَّ (SCONJ 945, PART 212), إِن (SCONJ 20, X 13)

The 3 ambiguous types (word forms that also occur with other tags): أن (SCONJ 2919, CCONJ 4, X 3, VERB 1), ان (SCONJ 1956, PART 30, X 11, VERB 1), إن (SCONJ 582, PART 182, X 3)

Morphology

The form / lemma ratio of SCONJ is 0.750000 (the average of all parts of speech is 1.761966).
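
The value 0.750000 is consistent with dividing the 3 SCONJ types by the 4 SCONJ lemmas reported at the top of this page. That reading is inferred from the numbers rather than stated by the page. A ratio below 1 is possible here because the same undiacritized word form is shared by several lemmas. The arithmetic, as a small sketch:

```python
# Counts copied from the summary at the top of this page.
n_types = 3    # distinct SCONJ word forms
n_lemmas = 4   # distinct SCONJ lemmas

# Assumption: the ratio is simply types divided by lemmas.
ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 0.750000
```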

The highest number of forms (3) was observed with the lemma “إِنَّ”: أن, إن, ان.

The second-highest number of forms (2) was observed with the lemma “أَن”: أن, ان.

The third-highest number of forms (2) was observed with the lemma “أَنَّ”: أن, ان.

SCONJ does not occur with any features.

Relations

SCONJ nodes are attached to their parents using 12 different relations: mark (5179; 95% instances), cc (133; 2% instances), fixed (98; 2% instances), case (11; 0% instances), conj (11; 0% instances), obj (10; 0% instances), dep (4; 0% instances), nmod (3; 0% instances), root (3; 0% instances), obl:arg (2; 0% instances), parataxis (2; 0% instances), nsubj (1; 0% instances)
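
The percentages can be checked against the raw counts. The sketch below uses the relation counts listed above and assumes percentages are rounded to the nearest integer, which matches the published figures (truncation would give 94% for mark). The variable names are illustrative only.

```python
# Relation counts for SCONJ nodes, copied from the list above.
counts = {
    "mark": 5179, "cc": 133, "fixed": 98, "case": 11, "conj": 11,
    "obj": 10, "dep": 4, "nmod": 3, "root": 3, "obl:arg": 2,
    "parataxis": 2, "nsubj": 1,
}

# The counts should add up to the number of SCONJ tokens (5457).
total = sum(counts.values())

# Assumption: each share is rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in counts.items()}

for rel, n in counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

Relations with fewer than about 27 instances round down to 0%, which is why most of the tail is reported as 0%.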

Parents of SCONJ nodes belong to 13 different parts of speech: VERB (4112; 75% instances), NOUN (478; 9% instances), ADJ (381; 7% instances), X (195; 4% instances), ADV (85; 2% instances), ADP (44; 1% instances), PRON (43; 1% instances), CCONJ (39; 1% instances), PART (31; 1% instances), DET (23; 0% instances), NUM (15; 0% instances), SCONJ (8; 0% instances), no part of speech, i.e. the artificial root node (3; 0% instances)

5087 (93%) SCONJ nodes are leaves.

340 (6%) SCONJ nodes have one child.

13 (0%) SCONJ nodes have two children.

17 (0%) SCONJ nodes have three or more children.

The highest child degree of a SCONJ node is 6.
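
The child-degree figures above can be derived from the HEAD column alone: a node's degree is the number of other nodes that name it as their head. The sketch below is a hedged illustration, with a hypothetical helper `child_degree_histogram` and an invented five-node sentence. It also checks that the four published degree classes cover all 5457 SCONJ tokens.

```python
from collections import Counter

def child_degree_histogram(rows, upos):
    """rows: (id, upos, head) triples for one sentence.
    Returns a Counter mapping child count -> number of nodes with that tag."""
    n_children = Counter(head for _, _, head in rows)
    # A Counter yields 0 for nodes that never appear as a head (leaves).
    return Counter(n_children[node_id] for node_id, tag, _ in rows if tag == upos)

# Hypothetical sentence: node 2 is a leaf SCONJ, node 4 is a SCONJ with one child.
rows = [
    (1, "VERB", 0),
    (2, "SCONJ", 3),
    (3, "VERB", 1),
    (4, "SCONJ", 1),
    (5, "PRON", 4),
]
histogram = child_degree_histogram(rows, "SCONJ")
print(dict(histogram))

# Consistency check on the published figures: leaves, one child,
# two children, three or more children.
published_total = 5087 + 340 + 13 + 17
print(published_total)  # 5457
```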

Children of SCONJ nodes are attached using 14 different relations: fixed (334; 76% instances), ccomp (32; 7% instances), nsubj (18; 4% instances), cc (14; 3% instances), punct (12; 3% instances), obl (9; 2% instances), conj (5; 1% instances), mark (5; 1% instances), obj (3; 1% instances), case (2; 0% instances), advmod (1; 0% instances), advmod:emph (1; 0% instances), dep (1; 0% instances), dislocated (1; 0% instances)

Children of SCONJ nodes belong to 12 different parts of speech: PRON (340; 78% instances), VERB (33; 8% instances), CCONJ (17; 4% instances), NOUN (16; 4% instances), PUNCT (12; 3% instances), SCONJ (8; 2% instances), ADJ (4; 1% instances), ADP (2; 0% instances), ADV (2; 0% instances), DET (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)