Treebank Statistics: UD_Western_Sierra_Puebla_Nahuatl-ITML: POS Tags: SCONJ
There are 37 SCONJ lemmas (2%), 37 SCONJ types (1%) and 292 SCONJ tokens (3%).
Out of 15 observed tags, the rank of SCONJ is: 8 in number of lemmas, 9 in number of types and 9 in number of tokens.
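The lemma, type and token counts and the resulting ranks can be reproduced with a few lines of Python. This is only a sketch: the function names `tag_stats` and `rank` are hypothetical, the input is assumed to be already reduced to `(form, lemma, upos)` triples, and the toy data is invented for illustration rather than taken from the treebank.

```python
from collections import defaultdict


def tag_stats(words):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for (form, lemma, upos) triples."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in words:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(stats, tag, index):
    """1-based rank of `tag` by column `index` (0 = lemmas, 1 = types, 2 = tokens).

    Ties are broken by first appearance, which is an assumption of this sketch.
    """
    ordered = sorted(stats, key=lambda t: stats[t][index], reverse=True)
    return ordered.index(tag) + 1


# Toy input, invented for illustration only.
TOY_WORDS = [
    ("que", "que", "SCONJ"),
    ("porke", "porque", "SCONJ"),
    ("porque", "porque", "SCONJ"),
    ("n", "in", "DET"),
    ("n", "in", "DET"),
    ("n", "in", "DET"),
    ("n", "in", "DET"),
]

stats = tag_stats(TOY_WORDS)
print(stats["SCONJ"], rank(stats, "SCONJ", 0), rank(stats, "SCONJ", 2))
```

In the toy data SCONJ has more lemmas than DET but fewer tokens, so its rank differs by column, as it does in the figures above.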
The 10 most frequent SCONJ lemmas: que, para, porque, tla, in, cuando, ihcuac, como, nic, tlen
The 10 most frequent SCONJ types: que, para, porque, tla, n, cuando, ihcuac, como, nic, tlen
The 10 most frequent ambiguous lemmas: que (SCONJ 39, ADP 2, PRON 2), para (SCONJ 37, ADP 35), porque (SCONJ 36, ADV 1), in (DET 628, SCONJ 24, ADJ 1, X 1), cuando (SCONJ 15, ADV 4), ihcuac (SCONJ 13, ADV 4), como (SCONJ 13, ADP 1, ADV 1), nic (SCONJ 12, ADV 3), tlen (PRON 24, SCONJ 9, DET 1), ijkwak (SCONJ 6, ADV 2)
The 10 most frequent ambiguous types: que (SCONJ 38, ADP 2, PRON 2), para (SCONJ 37, ADP 34), n (DET 411, SCONJ 24, X 1), cuando (SCONJ 14, ADV 4), ihcuac (SCONJ 7, ADV 3), como (SCONJ 11, ADP 1, ADV 1), nic (SCONJ 11, ADV 3), tlen (PRON 51, SCONJ 10, DET 1), ijkwak (SCONJ 4, ADV 1), keh (SCONJ 5, INTJ 1)
- que
- para
- n
- cuando
- ihcuac
- como
  - SCONJ 11: Después telpukatl yuwi n bohtlah , como n ichah katki itenko n bohtlah .
  - ADP 1: Techtrataroa kwale n gente , gente ompa amables , amo malos , simplemente como notewah o semopartaroh kwale .
  - ADV 1: Pero como a veces México es cimi difícil como para ompa cechanchihuaz o para cetiquitiz ompa ye motlani cimi tzocopintzi tomen , huan nochi patiyo .
- nic
- tlen
- ijkwak
- keh
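An ambiguous type is a word form that occurs with more than one tag. A minimal sketch of how such a list could be computed is shown below. The function name `ambiguous_types` is hypothetical, and the toy input merely mirrors the counts reported above for "como" (SCONJ 11, ADP 1, ADV 1); the two "tla" tokens are invented to show an unambiguous form being filtered out.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens):
    """Map each form seen with more than one tag to its (tag, count) list."""
    by_type = defaultdict(Counter)
    for form, upos in tagged_tokens:
        by_type[form][upos] += 1
    # Keep only forms with at least two distinct tags, most frequent tag first.
    return {form: tags.most_common() for form, tags in by_type.items() if len(tags) > 1}


# Toy input: the "como" counts follow the list above, "tla" is invented.
TOY_TOKENS = (
    [("como", "SCONJ")] * 11
    + [("como", "ADP"), ("como", "ADV")]
    + [("tla", "SCONJ")] * 2
)

for form, tags in ambiguous_types(TOY_TOKENS).items():
    print(form, ", ".join(f"{tag} {n}" for tag, n in tags))
```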
Morphology
The form / lemma ratio of SCONJ is 1.000000 (the average of all parts of speech is 1.474576).
The 1st highest number of forms (3) was observed with the lemma “porque”: Porkeh, porke, porque.
The 2nd highest number of forms (2) was observed with the lemma “como”: como, komoh.
The 3rd highest number of forms (2) was observed with the lemma “que”: ken, que.
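The sketch below shows one way to compute forms per lemma and a form / lemma ratio. It assumes the ratio is the number of distinct forms divided by the number of distinct lemmas, which agrees with the 37 types and 37 lemmas reported for SCONJ but is not confirmed by this page. The function names are hypothetical. The input contains only the three lemmas listed above, so its ratio is not the treebank-wide figure.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct forms under their lemma for (form, lemma) pairs."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    pairs = list(pairs)
    n_forms = len({form for form, _ in pairs})
    n_lemmas = len({lemma for _, lemma in pairs})
    return n_forms / n_lemmas


# Forms and lemmas as listed in the three lines above.
PAIRS = [
    ("Porkeh", "porque"), ("porke", "porque"), ("porque", "porque"),
    ("como", "como"), ("komoh", "como"),
    ("ken", "que"), ("que", "que"),
]

grouped = forms_per_lemma(PAIRS)
for lemma in sorted(grouped, key=lambda l: len(grouped[l]), reverse=True):
    print(lemma, len(grouped[lemma]), sorted(grouped[lemma]))
print(form_lemma_ratio(PAIRS))
```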
SCONJ occurs with 1 feature: Foreign (45; 15% instances)
SCONJ occurs with 1 feature-value pair: Foreign=Yes
SCONJ occurs with 2 feature combinations.
The most frequent feature combination is _ (247 tokens).
Examples: para, que, tla, n, porque, ihcuac, nic, tlen, cuando, ijkwak
Relations
SCONJ nodes are attached to their parents using 10 different relations: mark (259; 89% instances), fixed (13; 4% instances), case (6; 2% instances), cc (3; 1% instances), nsubj (3; 1% instances), obj (3; 1% instances), advcl (2; 1% instances), discourse (1; 0% instances), flat (1; 0% instances), root (1; 0% instances)
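As a consistency check, the relation counts above sum to the 292 SCONJ tokens, and rounding each share to the nearest whole percent appears to reproduce the listed percentages. The short sketch below does that arithmetic; the function name `percentages` is hypothetical, and the rounding rule is an assumption inferred from the numbers, not documented behaviour.

```python
# Relation counts copied from the line above.
RELATION_COUNTS = {
    "mark": 259, "fixed": 13, "case": 6, "cc": 3, "nsubj": 3,
    "obj": 3, "advcl": 2, "discourse": 1, "flat": 1, "root": 1,
}


def percentages(counts):
    """Share of each relation, rounded to the nearest whole percent (assumed rule)."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


total = sum(RELATION_COUNTS.values())
print(total)
for rel, pct in percentages(RELATION_COUNTS).items():
    print(f"{rel} ({RELATION_COUNTS[rel]}; {pct}% instances)")
```

Because of rounding, a relation seen once out of 292 tokens is reported as 0%, although it does occur.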
Parents of SCONJ nodes belong to 9 different parts of speech: VERB (236; 81% instances), ADJ (14; 5% instances), NOUN (14; 5% instances), ADV (10; 3% instances), ADP (9; 3% instances), PRON (6; 2% instances), DET (1; 0% instances), PROPN (1; 0% instances), (1; 0% instances)
289 (99%) SCONJ nodes are leaves.
3 (1%) SCONJ nodes have one child.
The highest child degree of a SCONJ node is 1.
Children of SCONJ nodes are attached using 2 different relations: punct (2; 67% instances), acl:relcl (1; 33% instances)
Children of SCONJ nodes belong to 2 different parts of speech: PUNCT (2; 67% instances), VERB (1; 33% instances)
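The leaf and child-degree figures can be derived from the HEAD column of a CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout and skipping multiword-token ranges and empty nodes; the function name `child_degrees` is hypothetical, and the sample sentence is invented for illustration rather than taken from the treebank.

```python
from collections import Counter

# Toy sentence, invented for illustration only (10 tab-separated columns).
SAMPLE = "\n".join([
    "# toy sentence, not from the treebank",
    "1\tA\ta\tVERB\t_\t_\t0\troot\t_\t_",
    "2\tque\tque\tSCONJ\t_\t_\t3\tmark\t_\t_",
    "3\tB\tb\tVERB\t_\t_\t1\tadvcl\t_\t_",
    "4\tcomo\tcomo\tSCONJ\t_\t_\t1\tdiscourse\t_\t_",
    "5\t,\t,\tPUNCT\t_\t_\t4\tpunct\t_\t_",
])


def child_degrees(conllu_text, upos="SCONJ"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    sentence = []  # (id, upos, head) for the current sentence

    def flush():
        children = Counter(head for _, _, head in sentence)
        degrees.extend(children[node_id] for node_id, tag, _ in sentence if tag == upos)
        sentence.clear()

    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            flush()  # a blank line ends a sentence
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees


degrees = child_degrees(SAMPLE)
print(Counter(degrees), max(degrees))
```

A degree of 0 marks a leaf, so the number of leaves is the count of zeros in the returned list.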