home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: X

There are 182 X lemmas (1%), 182 X types (1%) and 246 X tokens (0%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country

The 10 most frequent X types: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country

The 10 most frequent ambiguous lemmas: corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), on-line (X 3, ADJ 2, ADV 2), premium (X 2, ADJ 1), acusar (VERB 17, X 1), and (PROPN 4, X 1), de (ADP 11129, X 1), denunciar (VERB 17, X 1), drag (NOUN 2, X 1)

The 10 most frequent ambiguous types: corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), on-line (X 3, ADJ 2, ADV 2), premium (X 2, ADJ 1), and (PROPN 4, X 1), arepas (NOUN 1, X 1), blockbusters (NOUN 2, X 1), de (ADP 11029, X 1), drag (NOUN 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.496159).

The 1st highest number of forms (1) was observed with the lemma “Bad”: Bad.

The 2nd highest number of forms (1) was observed with the lemma “Italy”: Italy.

The 3rd highest number of forms (1) was observed with the lemma “acusar”: acusar.

X occurs with 1 features: Foreign (231; 94% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (231 tokens). Examples: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country

Relations

X nodes are attached to their parents using 17 different relations: nmod (72; 29% instances), flat:foreign (68; 28% instances), conj (27; 11% instances), obj (25; 10% instances), nsubj (20; 8% instances), obl (11; 4% instances), appos (4; 2% instances), parataxis (4; 2% instances), root (3; 1% instances), discourse (2; 1% instances), flat (2; 1% instances), nsubj:pass (2; 1% instances), xcomp (2; 1% instances), advcl (1; 0% instances), cc (1; 0% instances), ccomp (1; 0% instances), ccomp:speech (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: X (89; 36% instances), NOUN (78; 32% instances), VERB (58; 24% instances), PROPN (7; 3% instances), ADJ (3; 1% instances), ADV (3; 1% instances), (3; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), PRON (1; 0% instances)

84 (34%) X nodes are leaves.

29 (12%) X nodes have one child.

50 (20%) X nodes have two children.

83 (34%) X nodes have three or more children.

The highest child degree of a X node is 7.

Children of X nodes are attached using 20 different relations: punct (123; 26% instances), det (81; 17% instances), flat:foreign (68; 15% instances), case (58; 12% instances), nmod (28; 6% instances), conj (27; 6% instances), appos (22; 5% instances), cc (16; 3% instances), amod (9; 2% instances), acl:relcl (6; 1% instances), advmod (6; 1% instances), acl (5; 1% instances), cop (5; 1% instances), nsubj (3; 1% instances), nummod (3; 1% instances), flat (2; 0% instances), mark (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (123; 26% instances), X (89; 19% instances), DET (82; 18% instances), ADP (59; 13% instances), NOUN (40; 9% instances), CCONJ (16; 3% instances), PROPN (14; 3% instances), VERB (12; 3% instances), ADJ (11; 2% instances), ADV (5; 1% instances), AUX (5; 1% instances), NUM (4; 1% instances), PRON (3; 1% instances), SYM (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)