Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: SYM
There are 1502 SYM
lemmas (16%), 1502 SYM
types (13%) and 4438 SYM
tokens (5%).
Out of 16 observed tags, the rank of SYM
is: 4 in number of lemmas, 5 in number of types and 8 in number of tokens.
The 10 most frequent SYM
lemmas: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x
The 10 most frequent SYM
types: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x
The 10 most frequent ambiguous lemmas: - (PUNCT 1787, SYM 329), + (SYM 253, PUNCT 1), x (SYM 18, NOUN 2, ADP 1), =) (SYM 20, PUNCT 1), ’ (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 3, SYM 2), # (PROPN 2, SYM 1)
The 10 most frequent ambiguous types: - (PUNCT 1787, SYM 329), + (SYM 253, ADV 10, PUNCT 1), x (SYM 18, ADP 1), ’ (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 2, SYM 2), # (PROPN 2, SYM 1), * (PUNCT 12, SYM 1)
- -
- +
- SYM 253: Petr4 15,68 + 2,35 % Cerveró falando sem parar … #CPIdaPTbras
- ADV 10: ITUB4 ja negociou + de 300 M ? Ta certo meu sistema aqui ? @ferrisss @dfittarelli @JPedro_Sullivan
- PUNCT 1: > > > Cemig ( CMIG4 ) : Julgamento de o mandado sobre Jaguara é suspenso > > > ( + ) Cemig ( CMIG4 ) : Julgamento de o mandado … http://t.co/M9A5yw7dU0
- x
- ’
- >
- =>
- WW
- h
- #
- *
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.238049).
The 1st highest number of forms (1) was observed with the lemma “#”: #.
The 2nd highest number of forms (1) was observed with the lemma “$”: $.
The 3rd highest number of forms (1) was observed with the lemma “\(”: <em>\)</em>.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 19 different relations: parataxis (1700; 38% instances), nmod (1000; 23% instances), advmod (587; 13% instances), obl (529; 12% instances), obj (252; 6% instances), discourse (126; 3% instances), conj (105; 2% instances), appos (43; 1% instances), root (31; 1% instances), orphan (18; 0% instances), cc (14; 0% instances), case (10; 0% instances), nsubj (9; 0% instances), advcl (4; 0% instances), flat:name (3; 0% instances), reparandum (3; 0% instances), acl (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances)
Parents of SYM
nodes belong to 12 different parts of speech: VERB (1667; 38% instances), NOUN (1245; 28% instances), PROPN (669; 15% instances), NUM (592; 13% instances), SYM (145; 3% instances), ADJ (41; 1% instances), (31; 1% instances), ADV (28; 1% instances), PRON (8; 0% instances), X (7; 0% instances), AUX (4; 0% instances), INTJ (1; 0% instances)
2374 (53%) SYM
nodes are leaves.
1004 (23%) SYM
nodes have one child.
697 (16%) SYM
nodes have two children.
363 (8%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 13.
Children of SYM
nodes are attached using 21 different relations: nummod (1666; 45% instances), case (779; 21% instances), punct (600; 16% instances), nmod (140; 4% instances), conj (105; 3% instances), det (98; 3% instances), parataxis (96; 3% instances), nsubj (53; 1% instances), cc (42; 1% instances), advmod (35; 1% instances), acl (32; 1% instances), cop (19; 1% instances), appos (10; 0% instances), discourse (10; 0% instances), vocative (8; 0% instances), amod (6; 0% instances), obl (4; 0% instances), mark (3; 0% instances), obj (3; 0% instances), acl:relcl (1; 0% instances), fixed (1; 0% instances)
Children of SYM
nodes belong to 16 different parts of speech: NUM (1691; 46% instances), ADP (779; 21% instances), PUNCT (600; 16% instances), SYM (145; 4% instances), PROPN (119; 3% instances), NOUN (110; 3% instances), DET (98; 3% instances), VERB (48; 1% instances), CCONJ (41; 1% instances), ADV (29; 1% instances), AUX (20; 1% instances), X (11; 0% instances), ADJ (10; 0% instances), SCONJ (4; 0% instances), INTJ (3; 0% instances), PRON (3; 0% instances)