SYM: symbol
A symbol is a word-like entity that differs from ordinary words by form, function, or both.
We recognize as symbols:
- currency symbols: $
- mathematical operators: µg / m3
- '/' used as a separator: 2001 / 923 / CE
- emoticons and emojis: :-)
- URLs and emails
- hashtags: #
- at-mentions: @
The following are not symbols:
- Proper nouns with numbers and special characters: 130XE, DC10, DC-10 are tagged PROPN.
- Acronyms for proper nouns: UN, NATO are tagged as PROPN.
- Abbreviated words: Sig. (signore), kg (chilogrammo), km (chilometro), dott (dottore) are tagged NOUN.
- Characters used as bullets in itemized lists (*, •, ‣) are PUNCT.
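The two lists above can be approximated by a small rule-based check. The following is only a minimal sketch and not part of the guidelines: the function name `looks_like_sym`, the `is_list_bullet` flag, and all regular expressions are hypothetical, and the patterns cover only simple cases (for example, multi-character emoticons such as ♥‿♥ are not handled). A real tagger would also need context to separate '/' as an operator or separator from other uses.

```python
import re
import unicodedata

# All patterns below are illustrative assumptions, not rules from the guidelines.
URL_OR_EMAIL = re.compile(r"^(https?://\S+|www\.\S+|[\w.+-]+@[\w-]+(\.[\w-]+)+)$")
HASHTAG_OR_MENTION = re.compile(r"^[#@]\w*$")
EMOTICON = re.compile(r"^[:;=8][-^']?[()\[\]DPpOo/\\|*]$")

BULLETS = {"*", "•", "‣"}
# Characters listed as symbols whose Unicode category is punctuation.
EXTRA_SYMBOL_CHARS = {"%", "§", "/"}
# Unicode general categories: currency, math, and other symbols.
SYMBOL_CATEGORIES = {"Sc", "Sm", "So"}


def looks_like_sym(token: str, is_list_bullet: bool = False) -> bool:
    """Return True if the token resembles one of the symbol classes above."""
    # Bullets in itemized lists are PUNCT, not SYM.
    if is_list_bullet and token in BULLETS:
        return False
    # URLs, emails, hashtags, at-mentions, and simple ASCII emoticons.
    if (URL_OR_EMAIL.match(token)
            or HASHTAG_OR_MENTION.match(token)
            or EMOTICON.match(token)):
        return True
    # Single characters: currency signs, operators, emojis, and a few extras.
    if len(token) == 1:
        return (token in EXTRA_SYMBOL_CHARS
                or unicodedata.category(token) in SYMBOL_CATEGORIES)
    # Everything else (130XE, DC-10, NATO, Sig., kg, ...) is left to other tags.
    return False
```

With this sketch, tokens such as `$`, `:-)`, `#universaldependencies`, and `@johndoe` are accepted, while `DC-10`, `NATO`, and `kg` are rejected and remain candidates for PROPN or NOUN.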
Corresponding language-specific part-of-speech tags
SYM: Symbol
X: Other
Examples
- $, %, §, ©
- +, −, ×, ÷, =, <, >
- :), ♥‿♥, 😝
- #universaldependencies
- @johndoe