Data Analysis – Modeling and Parsing

4. hashtags have been split into multiple tokens using the Python library ”compound-word-splitter”, 5. apostrophes have been considered as token separators, 6. tokens composed by digits characters have been replaced with the token NUM, 7. tokens corresponding to … ................
................