Text Analytics Fundamentals - Text Analytics with R | Learn Data Science

Originally published at: https://tutorials.datasciencedojo.com/text-analytics-fundamentals/

Text analytics fundamentals covers:

– The importance of splitting data in to training and test datasets
– Stratified sampling of imbalanced data using the caret package
– Representing text data for the purposes of machine learning
– Introduction to tokenization, stop words, and stemming
– The bag-of-words model
– Considerations for data pre-processing

Kaggle Dataset can be found here

The data and R code used in this series is available here

(228)