Text Analysis and dealing with grammar, tense in R

748 Views Asked by At

I am trying to do text analysis in R. I am able to do the frequency counts and wordcloud. But I could not figure out how to work with the words which are same but different tense such as "enjoy", "enjoyed". I want these words to count as single word "enjoy" rather than 2 separate words. Is there are way I can fix these words or change to present tense?

1

There are 1 best solutions below

0
Jeanette On

You can either use stemming as a pre-processing step or use the Quanteda package and employ a wildcard pattern match by specifying "enjoy*" to include variations such as "enjoyed" and "enjoying"