Question about how the sentimentr lexicon dictionary was built

238 Views Asked by At

I have used the Sentimentr package to do some sentiment analysis because it includes valence shifters. However I cannot find how this lexicon lexicon::hash_sentiment_jockers_rinker was built, how individual words were scored. From what I understand, the lexicon was originally exported by the syuzhet and is a combination of AFFIN, bing, nrc and syuzhet. Could someone help me to understand how individual words in the lexicon were calculated?

Thanks!

1

There are 1 best solutions below

3
On

The NEWS file on github says this for version "2.1.0 - 2.2.3":

The default sentiment sentiment lookup table used within sentimentr is now lexicon::hash_sentiment_jockers_rinker, a combined and augmented version of lexicon::hash_sentiment_jockers (Jockers, 2017) & Rinker's augmented lexicon::hash_sentiment_huliu (Hu & Liu, 2004) sentiment lookup tables.

It doesn't state how it was "combined and augmented", so if that was the essence of your question this might not be much help. In that case I'd suggest studying git history to see where the change was introduced, as the commit message, or source code comments, might explain the details.