Here is the case. Let say we have dataset containing messages from a chat and we want to do a topic modeling on it (few topics for example).
Let us assume, that the topic A might be (and should) represented by few words but I know (let say from some external source), that all messages that contain a word word_to_boost should be predicted as A-belonging. All preprocessing and the bag of words is done. Is there any possibility to "boost" the word word_to_boost somehow, to suggest to the model putting all messages withing such word into the A topic? If so, is that recommended?
I assumed it might be done around TF-IDF but maybe there is a different approach?
Thanks in advance!
There's a good amount of confusion here: