How to do stemming in text mining Slovene texts in R

33 Views Asked by At

I am text-mining a series of interviews I performed in the Slovene language. After I transformed the vector file into a corpus, I tried this command to stem the words:

pupils<- tm_map(pupils, stemDocument(pupils, language = "slovene"))

However, the response was this:

Error in UseMethod("stemDocument", x) : no applicable method for 'stemDocument' applied to an object of class "c('VCorpus', 'Corpus')"

I could perform the intended text mining operations (ngrams) without the stemming process, but they will not be completely legit. I saw the post where a gentleman asked about stemming in Italian and was instructed to use the SnowballC package. Unfortunately, SnowballC does not incorporate Slovene, so there's that.

I will appreciate any help. Thank you. Janez

0

There are 0 best solutions below