How can I add stopwords to MeCab?


I want to add stopwords (like 'me' or 'you') to MeCab, but I can't find any information about stopwords in its manual.


There are 2 best solutions below

0
polm23 On

MeCab is a part-of-speech tagger; it doesn't do stopword removal.

You need to remove stopwords yourself by processing the output and looking at surface forms (the literal token), base forms (the lemmatized canonical form), or part of speech.
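As a rough sketch of that post-processing, the snippet below filters MeCab's text output in Python. It assumes the default IPAdic-style format (one "surface<TAB>feature,feature,..." line per token, with the part of speech in field 0 and the base form in field 6, ending in "EOS"); other dictionaries such as UniDic lay out the fields differently. The stopword and POS sets, the function names, and the hard-coded sample are all hypothetical. In practice the string would come from something like MeCab.Tagger().parse(text).

```python
# Hypothetical stopword lists; adjust them for your own data.
STOP_WORDS = {"私", "あなた"}           # matched against surface and base form
STOP_POS = {"助詞", "助動詞", "記号"}    # particles, auxiliary verbs, symbols

# Hard-coded stand-in for MeCab.Tagger().parse("私は本を読んだ"),
# assuming IPAdic-style output.
SAMPLE_OUTPUT = "\n".join([
    "私\t名詞,代名詞,一般,*,*,*,私,ワタシ,ワタシ",
    "は\t助詞,係助詞,*,*,*,*,は,ハ,ワ",
    "本\t名詞,一般,*,*,*,*,本,ホン,ホン",
    "を\t助詞,格助詞,*,*,*,*,を,ヲ,ヲ",
    "読ん\t動詞,自立,*,*,五段・マ行,連用タ接続,読む,ヨン,ヨン",
    "だ\t助動詞,*,*,*,特殊・タ,基本形,だ,ダ,ダ",
    "EOS",
])


def parse_tokens(mecab_output):
    """Turn MeCab text output into (surface, base_form, pos) tuples."""
    tokens = []
    for line in mecab_output.splitlines():
        if not line or line == "EOS":
            continue
        surface, _, feature_str = line.partition("\t")
        features = feature_str.split(",")
        pos = features[0]
        # Unknown words may lack a base form; fall back to the surface.
        if len(features) > 6 and features[6] != "*":
            base = features[6]
        else:
            base = surface
        tokens.append((surface, base, pos))
    return tokens


def remove_stopwords(tokens, stop_words=STOP_WORDS, stop_pos=STOP_POS):
    """Keep surfaces whose surface, base form, and POS are all allowed."""
    return [
        surface
        for surface, base, pos in tokens
        if surface not in stop_words
        and base not in stop_words
        and pos not in stop_pos
    ]


kept = remove_stopwords(parse_tokens(SAMPLE_OUTPUT))
```

Filtering on the base form as well as the surface means a single stopword entry also catches its inflected forms.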

0
SUM On

I don't think you need to add stopwords to MeCab itself. You can remove stopwords after MeCab has returned the tokenized data, either by pattern matching (e.g. x.replace("stopword", "") in Python) or by using POS tags (removing terms with specific tags).
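One caveat on the pattern-matching route: calling replace on the raw string also deletes the stopword wherever it occurs inside a longer word, so it is probably safer to compare whole tokens. A minimal sketch, where the stopword set and function name are hypothetical and a whitespace split stands in for MeCab's tokenization:

```python
STOPWORDS = {"me", "you"}  # hypothetical stopword list


def drop_stopwords(tokens, stopwords=STOPWORDS):
    """Remove tokens that exactly match a stopword."""
    return [tok for tok in tokens if tok not in stopwords]


raw = "give me some time"

# Substring replacement also damages "some" and "time".
naive = raw.replace("me", "")

# Whole-token comparison only removes the standalone "me".
# str.split() is a stand-in for the token list MeCab would give you.
filtered = drop_stopwords(raw.split())
```

Here filtered keeps "some" and "time" intact, while naive has lost the "me" inside both.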