I used to use SnowBallAnalyzer
to combine custom stop word filtering with basic stemming, but it has been deprecated. For e.g. in index config, I could easily specify:
IndexWriterConfig config = new IndexWriterConfig(Version.LUCENE_32,
new SnowballAnalyzer(Version.LUCENE_32, "name", stopSet));
where stopSet
is my custom list of stopwords.
How do I now create a single analyzer that would enable me filter stop words and do basic English stemming?
Thanks.
Use EnglishAnalyzer:
I'm a little confused on how your listed code does anything particularly useful, if you aren't passing a valid stemmer name into the SnowballAnalyzer constructor. Seems like it should throw an exception right around here:
Being that there is no stemmer called: "
org.tartarus.snowball.ext.nameStemmer
".