I have a problem with my german stemming SnowballPorterFilterFactory. I just can´t get it to work. Neither in english or germna. I´m using this AnalzyderDef.
@AnalyzerDef(name = "customAnalyzer", charFilters = {
@CharFilterDef(factory = HTMLStripCharFilterFactory.class)},
tokenizer = @TokenizerDef(factory = StandardTokenizerFactory.class),
filters = {
@TokenFilterDef(factory = TrimFilterFactory.class),
@TokenFilterDef(factory = LowerCaseFilterFactory.class),
@TokenFilterDef(factory = SnowballPorterFilterFactory.class, params = {
@Parameter(name = "language", value = "German2")
}),
@TokenFilterDef(factory = WordDelimiterFilterFactory.class, params = {
@Parameter(name = "splitOnCaseChange", value = "1"),
@Parameter(name = "splitOnNumerics", value = "1"),
@Parameter(name = "generateWordParts", value = "1"),
@Parameter(name = "preserveOriginal", value = "1")
}),
@TokenFilterDef(factory = EdgeNGramFilterFactory.class, params = {
@Parameter(name = "minGramSize", value = "5"),
@Parameter(name = "maxGramSize", value = "15"),
@Parameter(name = "side", value = "front")
})
}))
I tried it with walk / walking / walks like mentioned here. Is maybe the combination of the filters the problem?