German stemming with SnowballPorterFilterFactory


I have a problem with German stemming using SnowballPorterFilterFactory. I just can't get it to work, neither in English nor in German. I'm using this AnalyzerDef:

@AnalyzerDef(name = "customAnalyzer",
    charFilters = {
        // strip HTML markup before tokenizing
        @CharFilterDef(factory = HTMLStripCharFilterFactory.class)
    },
    tokenizer = @TokenizerDef(factory = StandardTokenizerFactory.class),
    filters = {
        @TokenFilterDef(factory = TrimFilterFactory.class),
        @TokenFilterDef(factory = LowerCaseFilterFactory.class),
        @TokenFilterDef(factory = SnowballPorterFilterFactory.class, params = {
            @Parameter(name = "language", value = "German2")
        }),
        @TokenFilterDef(factory = WordDelimiterFilterFactory.class, params = {
            @Parameter(name = "splitOnCaseChange", value = "1"),
            @Parameter(name = "splitOnNumerics", value = "1"),
            @Parameter(name = "generateWordParts", value = "1"),
            @Parameter(name = "preserveOriginal", value = "1")
        }),
        // emit front prefixes of 5 to 15 characters
        @TokenFilterDef(factory = EdgeNGramFilterFactory.class, params = {
            @Parameter(name = "minGramSize", value = "5"),
            @Parameter(name = "maxGramSize", value = "15"),
            @Parameter(name = "side", value = "front")
        })
    })

I tested it with walk / walking / walks, as mentioned here. Could the combination of filters be the problem?
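One way to look into the filter-combination question is to work out by hand what the last filter does to short tokens. The sketch below is plain Java, not Lucene code: `EdgeNGramSketch` and `edgeNGrams` are hypothetical names, and it assumes that a front edge n-gram filter emits nothing for a token shorter than minGramSize. That matches my understanding of older Lucene versions, but it should be checked against the version actually in use.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a front edge n-gram filter; NOT the Lucene implementation.
class EdgeNGramSketch {

    // Returns the prefixes of `token` with lengths minGram..maxGram.
    // Assumption: a token shorter than minGram yields no output at all.
    static List<String> edgeNGrams(String token, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int longest = Math.min(maxGram, token.length());
        for (int len = minGram; len <= longest; len++) {
            grams.add(token.substring(0, len));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Same sizes as in the analyzer definition: minGramSize = 5, maxGramSize = 15.
        System.out.println(edgeNGrams("walk", 5, 15));    // prints []
        System.out.println(edgeNGrams("walking", 5, 15)); // prints [walki, walkin, walking]
    }
}
```

If that assumption holds, any token that is four characters or shorter after stemming, such as "walk", would produce no terms with minGramSize = 5, regardless of whether the stemmer worked. Testing the stemmer with the EdgeNGramFilterFactory temporarily removed would separate the two effects.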
