SOLR Faceting for special characters

119 Views Asked by At

I have multiple languages(English, German, Chinese, Japanese) in my collection and I would like to perform Faceting. Faceting works good for English and German but for chinese and japanese, I am not getting any facets. Can someone help me in solving this ?

<fieldType name="text_general_search_ja" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="false">
      <analyzer>
        <tokenizer class="solr.JapaneseTokenizerFactory" mode="search"/>
        <filter class="solr.JapaneseBaseFormFilterFactory"/>
        <filter class="solr.JapanesePartOfSpeechStopFilterFactory" tags="lang/stoptags_ja.txt" />
        <filter class="solr.CJKWidthFilterFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_ja.txt" />
        <filter class="solr.JapaneseKatakanaStemFilterFactory" minimumLength="4"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

Facet field name is catchall_asia-pacific_cn_zh and the type for the field name is text_general_search_ja. The configuration of text_general_search_ja is mentioned above

1

There are 1 best solutions below

0
Abhijit Bashetti On

Use of text field types for facets is not advisable.

If you want to achieve both, search on fields and faceting on the field on catchall_asia-pacific_cn_zh then extract it in 2 separate fields.

One fields as string field type which would be helpful in faceting and sorting.

Second as text field type and use these for search on the fields.

These are 2 different use case and hence would be good to have 2 separate fields.