What is the config options to setup what words/letters are indexed
-
For example to index “-” character and allow its search, what is the related config option name?
-
For example to make testword1234 split up to “testword” and “1234” and not a single word, what is the related config option name?
Thank you.
Sergey
2
charset_table Manticore Search Manual
For example to make testword1234 split up to “testword” and “1234” and not a single word, what is the related config option name?
No option for it. ngram_chars (Manticore Search Manual) may be helpful, but it won’t split testword1234
into only 2 tokens.
It not useful in all situations, but there is a tool that might help with pre-processing data to split into token words
https://manual.manticoresearch.com/Miscellaneous_tools#wordbreaker