What is the config options to setup what words/letters are indexed

What is the config options to setup what words/letters are indexed

For example to index “-” character and allow its search, what is the related config option name?

For example to make testword1234 split up to “testword” and “1234” and not a single word, what is the related config option name?

Thank you.

charset_table https://mnt.cr/charset_table

For example to make testword1234 split up to “testword” and “1234” and not a single word, what is the related config option name?

No option for it. ngram_chars (http://mnt.cr/ngram_chars) may be helpful, but it won’t split testword1234 into only 2 tokens.

It not useful in all situations, but there is a tool that might help with pre-processing data to split into token words