xpt
1
Hi,
If I want to use Manticore Search for Chinese text searching, what should I do?
I searched the Internet for an answer and came upon the following two articles,
which are basically the same: they stress the difficulties, but also show that it is possible.
So the problem is, how can I make it happen on my end?
Thanks
xpt
2
OK, it has built-in support:
https://manual.manticoresearch.com/Installation/Debian_and_Ubuntu
To enable CJK tokenization support the official packages contain binaries with embedded ICU library and include ICU data file.
https://manual.manticoresearch.com/Creating_an_index/NLP_and_tokenization/Supported_languages
Manticore has built-in support for indexing CJK texts
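For anyone landing here later, a rough sketch of what a table definition using that built-in support might look like. The table name docs_zh and the field name content are made up for illustration, and the charset_table = 'cjk' / morphology = 'icu_chinese' settings are my reading of the Supported languages page linked above, so please check them against your Manticore version.

```python
def cjk_table_sql(table: str, field: str = "content") -> str:
    """Build a CREATE TABLE statement for a table with ICU-based Chinese segmentation.

    The table and field names are hypothetical; the two settings are assumed
    from the Manticore manual and may differ between versions.
    """
    return (
        f"CREATE TABLE {table}({field} text) "
        "charset_table = 'cjk' morphology = 'icu_chinese'"
    )


if __name__ == "__main__":
    # Print the statement so it can be pasted into a SQL client.
    print(cjk_table_sql("docs_zh"))
```

The printed statement is meant to be run from a MySQL-protocol client connected to Manticore (port 9306 by default).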
xpt
3
Hmm… however, I’m having a problem making my mysql client accept the Chinese I pasted in.
But I guess that’s a mysql problem…
xpt
4
https://bugzilla.redhat.com/show_bug.cgi?id=1187469
When I type or paste Chinese characters in mysql client, the Chinese characters are not inputed.
Reported in 2015, and still not solved…
l1t
5
You may need to set the client character set and the terminal's code page to UTF-8.
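A quick way to see why the encoding matters, as a small sketch rather than a fix for the bug above: the script prints the encoding your locale reports and checks whether a Chinese sample string survives a round trip through a given charset. The helper name roundtrip_ok and the sample string are just for illustration.

```python
import locale


def roundtrip_ok(text: str, encoding: str) -> bool:
    """Return True if text can be encoded and decoded back unchanged."""
    try:
        return text.encode(encoding).decode(encoding) == text
    except UnicodeEncodeError:
        # The charset has no representation for some of the characters.
        return False


if __name__ == "__main__":
    sample = "中文搜索"
    # Encoding the current locale reports; it should be a UTF-8 variant.
    print("locale encoding:", locale.getpreferredencoding(False))
    print("utf-8 round trip ok:", roundtrip_ok(sample, "utf-8"))
    print("latin-1 round trip ok:", roundtrip_ok(sample, "latin-1"))
```

If the locale is not UTF-8, switching it (for example to en_US.UTF-8) and starting the client with --default-character-set=utf8mb4, or running SET NAMES utf8mb4 in the session, is worth trying. Whether that helps with the Red Hat bug mentioned above I can't say.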