Hello. Why do you have Lao language characters (eg. U+0E81…U+0E82/2) in non_cjk and Thai characters (eg. U+0E01…U+0E30) in cjk? As far as I know, both of these languages are non-hieroglyphic.
Also I find a thai chars in korean charset
U+0E01…U+0E30,
U+0E32,
U+0E33,
U+0E40…U+0E46,
U+0E50…U+0E59,
Probably by mistake. Pls create an issue on github. PRs are welcome. The charsets are here manticoresearch/src/charsets at master · manticoresoftware/manticoresearch · GitHub
I tries to enable support for Thai chars in 6.2 but charset=Thai gives an error, it seems the file is not there. Was it added in 6.3 ?
I fixed it for now by adding the characters range manually but wonder what is the proper way to fix this. Any advice ?
Thanks
P.S.
I also looked on Git but the Thai file is not there either, please advice
Unfortunately it didn’t make it to 6.3.0, the related PR Is Thai charset by Nick-S-2018 · Pull Request #2151 · manticoresoftware/manticoresearch · GitHub and there’s still some work to be done.