Is there a tool/API for Manticore similar to Elasticsearch's analyzer (see here)? I am looking for something that, given a sample document or query, shows me how it was tokenized, stemmed, etc. This would help me do a comparative study of Manticore and Elasticsearch to explain some differences I observed in retrieval benchmarks.
What I've tried: I saw that the indextool utility has a --dumpdict INDEXNAME option, but I ran into an error when running it, and the documentation does not say exactly what the command returns.
Yes, there's CALL KEYWORDS. You can create an empty index with your tokenization settings and then run CALL KEYWORDS against it to see how it would tokenize your text.
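Here is a rough sketch of that workflow in Python, in case you want to script the comparison. It assumes a Manticore instance in real-time mode with the HTTP `/sql?mode=raw` endpoint reachable on `127.0.0.1:9308`; the table name `analyzer_test`, the `morphology='stem_en'` setting, and the helper names are placeholders I made up for illustration, so swap in your own settings. The exact request and response format of `/sql` can differ between versions, so treat this as a starting point rather than a definitive client.

```python
import json
import urllib.parse
import urllib.request


def quote(value):
    """Wrap a string in single quotes, escaping backslashes and quotes."""
    return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"


def build_call_keywords(text, table, stats=False):
    """Build a CALL KEYWORDS statement for the given text and table."""
    sql = f"CALL KEYWORDS({quote(text)}, {quote(table)}"
    if stats:
        # Ask for per-keyword document and hit counts as well.
        sql += ", 1 AS stats"
    return sql + ")"


def run_sql(sql, base_url="http://127.0.0.1:9308"):
    """Send one SQL statement to the HTTP endpoint (address is an assumption)."""
    body = urllib.parse.urlencode({"query": sql}).encode("utf-8")
    request = urllib.request.Request(f"{base_url}/sql?mode=raw", data=body)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def show_tokens(text, table="analyzer_test"):
    """Create an empty table with sample settings, then print its tokenization."""
    # Hypothetical settings: replace with the tokenization options you use.
    run_sql(f"CREATE TABLE IF NOT EXISTS {table}(content text) morphology='stem_en'")
    print(json.dumps(run_sql(build_call_keywords(text, table)), indent=2))
```

Calling `show_tokens("running shoes")` against a live server should print the raw result set. As far as I know, CALL KEYWORDS returns one row per token with `tokenized` and `normalized` columns, which you can put side by side with the output of Elasticsearch's analyze API.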