thanks Barry! i was thinking about hundreds of thousands to millions of elements in the extreme case.
based on some preliminary tests - this seems to work with plain index after increasing value of max_xmlpipe2_field to prevent truncation of input fields.