Yes, this model can cause issues like the one you describe. It becomes less of a problem with phrase or multi-token queries, since whole phrases are rarely anagrams of one another.
For the most popular terms, a secondary index may be needed to resolve which anagram is the intended one.
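To make the idea concrete, here is a minimal sketch, not the actual implementation. It assumes the model keys each term by an order-insensitive signature (its sorted letters), so anagrams collide in the primary index. The names `signature`, `AnagramIndex`, `popular_terms`, `lookup` and `lookup_all` are all hypothetical and only serve the illustration.

```python
from collections import defaultdict


def signature(term):
    """Order-insensitive key (assumed model): anagrams share one signature."""
    return "".join(sorted(term.lower()))


class AnagramIndex:
    def __init__(self, popular_terms=()):
        # Hypothetical set of terms that get an exact-match secondary index.
        self.popular_terms = {t.lower() for t in popular_terms}
        self.primary = defaultdict(set)    # signature -> doc ids (anagrams collide)
        self.secondary = defaultdict(set)  # exact popular term -> doc ids

    def add(self, doc_id, text):
        for term in text.lower().split():
            self.primary[signature(term)].add(doc_id)
            if term in self.popular_terms:
                self.secondary[term].add(doc_id)

    def lookup(self, term):
        term = term.lower()
        if term in self.popular_terms:
            # The secondary index resolves which anagram was meant.
            return set(self.secondary.get(term, set()))
        # Otherwise fall back to the collision-prone primary index.
        return set(self.primary.get(signature(term), set()))

    def lookup_all(self, tokens):
        """Multi-token query: a doc must match every token's signature."""
        sets = [self.lookup(t) for t in tokens]
        return set.intersection(*sets) if sets else set()


idx = AnagramIndex(popular_terms={"listen"})
idx.add(1, "listen to the radio")
idx.add(2, "silent night")

print(idx.lookup("listen"))                 # {1}: resolved by the secondary index
print(idx.lookup("silent"))                 # {1, 2}: anagram collision in the primary index
print(idx.lookup_all(["silent", "night"]))  # {2}: the extra token removes the collision
```

In this sketch a single-token query for an untracked term still returns anagram matches, while a multi-token query only returns a false positive if a document contains an anagram of every token, which is much rarer.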