Hacker News
by PeterisP 756 days ago
For arbitrary documents and queries, how do we reliably segment the text between those two different languages? And if we can do that, why can't the model do it implicitly?