Hacker News new | ask | show | jobs
by brigadier132 632 days ago
You can test it by comparing human translations with LLM translations. The results are pretty close. Like I said in another comment, the common failure mode with mandarin is around names and genders