Hacker News new | ask | show | jobs
by jrrv 3 hours ago
No worries, I appreciate the effort. I did go back and listen and they are indeed pronouncing sheí in the audio too.

I use a firefox extension to convert simplified to traditional, looks like it's open source so that may be of some use to you: https://github.com/tongwentang/tongwentang-extension.

Although there are some clashes that it does not handle, e.g. 隻 and 只 are both 只 in simplified, you just have to know which one it is from context, but the extension fails to convert to 隻 where appropriate.

1 comments

Thanks, really useful extension link. Proper traditional support probably needs a context aware layer, not a plain lookup. I will experiment with additional LLM enrichment. Appreciate you digging into this!