Hacker News new | ask | show | jobs
by josephh 795 days ago
Does anyone know of a good text normalization (?) library that converts symbols and initialisms into plain English before feeding them into a TTS model? All the models that I've used so far do a horrible job at synthesizing speech for them and I'm wondering whether this is the missing piece in the pipeline.
1 comments

I’ve found GPT 3.5 to do a good job of this, not perfect but I bet with some more prompt engineering it could get really good.
Thanks! It never occurred to me that I can just tweak the system prompt to make sure the LL model never outputs symbols and initialisms as is.