by npwr 1950 days ago
It's great to see the whole chain of "speech to text to model to text to speech" in action. The sheer amount of computation involved shows in the delay during the API calls.

Before this is actually usable in a game or product, the models must become smaller and cheaper to run.