by lyu07282 750 days ago
Isn't Whisper the speech-to-text model by OpenAI? Which model did you mean?

Yeah, that's correct. I meant this one: https://voicebox.metademolab.com/
Yeah, I tried a bunch of them, and OpenAI's TTS was by far the best.

Outside of that, it's a standard tech stack: Next.js, Postgres, TailwindCSS.

It is still early days for ML TTS, and it will be exciting to see the compute requirements drop enough for it to run on-device. OSS models show some promise, but they are still not there from a quality perspective.