Hacker News new | ask | show | jobs
by klysm 1262 days ago
What’s real time text to speech mean? Like latency from space bar to spoken?
1 comments

Not latency. Like it can synthesize at least as fast as it plays back. Meaning an hour of audio can be generated in an hour or less.
More importantly, can it synthesize as a stream.