Hacker News new | ask | show | jobs
by j-bos 761 days ago
How do speech to speech models work? Do they just that many more tokens to capture nuances of spoken language?