Hacker News new | ask | show | jobs
by christiansafka 500 days ago
I didn't make this clear enough in the post, but we're still working on voice cloning and inflection transfer. Voice cloning is easier, but to support inflection transfer we have to modality-align an LLM.