Hacker News new | ask | show | jobs
Show HN: Combining LLMs and Voice Models – Part 1 (nitric.io)
4 points by tholm 596 days ago
This is a guide that I wrote to showcase a new batch inference feature for an OSS framework that I author (nitric.io).

I know things like Podcast generation via NotebookLM and also NotebookLlama exist, but wanted to demonstrate a case where an API could be built, and subsequently orchestrated in the cloud.

This is just the first part for producing audio using suno/bark via an API. I'm currently working on a part 2 that will introduce an LLM to make scripts from short prompt, which will be piped to the code introduced in Part 1.

Looking for feedback on improving this, there are a few things I'd like to clean up but overall am pretty happy with the outputs it produces so far.

Thanks in advance for any feedback given.