by gtsnexp 636 days ago
Can you create an application that processes Lex Fridman's podcasts and removes his voice?
1 comments

Maybe reading the transcripts of the podcasts would serve your purpose?
That approach could work, but it may make it more difficult to consume content while exercising.
Podcast -> Transcription -> Text to speech.
Hmm, I wonder if anyone has a simple pipeline for extracting data for "voice cloning" type models from the combination of original audio and transcribed text. It should be possible to chain this with some post-processing to replace Lex's voice with something more pleasing, and maybe throw in some automated rewriting of the transcript to remove the fluff.
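For what it's worth, here is a rough sketch of the middle of that pipeline, assuming you already have a transcript with speaker labels and timestamps from some diarization + transcription step. The `Segment` type, the speaker label "Lex", and the helper names are all made up for illustration and do not come from any particular tool. It shows the two ideas from this thread: dropping the host's segments before handing the text to TTS, and collecting (start, end, text) triples for one speaker as raw material for a voice-cloning dataset.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One diarized transcript segment (hypothetical format)."""
    speaker: str
    start: float  # seconds from the start of the audio
    end: float    # seconds from the start of the audio
    text: str


def drop_speaker(segments, speaker):
    """Return the segments not spoken by `speaker`, in original order."""
    return [s for s in segments if s.speaker != speaker]


def cloning_manifest(segments, speaker, min_seconds=1.0):
    """Collect (start, end, text) for one speaker, skipping very short clips.

    Each triple pairs a span of the original audio with its transcribed text.
    """
    return [
        (s.start, s.end, s.text)
        for s in segments
        if s.speaker == speaker and (s.end - s.start) >= min_seconds
    ]


def script_for_tts(segments):
    """Join the remaining segments into a plain-text script, one line each."""
    return "\n".join(f"{s.speaker}: {s.text}" for s in segments)
```

Usage would be something like `script_for_tts(drop_speaker(segments, "Lex"))` for the text-to-speech input, and `cloning_manifest(segments, "Guest")` to get the audio spans to cut out of the original file. The actual transcription, audio slicing, and speech synthesis are left out, since those depend on whichever tools you pick.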
Excellent flowchart! Thank you