| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by bvnierop 387 days ago

I'm sure tools exist, but for a DIY solution that any LLM can help you piece together:

- Silero VAD for chunking audio;

- Whisper for transcribing;

- phi3 or bart-large-cnn (which is finetuned to summarize) for summarizing;

This entire stack can run with your machine in airplane mode once you've downloaded the models.