Hacker News new | ask | show | jobs
by bvnierop 387 days ago
I'm sure tools exist, but for a DIY solution that any LLM can help you piece together:

- Silero VAD for chunking audio;

- Whisper for transcribing;

- phi3 or bart-large-cnn (which is finetuned to summarize) for summarizing;

This entire stack can run with your machine in airplane mode once you've downloaded the models.