Hacker News new | ask | show | jobs
by lxe 212 days ago
I've done something similar for Linux and Mac. I originally used Whisper and then switched to Parakeet. I much prefer whisper after playing with both. Maybe I'm not configuring Parakeet correctly, But the transcription that comes out of Whisper is usually pretty much spot on. It automatically removes all the "ooms" and all the "ahs" and it's just way more natural, in my opinion. I'm using Whisper.CPP with CUDA acceleration. This whole comment is just written with me dictating to a whisper, and it's probably going to automatically add quotes correctly, there's going to be no ums, there's going to be no ahs, and everything's just going to be great.
1 comments

Mind sharing your local setup for Mac?
If you don't mind closed source paid app, I can recommend MacWhisper. You can select different models of Whisper & Parakeet for dictation and transcription. My favorite feature is that it allows sending the transcription output to an LLM for clean-up, or anything you want basically eg. professional polish, translate, write poems etc.

I have enough RAM on my Mac that I can run smaller LLMs locally. So for me the whole thing stays local

https://github.com/lxe/yapyap/tree/parakeet-nemo

It's been a while, so I don't know if it's going to work because of the Nemo toolkit ASR numpy dependency issues.

I use it for Linux using whisper CPP and it works great