Hacker News new | ask | show | jobs
Show HN: Crisper – On-device voice to polished text for macOS (speakcrisper.com)
1 points by gokulnair2001 35 days ago
Hey HN,

I built Crisper because every dictation tool I tried either sent audio to the cloud or gave me raw, messy transcripts I still had to fix.

Crisper runs entirely on-device — no network calls, no account, no subscription. It does two things in sequence: transcribes using a speech model, then runs a local LM pass to strip filler words, fix grammar, and make the output sound intentional. The whole thing takes ~1–2 seconds on Apple Silicon.

A floating hotkey pill sits above every window. When you're done recording, it auto-pastes back into whatever app you were in before — Slack, Notion, VS Code, anything.

A few things I'm happy with: - Three recording modes (toggle, hold-to-record, re-paste last) — all rebindable - Full transcript library with source app, timestamp, and audio playback - Fully offline after first-run model download

Free to download. Would love feedback on the AI polish quality especially — that's the part I'm still tuning.

https://speakcrisper.com

2 comments

Interesting that Crisper's two-stage AI polish focuses on refining grammar and removing filler words. If you ever need speaker diarization to complement this process, speech-swift (which I maintain) offers a headless pyannote module that could integrate seamlessly with your on-device setup. https://soniqo.audio/guides/diarize
How does your "Crisper Speech Model" compared to parakeet-0.8b or similar? I've had great experience with that model via handy.computer.
The biggest difference is parakeet is trained only for English transcription and Crisper uses Whisper model which supports 90+ languages. This is the first version of Crisper where we have used existing OSS models, but we are in process to fine-tune certain models as per our need.