| Hi guys. I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it before, and it now also has transcription with voice editing. I'm looking for feedback, since I've found that people outside tech circles still don't use this kind of tool much. It's context-aware, in the sense that it reads your screen, documents, and active app to understand what you're working on. You can ask about PDFs, reply to emails, create calendar events, search the web, and edit text, all by voice. You can download a compiled version for free with the code HITOKUHN2026 at https://hitoku.me/draft/ (base price is 5 dollars). It supports Gemma 4 and Qwen 3.5 for text generation, plus multiple STT backends (Parakeet, Qwen3-ASR). Examples:
- Gemma4 in action, https://www.youtube.com/watch?v=OgfI-3YjEVU
- query a pdf document, https://www.youtube.com/watch?v=ggaDhut7FnU
- reply to email, https://www.youtube.com/watch?v=QFnHXMBp1gA
- and the usual voice dictation (with optional polishing)

I currently use it a lot with Claude Code and Logseq. Some friends and I are now also building a new cross-platform version. The long-term goal is to have interactive local AI models serving people and professionals. |
For my part, I need to be very, very sure when it's posted through Gumroad. I've gotten burned too many times by short-term (as in, within months) abandonware through the Gumroad sales channel.
The dev gets bored, doesn't want to deal with it, and the download goes unavailable. So you buy it, get a new computer next month, and can't install it. It's especially annoying when I "name my own price", typically around $20 to tip the dev, and then the dev won't even keep that build available.