Hacker News
by rsbeare 945 days ago
This is great! Nice work.

I built my own Whisper-based auto-typer (forked from whisper-typer), which types whatever you say.

I added an OpenAI Q/A and RAG query feature so I could ask it questions by voice command (instead of auto keystroke typing). For responses to questions, I used ElevenLabs, but even with latency optimization and streaming enabled it was slow, so I disabled it.

I just swapped from OpenAI to Mistral 7B for Q/A querying, and it's much more responsive. Stoked to explore StyleTTS2 now!

Really glad that I came across your post. Thank you for sharing!