Hacker News new | ask | show | jobs
by jamescham 610 days ago
Pete Warden and team just published a paper on Moonshine, their speech to text model.

Key features include:

- 1.7x overall speed boost compared to Whisper - Flexible-sized input window, allowing for more efficient processing of shorter audio clips - Up to 5x faster performance on 10-second audio clips - Matches or exceeds Whisper's accuracy