|
|
|
|
|
by jamescham
610 days ago
|
|
Pete Warden and team just published a paper on Moonshine, their speech to text model. Key features include: - 1.7x overall speed boost compared to Whisper
- Flexible-sized input window, allowing for more efficient processing of shorter audio clips
- Up to 5x faster performance on 10-second audio clips
- Matches or exceeds Whisper's accuracy |
|