by ilyakaminsky
332 days ago
I wouldn't describe it as "unusable" so much as needing to understand its constraints and how to work around them. I built a business on top of Whisper [1] and one of the early key insights was to implement a good voice activity detection (VAD) model in order to reduce Whisper's hallucinations on silence.

[1] https://speechischeap.com
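
To illustrate the gating idea, here is a rough sketch that finds the non-silent spans of a recording so that only those would be passed on for transcription. It assumes a crude RMS energy threshold rather than a trained VAD model (the comment doesn't say which model was used), and `speech_segments`, `frame_ms`, and `rms_threshold` are hypothetical names chosen for this example. A real VAD model would handle background noise far better than this does.

```python
import math


def speech_segments(samples, sample_rate, frame_ms=30, rms_threshold=0.01):
    """Return (start_sec, end_sec) spans whose per-frame RMS meets rms_threshold.

    Assumption: `samples` is a sequence of floats in [-1.0, 1.0] (mono audio).
    This is an energy gate, not a trained VAD model.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    if frame_len <= 0:
        raise ValueError("frame_ms is too small for this sample_rate")

    segments = []
    start = None  # sample index where the current loud run began

    # Walk over whole, non-overlapping frames; a trailing partial frame is ignored.
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        if rms >= rms_threshold:
            if start is None:
                start = i
        elif start is not None:
            # First quiet frame after a loud run closes the segment.
            segments.append((start / sample_rate, i / sample_rate))
            start = None

    # Close a segment that runs to the end of the last whole frame.
    if start is not None:
        end = (len(samples) // frame_len) * frame_len
        segments.append((start / sample_rate, end / sample_rate))
    return segments
```

Each returned span could then be sliced out of the audio and transcribed on its own, so the silent stretches never reach Whisper at all.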