|
|
|
|
|
by mayeaux
1303 days ago
|
|
Whisper does pretty well, even with background music and things like that, I think you're working with a pretty weird subsection of recorded audio that won't work, for that edge case to work you'll very likely need to train your own model. |
|