Hacker News new | ask | show | jobs
by angrais 1120 days ago
Have you tested MMS with real-world data? It's perhaps outclassed on evaluation metrics but on real-world data it is not as good as whisper.
1 comments

Have you tried MMS on real world data or is it just assumption?
Yes, of course.

Real world data being: one on one interviews (no background noise), small groups of people chatting (lots of background noise), and specific audio recordings ( with varying British regional accents.

In all three instances whisper produced a more accurate transcription.

This is for personal use. The license of MMS is also restrictive so cannot he used for commercial uses while whisper can. Another key consideration when wondering what to choose. On the other hand, one can train MMS (so using own custom dataset) so for some projects it may be more suitable.