Hacker News new | ask | show | jobs
Jargonic Sets New SOTA for Japanese ASR (aiola.ai)
19 points by four_fifths 402 days ago
3 comments

SOTA: not used in the article but probably State Of The Art

ASR: Automatic Speech Recognition, speech-to-text

And here I was, as a ham radio operator, excited to read something about Summits On The Air.

shuffles dejectedly back to shack

Why no comparition to gpt-4o-transcribe?

If you don't compare to latest model on the market, how can you claim it's SOTA?

According to OpenAI, gpt-4o-transcribe has much better performance than whisper-large-v2.

https://openai.com/index/introducing-our-next-generation-aud...

Are there any details on what they changed to improve over other existing models?