Skip to content(if available)orjump to list(if available)

Jargonic Sets New SOTA for Japanese ASR

1317

SOTA: not used in the article but probably State Of The Art

ASR: Automatic Speech Recognition, speech-to-text

lenerdenator

And here I was, as a ham radio operator, excited to read something about Summits On The Air.

shuffles dejectedly back to shack

rfv6723

Why no comparition to gpt-4o-transcribe?

If you don't compare to latest model on the market, how can you claim it's SOTA?

According to OpenAI, gpt-4o-transcribe has much better performance than whisper-large-v2.

https://openai.com/index/introducing-our-next-generation-aud...

albertzeyer

Are there any details on what they changed to improve over other existing models?