Hacker News new | ask | show | jobs
by ijustlovemath 44 days ago
It seems like the problem in this application is that attention itself. Makes me wonder if using a transformer for transcription is the correct architecture.