Hacker News new | ask | show | jobs
by minimaxir 462 days ago
Passing the generated audio back to GPT-4o to ask for the structured annotations would be a fun test case.
1 comments

this is a good solve. we don't support word time stamps natively yet, but are working on teaching GPT-4o that skill