by prithvi24 675 days ago
This is great to see. Evals on voice are hard - we only have evals on text-based prompting, but they don't fully capture everything. Excited to give this a try.
1 comment

This tracks. Text evals to test core logic and voice evals for overall end-to-end performance!