Hacker News new | ask | show | jobs
by sumanyusharma 668 days ago
This tracks. Text evals to test core logic and voice evals for overall end-to-end performance!