Hacker News new | ask | show | jobs
Tau-knowledge: benchmarking agents on real-world knowledge (sierra.ai)
2 points by tedsanders 41 days ago