Hacker News new | ask | show | jobs
by tikotus 187 days ago
Thank you so much! Also, you might find this interesting regarding testing LLMs: https://www.nicksypteras.com/blog/cbs-benchmark.html