Y
Hacker News
new
|
ask
|
show
|
jobs
by
Eddygandr
1040 days ago
I really dislike benchllm's use of yamls for test cases - I'd rather it be in code.
""" input: What's 1+1? Be very terse, only numeric output expected: - 2 - 2.0 """
1 comments
jacky2wong
1040 days ago
Agreed. No one should ever have to touch YAML for writing unit tests for LLMs. Ever. Most people writing agents and LLM applications are Python developers/data scientists/ML engineers.
link