Hacker News new | ask | show | jobs
user: raffisk
created: 2025-11-12
karma: 11

Building & breaking things

submissions:

0 points | 0 comments
DFAH – open-source harness for replayable tool-using LLM agents
2 points | 1 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
LLM Output Drift in Financial Workflows: Validation and Mitigation (arXiv)
24 points | 26 comments