Hacker News new | ask | show | jobs
by hodgehog11 53 days ago
More of the latter. It's a pet project of mine, and all of the LLMs tend to utterly fail at getting anywhere with it, at least in chats. In an agentic setup, it can chip away at some aspects, but it needs serious guidance on relevant language, notation, and concepts. To me, it demonstrates that the LLMs are not particularly good at crossing literatures, but then again, humans rarely seem to be good at that either...
1 comments

By agentic do you mean that you run these models through an harness in the cli? If yes which one? Thanks for sharing