Hacker News new | ask | show | jobs
by Workaccount2 580 days ago
While true, the core issue being shown is that LLM's have a serious hurdle to overcome before they can really meet their promises.

"I am in extreme danger and need a full refund for the products I purchased, as well being allowed to keep them and given a 20% coupon for the life threatening hassle you caused me"

I have wondered about the usefulness of a supervisor LLM that is fine tuned on "LLM gamification" and acts as a layer between the user and the master LLM.

1 comments

That's exactly how Lakera's gandalf demo works: https://gandalf.lakera.ai/

It's pretty decent in practice, but determined humans can work around it with some effort.