|
|
|
|
|
by timeleft--
105 days ago
|
|
Thanks for the great observations. I haven't done a formal benchmark, but I built this because I saw this failure mode in several production agentic systems. The quality of the trust-gate will not be a one and done prompt, but will have to iterate on that and will most likely be application specific. |
|