by woeirua
46 days ago
At the end of the day, the company is going to audit what the agent has done. If the agent issues too many refunds, that's a major red flag for the company providing the agent and likely results in the contract being terminated. I don't see how anyone can underwrite what agents are going to do today, given that they're still so susceptible to prompt injection. You didn't address my concern: non-reasoning models are so, so variable in their output.
|
2. Another part of the value prop of these companies is figuring out how to construct the proper harness to take advantage of the lower latency of faster models while shoring up their weaker intelligence: how you blend deterministic and non-deterministic behaviors, compliance, etc.
It's a hard problem, which is why the F500 is willing to pay up.
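
To make "blend deterministic and non-deterministic behaviors" concrete, here is a rough sketch of one possible harness shape for the refund case discussed in this thread. The model only proposes a refund, and plain deterministic code decides whether it goes through. Everything here is hypothetical and for illustration only: `RefundProposal`, `RefundGate`, and the two dollar caps are made-up names and numbers, not anything a real vendor is known to use.

```python
from dataclasses import dataclass

# Hypothetical limits, chosen only for illustration.
MAX_AUTO_REFUND = 50.00   # largest refund approved without a human
DAILY_BUDGET = 200.00     # total the agent may refund per day


@dataclass(frozen=True)
class RefundProposal:
    """What the (non-deterministic) model asks for."""
    order_id: str
    amount: float
    reason: str


class RefundGate:
    """Deterministic checks applied to every model proposal."""

    def __init__(self, order_totals: dict[str, float]):
        self.order_totals = order_totals   # order_id -> amount paid
        self.spent_today = 0.0
        self.audit_log: list[tuple[str, str, float]] = []

    def review(self, p: RefundProposal) -> str:
        total = self.order_totals.get(p.order_id)
        if total is None or p.amount <= 0 or p.amount > total:
            # Unknown order or impossible amount: never pay out.
            decision = "reject"
        elif (p.amount > MAX_AUTO_REFUND
              or self.spent_today + p.amount > DAILY_BUDGET):
            # Plausible but over a cap: hand off to a human.
            decision = "escalate"
        else:
            decision = "approve"
            self.spent_today += p.amount
        # Every decision is recorded so it can be audited later.
        self.audit_log.append((p.order_id, decision, p.amount))
        return decision
```

The point of this shape is that a prompt-injected or just erratic model can at worst produce a proposal that gets rejected or escalated. The caps and the audit log sit outside the model, so the worst-case payout is bounded by the gate and not by how the model behaves on a given day.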