|
|
|
|
|
by kiitos
372 days ago
|
|
> If the LLM hallucinates, then the code it produces is wrong. That wrong code isn't obviously or programmatically determinable as wrong, the agent has no way to figure out that it's wrong, it's not as if the LLM produces at the same time tests that identify that hallucinated code as being wrong. The only way that this wrong code can be identified as wrong is by the human user "looking closely" and figuring out that it is wrong The LLM can easily hallucinate code that will satisfy the agent and the compiler but will still fail the actual intent of the user. > I review LLM code line-by-line. Lots of code that compiles clean is nonetheless horrible. Indeed most code that LLMs generate compiles clean and is nevertheless horrible! I'm happy that you recognize this truth, but the fact that you review that LLM-generated code line-by-line makes you an extraordinary exception vs. the normal user, who generates LLM code and absolutely does not review it line-by-line. > But none of [the LLM generated code] includes hallucinated API calls. Hallucinated API calls are just one of many many possible kinds of hallucinated code that an LLM can generate, by no means does "hallucinated code" describe only "hallucinated API calls" -- ! |
|
I think at this point our respective points have been made, and we can wrap it up here.