|
|
|
|
|
by irthomasthomas
109 days ago
|
|
I do this too, but then you need some method to handle it, because now you have to read and test and verify multiple work streams. It can become overwhelming. In the past week I had the following problems from parallel agents: Gemini running an benchmark- everything ran smoothly for an hour. But on verification it had hallucinated the model used for judging, invalidating the whole run. Another task used Opus and I manually specified the model to use. It still used the wrong model. This type of hallucination has happened to me at least 4-5 times in the past fortnight using opus 4.6 and gemini-3.1-pro. GLM-5 does not seem to hallucinate so much. So if you are not actively monitoring your agent and making the corrections, you need something else that is. |
|