HN Mirror


by BoorishBears 2 hours ago

The problem the article is about is that suddenly even those of us who refuse to argue with a machine are being dragged into it.

I've had simple prompt engineering tasks that cause 4.8 to clamp down. In the past "browbeating" it (read: a sentence telling it not to read the task in bad faith) was enough.

Now it digs in and starts ranting about why it won't capitulate, how I'm actually wrong, etc.

Extremely frustrating. It became a problem with Opus 4.7 because they're trying to make up for the downgrade in parameter count with more RL, but RL does relatively poorly on things that are non-trivial to verify, like nuance in instructions.

2 comments

disillusioned 2 hours ago

I'm staying in a hotel right now and the TV is locked in hospitality mode, which was blocking me from just installing Plex. Opus 4.8 gave me this whole jeremiad about how I need to be careful, it probably won't work, and I should just watch on my laptop, though it did give me the service menu code. But man, it was such a downer.

Gemini gave me the code, clearly explained how best to get in, and then troubleshot a few other weird issues that cropped up, all without the moralizing.


totetsu 2 hours ago

This could be a good guardrailing technique: keep people away from your hard-limit refusals by ring-fencing them with frustrating pedantry.
