| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by clktmr 125 days ago
	At least try a different question with similar logic, to ensure this isn't patched into the context since it's going viral.

1 comments

j_maffe 125 days ago

You can't "patch" LLM's in 4 hours and this is not the kind of question to trigger a web search

link

vimda 125 days ago

You absolutely can, either through the system prompt or by hardcoding overrides in the backend before it even hits the LLM, and I can guarantee that companies like Google are doing both

link

tlogan 125 days ago

This has been viral on Tiktok far at least one week. Not really 4 hours.

link

nroets 125 days ago

You can pattern match on the prompt (input) then (a) stuff the context with helpful hints to the LLM e.g. "Remember that a car is too heavy for a person to carry" or (b) upgrade to "thinking".

link

throwuxiytayq 125 days ago

Yes, I’m sure that’s what engineers at Google are doing all day. That, and maintaining the moon landing conspiracy.

link

anonymous_user9 125 days ago

If they aren't, they should be (for more effective fraud). Devoting a few of their 200,000 employees to make criticisms of LLMs look wrong seems like an effective use of marketing budget.

link

rluna828 124 days ago

It looks like they do. https://simonwillison.net/2025/May/25/claude-4-system-prompt... They patch it in the prompt and they eventually address it in the re-enforcement training. It seems the eventual goal is to patch all of these tiny "glitches" so as to hide the lack of cognition.

link

londons_explore 125 days ago

A tiny bit of fine-tuning would take minutes...

link