HN Mirror


	by cryptonector 430 days ago
	> I'd really like to see what an LLM Incident Response looks like! It must look like this: "Uggh! Here we go again!" and "boss, we really can't make the guardrails secure, at some point we might have to give up", with the PHB saying "keep trying, we have to have them guardrails!".

michaelfeathers 430 days ago

The trajectory of AI is: emulating humans. We've never been able to align humans completely, so it would be surprising if we could align AI.

cryptonector 430 days ago

It's like you're saying that AI has the same sort of fuzzy "free will" that we do, and just as an obedient slave might be convinced to break his or her bonds, so might an AI.

kelseyfrog 430 days ago

Religion is an attempt at the alignment problem and that experiment failed dramatically. Spiritual system prompting was never fully hardened against atheistic jail-breaking.

mistrial9 430 days ago

a wise person once told me -- avoid using "is" when entering complex idea spaces

kelseyfrog 430 days ago

Thank you, but I craft my takes specifically to warp consensus reality. Epistemic humility is bringing pre-lost arguments to a debate and proudly laying them at your opponent’s feet, saying, "please, go ahead and stab me with these. I brought plenty."

mistrial9 429 days ago

> Epistemic humility is bringing pre-lost arguments to a debate

hubris

kelseyfrog 429 days ago

will to power
