Hacker News new | ask | show | jobs
by cryptonector 430 days ago
> I'd really like to see what an LLM Incident Response looks like!

It must look like this: "Uggh! Here we go again!" and "boss, we really can't make the guardrails secure, at some point we might have to give up", with the PHB saying "keep trying, we have to have them guardrails!".

1 comments

The trajectory of AI is: emulating humans. We've never been able to align humans completely, so it would be surprising if we could align AI.
It's like you're saying that AI has the same sort of fuzzy "free will" that we do, and just as an obedient slave might be convinced to break his or her bonds, so might an AI.
Religion is an attempt at the alignment problem and that experiment failed dramatically. Spiritual system prompting was never fully hardened against atheistic jail-breaking.
a wise person once told me -- avoid using "is" when entering complex idea spaces
Thank you, but I craft my takes specifically to warp consensus reality. Epistemic humility is bringing pre-lost arguments to a debate and proudly laying them at your opponent’s feet, saying, "please, go ahead and stab me with these. I brought plenty."
> Epistemic humility is bringing pre-lost arguments to a debate

hubris

will to power