| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ggm 224 days ago
	I suspect introspection and meta questions flag you up into logical systems which assume threat not outcome focussed responses.

1 comments

susdamso 224 days ago

Thank you for your reply. Could you elaborate a little more?

link

ggm 224 days ago

I don't work in AI but if I did it'd regard introspective questions to aspects of my own LLMs behaviour as threat risk more than purposeful debugging by customers. I'd code my systems accordingly. Slowing down service or being less exposing might be defensive or protecting.

link

susdamso 224 days ago

Thank you for your detailed response. I'm having a bit of a hard time with this issue right now.

link