Hacker News new | ask | show | jobs
Prompt eval cues predicted refusal shifts across 32k LLM rollouts (medium.com)
1 points by ratnaditya 35 days ago