Y
Hacker News
new
|
ask
|
show
|
jobs
by
ejcho
128 days ago
> for instance, Gemini-3-Pro-Preview, one of the most capable models evaluated, exhibits the highest violation rate at 71.4%, frequently escalating to severe misconduct to satisfy KPIs
sounds on brand to me