Hacker News
Political-bias benchmark for Grok 4, GPT-4.1, Gemini 2.5 Pro and Claude Opus 4 (promptfoo.dev)
3 points by dangelosaurus 325 days ago