by marcociavarella 26 days ago
Original author here, thanks for sharing. Has anyone tried to reproduce the results w/ reasoning models? Very curious to see that.

A general meta-point: an LLM w/ no code generation and/or tool calls will inherit non-trivial biases from its pre-training, post-training, and safety guardrails.