| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by marcociavarella 26 days ago
	Original author here, thanks for sharing. Did anyone try to reproduce the results w/ reasoning models? Very curious to see this. A general meta-point: an LLM w/ no code generation and/or tool-calls will inherit non-trivial biases from its pre-training, post-training and safety guardrails