| HN Mirror



	by another_twist 165 days ago
	That's great. I think we need to start researching how to get cheaper models to do math. I have a hunch that leaner models could achieve these results with the right sort of reinforcement learning.

1 comment

alansaber 165 days ago

DeepSeek wrote a decent paper on this: https://github.com/deepseek-ai/DeepSeek-Math-V2/blob/main/De...