Y
Hacker News
new
|
ask
|
show
|
jobs
by
another_twist
118 days ago
Thats great. I think we need to start researching how to get cheaper models to do math. I have a hunch it should be possible to get leaner models to achieve these results with the right sort of reinforcement learning.
1 comments
alansaber
118 days ago
Deepseek wrote a decent paper on this
https://github.com/deepseek-ai/DeepSeek-Math-V2/blob/main/De...
link