Hacker News new | ask | show | jobs
by curious_cat_163 529 days ago
And be a lot more efficient (e.g. DeepSeek-V3) [1] about it...

[1] https://arxiv.org/pdf/2412.19437v1