Hacker News new | ask | show | jobs
by cootsnuck 558 days ago
So perhaps could be useful for fine-tuning on smaller models?
1 comments

I think so. I believe this type of reasoning method, which achieves better results through longer computation time, is very useful on edge devices like mobile phones. Consider a scenario where we only need the model to output a function/action call on the phone; we don't require it to provide an immediate response.