Y
Hacker News
new
|
ask
|
show
|
jobs
by
erichocean
164 days ago
Here you go:
https://research.nvidia.com/labs/lpr/ToolOrchestra/
Big models (like Claude Opus 4.5) can (and do) just RL-train this into the main model.