by jasonjmcghee
419 days ago
Is this the widely used term? Do you know of any open source models fine-tuned as an "inner loop" / native agentic LLM, or what the training process looks like? I don't see why any model couldn't be fine-tuned to work this way, i.e. a tool call doesn't need to be followed by an EOS token or something. The model could just wait for the output (or even keep generating, knowing there's an open request, and take action when the result comes back).
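To make the "tool use doesn't need to end with EOS" idea concrete, here is a rough sketch of a decode loop that pauses at the end of a tool call, runs the tool, splices the result into the context, and keeps generating until a real EOS. Everything in it is an assumption for illustration: the marker strings, `scripted_model` (a stand-in for a real LLM's next-token step), and the `add` tool are all hypothetical, and this says nothing about how any actual model is trained.

```python
from typing import Callable, Dict, List

# Hypothetical marker tokens; real models use their own special tokens.
EOS = "<eos>"
TOOL_CALL_START = "<tool_call>"
TOOL_CALL_END = "</tool_call>"


def scripted_model(context: List[str]) -> str:
    """Stand-in for an LLM next-token step (hypothetical, fully scripted)."""
    last = context[-1]
    if last.startswith("<tool_result>"):
        value = last[len("<tool_result>"):-len("</tool_result>")]
        return f"answer={value}"
    if last == TOOL_CALL_START:
        return "add 2 3"
    if last == "add 2 3":
        return TOOL_CALL_END
    if TOOL_CALL_START not in context:
        return TOOL_CALL_START
    return EOS


# Hypothetical tool registry: name -> function over string arguments.
TOOLS: Dict[str, Callable[..., str]] = {
    "add": lambda a, b: str(int(a) + int(b)),
}


def run_inner_loop(
    model: Callable[[List[str]], str],
    tools: Dict[str, Callable[..., str]],
    prompt: List[str],
    max_steps: int = 32,
) -> List[str]:
    """Generate until EOS, treating a finished tool call as a pause, not a stop."""
    context = list(prompt)
    for _ in range(max_steps):
        token = model(context)
        if token == EOS:
            break
        context.append(token)
        if token == TOOL_CALL_END:
            # Locate the most recent opening marker and parse the call body.
            start = len(context) - 1 - context[::-1].index(TOOL_CALL_START)
            name, *args = " ".join(context[start + 1:-1]).split()
            result = tools[name](*args)
            # Splice the result into the same sequence and keep decoding.
            context.append(f"<tool_result>{result}</tool_result>")
    return context


if __name__ == "__main__":
    print(run_inner_loop(scripted_model, TOOLS, ["What is 2+3?"]))
```

The loop here blocks while the tool runs. The "continue with an open request" variant would instead keep sampling tokens and append the result whenever it arrives, which this sketch does not attempt.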