|
|
|
|
|
by ZYbCRq22HbJ2y7
360 days ago
|
|
There will likely be custom, large, and expensive models at an enterprise level in the near future (some large entities and governments already have them (niprgpt)). With that in mind, what would be the business sense in siloing a single "Agent" instead of using something like a service discovery service that all benefit from? |
|
Also the current LLMs have still too many issues because they are autoregressive and heavily biased towards the first few generated tokens. They also still don't have full bidirectional awareness of certain relationships due to how they are masked during the training. Discrete diffusion looks interesting but I am not sure how does that one deal with tools as I've never seen a model from that class using any tools.