|
|
|
|
|
by econ
14 days ago
|
|
I wonder to what extent models should figure out which model to forward a query to. Or perhaps the big models could learn the difference between an easy and a hard question and charge accordingly? Perhaps, if it can measure complexity, even generate a quote? Small models are fine for small coding tasks but I don't see why big ones can't be broken down most of the time. |
|
This one does not have routing, but reasonix is insane, absolutely insane for saving money. I've used 1.3billion tokens at the cost of 4$. (99-100% cache hit)