by winter_blue
334 days ago
I’m actually finding Claude 4 Sonnet’s thinking model too slow for my needs. It literally takes several minutes per query on Cursor, so running it locally is the exact opposite of what I’m looking for. Rather, I’m willing to pay more to have it run on a faster-than-normal cloud inference machine. Anthropic is already too slow. Since this model is open source, maybe someone could offer it at a “premium” pay-per-use price, with more resources thrown at it so that inference is a lot faster.