by mixermachine
131 days ago
Nothing will come close to Opus 4.6 here. You will be able to fit a distilled 20B to 30B model on your GPU.
gpt-oss-20b is quite good in my local testing on a MacBook Pro (M2 Pro, 32GB). The bigger downside, compared to Opus or any other hosted model, is the limited context. You might be able to reach around 30k tokens.
Hosted models often have 128k or more. Opus 4.6 has 200k as its standard and 1M in API beta mode.
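
To make the context limit a bit more concrete, here is a rough back-of-the-envelope sketch of how the KV cache eats memory as context grows. It assumes plain attention with an fp16 cache; the layer count, KV head count, head dimension and the 4 GiB of free memory are hypothetical placeholders, not the real numbers for gpt-oss-20b or any other model. Sliding-window attention or a quantized cache would change the result.

```python
def kv_cache_bytes(tokens, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Approximate KV-cache size for `tokens` tokens of context.

    The leading factor 2 accounts for storing both keys and values.
    bytes_per_elem=2 assumes an fp16/bf16 cache.
    """
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem * tokens


def max_context_tokens(free_bytes, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """How many tokens fit in `free_bytes` of memory left over after the weights."""
    per_token = kv_cache_bytes(1, n_layers, n_kv_heads, head_dim, bytes_per_elem)
    return free_bytes // per_token


# Hypothetical model shape, for illustration only (not a real model's config).
cfg = dict(n_layers=32, n_kv_heads=8, head_dim=128)

# Hypothetical: 4 GiB left for the cache once the weights are loaded.
free = 4 * 1024**3

print(kv_cache_bytes(1, **cfg))         # bytes of cache per token
print(max_context_tokens(free, **cfg))  # tokens that fit in the free memory
```

With these made-up numbers each token costs 128 KiB of cache, so 4 GiB holds about 32k tokens. Plug in your own model's config and whatever memory is left after loading the weights to get an estimate for your setup.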