|
|
|
|
|
by daft_pink
317 days ago
|
|
isn’t the problem with the benchmarks that most people running ai locally are running way lower weights? i have an m4 studio with a lot of unified memory and i’m still no where near running a 120b model. i’m at like 30b apple or nvidia’s going to have to sell 1.5 tb ram machines before benchmark performance is going to be comparable Plus when you use claude or openai, these days it’s performing google searches etc that my local model isn’t doing. |
|
I'm running a 400B parameter model at FP8 and it still took a lot of post-training to get an even somewhat comparable performance
-
I think a lot of people implicitly bake in some grace because the models are open weights, and that's not unreasonable because of the flexibility... but in terms of raw performance it's not even close.
GPT-3.5 has better world knowledge than some 70B models, and a few even larger.