by concats
97 days ago
How does it compare for models of any meaningful size? These 0.6B-4B models are, frankly, just amusing curiosities, commonly regarded as too error-prone for any non-demo work. The reason people are buying Apple Silicon today is that the unified memory lets them run larger models that are otherwise cost-prohibitive to run (usually requiring Nvidia server GPUs). It would be much more interesting to see benchmarks for things like Qwen3.5-122B-A10B, GLM-5, or any dense model in the 20B+ range. Thanks.