|
|
|
|
|
by armanj
56 days ago
|
|
I did a quick benchmark & compared it with Qwen3.5:
https://github.com/ArmanJR/PrismML-Bonsai-vs-Qwen3.5-Benchma... in my results, accuracy-wise Ternary-Bonsai-8B is on par with Qwen3.5-4B. But in accuracy-per-byte, bonsai is the clear winner: => Ternary-Bonsai-1.7B achieved 65.1% from 462 MiB, beating Qwen3.5-0.8B by 12 points while being ~5% smaller on disk.
=> Ternary-Bonsai-4B is the accuracy-per-byte winner above 1 GiB. 83.0% from only 1.1 GiB, within 2 points of Qwen3.5-4B at 40% of the weight size. they show strong promise on edge devices and where disk space is limited. I think this lab is worth watching. |
|