|
|
|
|
|
by Udo
73 days ago
|
|
This looks very promising. It would be cool if support for Bonsai-style models would land in mainline MLX soon, looking forward to trying it out. It seems PrismML has implemented a better version of an idea I had a while back: what if we had a 1-bit model where the scale of the weight is determined by its position. The model would have to be trained from the ground up for this though, which is why I never tried it. The interleaved scale factor approach of Bonsai is a much more flexible approach at almost the same cost. |
|