Hacker News new | ask | show | jobs
by mackid 508 days ago
Microsoft did a bunch of research into low-bit weights for models. I guess OAI didn’t look at this work.

https://proceedings.neurips.cc/paper/2020/file/747e32ab0fea7...