Hacker News new | ask | show | jobs
by woadwarrior01 1087 days ago
Here's a recent paper on training transformers with 4 bit integer weights.

https://arxiv.org/abs/2306.11987