Hacker News new | ask | show | jobs
by wicket 60 days ago
Related paper

BitNet: Scaling 1-bit Transformers for Large Language Models (2023): https://arxiv.org/abs/2310.11453