BitNet: Scaling 1-bit Transformers for Large Language Models (2023): https://arxiv.org/abs/2310.11453