Hacker News new | ask | show | jobs
On-Device LLM Inference Powered by X-Bit Quantization (github.com)
15 points by dynamix 745 days ago