Hacker News new | ask | show | jobs
PowerInfer-2: Fast Large Language Model Inference on a Smartphone (powerinfer.ai)
1 points by 27theo 731 days ago
1 comments

Arxiv Paper: https://arxiv.org/abs/2406.06282

Previous submission (paper link submitted): https://news.ycombinator.com/item?id=40646450