Hacker News new | ask | show | jobs
by keep_reading 904 days ago
Apple has released a paper about running LLMs efficiently with low memory

https://arxiv.org/pdf/2312.11514.pdf