Hacker News new | ask | show | jobs
by SparkyMcUnicorn 399 days ago
MLC[0] indicates that it can run models in the 8B range on iOS, but 1-3B sounds more reasonable to me.

[0] https://llm.mlc.ai/docs/deploy/ios.html#bring-your-own-model