Apple Releases Open Source AI Models That Run On-Device

Y	Hacker News new \| ask \| show \| jobs

	Apple Releases Open Source AI Models That Run On-Device (macrumors.com)
	54 points by 911e 789 days ago

7 comments

JKCalhoun 789 days ago

An LLM in my pocket is truly a mind-blowing concept, I have to say. More than anything else — phone, camera, internet. The feels like a really big deal.

And with regard to LLMs (AI?) in general, I don't think right now we have any idea what we will all be using them for in ten years. But it just feels like a fundamental change is coming from all this.

link

gnabgib 789 days ago

Discussion: [0] (33 points, 18 hours ago, 7 comments)

[0]: https://news.ycombinator.com/item?id=40140675

link

solarkraft 788 days ago

I'm not knowledgeable enough to parse much out of the Readme.

How "good" are the models approximately? What hardware do I need to run them? How fast are they?

link

ChrisArchitect 789 days ago

[dupe]

Some more discussion: https://news.ycombinator.com/item?id=40140675

link

simonw 789 days ago

Has anyone seen a working, clearly explained recipe for running this using the Python MLX library on macOS yet?

link

ein0p 789 days ago

What’s there to explain? There’s a readme in the repo that shows how to do it.

link

simonw 788 days ago

I tried and failed to follow that. I'm looking for a report from someone who has got it (or something like it) to work.

link

ein0p 788 days ago

I’ve tried their 1.1B model. The only hiccup was that it seems to require mlx 0.10.0 which is what’s in requirements.txt. You also have to place the llama tokenizer file into the model dir - they do not distribute it. The models published for MLX do not seem to be instruction tuned, so with their default prompt they get repetitive. But I suppose you could convert the instruction tuned checkpoints with the script in the repo.

link

mritchie712 788 days ago

guessing it'll be available thru ollama soon

https://huggingface.co/apple/OpenELM/discussions/5

link

sp332 789 days ago

Why is the 3B model worse than the 450M model on MMLU and TruthfulQA?

link

Bloating 789 days ago

Now we can give credit to Apple for invented AI!

link

Turing_Machine 788 days ago