HN Mirror



	by shuhongwu 362 days ago
	1. llama.cpp runs on both Android and iOS devices. 2. If you want non-transformer models, Mamba and RWKV are also good options. 3. It also helps to have a solid understanding of embeddings, tokens, and the structure of transformers. On Apple Silicon chips, MLX is often the best choice.

1 comment

ada1981 362 days ago

Thanks!

I’m trying to create something that will run online when connected, run locally when offline, and dynamically download relevant files to prepare for offline use.
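
One way that online/offline split could be sketched, purely as an illustration: everything here is hypothetical (the names `answer` and `prefetch`, and the `is_online`, `remote`, `local`, and `fetch` callables are placeholders, not part of llama.cpp, MLX, or any other library). The idea is to try the hosted backend when connected, fall back to the on-device model otherwise, and fill a local cache with whatever files are still missing while a connection is available.

```python
from pathlib import Path
from typing import Callable, Iterable, List


def answer(prompt: str,
           is_online: Callable[[], bool],
           remote: Callable[[str], str],
           local: Callable[[str], str]) -> str:
    """Use the remote backend when connected, otherwise the local one."""
    if is_online():
        try:
            return remote(prompt)
        except OSError:
            # The connection dropped mid-request; fall through to local.
            pass
    return local(prompt)


def prefetch(names: Iterable[str],
             cache_dir: Path,
             fetch: Callable[[str], bytes]) -> List[str]:
    """Download files missing from cache_dir; return the names fetched."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    fetched = []
    for name in names:
        target = cache_dir / name
        if not target.exists():
            # Only hit the network for files that are not cached yet.
            target.write_bytes(fetch(name))
            fetched.append(name)
    return fetched
```

In a real app, `local` would wrap whatever on-device runtime you pick, and `fetch` would be your actual download call. Passing them in as plain callables keeps the switching logic easy to test without a network.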
