Hacker News new | ask | show | jobs
by iAkashPaul 783 days ago
Yep, termux is a good way to do this. Llama.cpp has Android example as well, I forked it here GitHub.com/iakashpaul/portal you can try it with any supported GGUF/Q4+Q8 models