Hacker News new | ask | show | jobs
Pure Go hardware accelerated local inference on VLMs using llama.cpp (github.com)
1 points by deadprogram 223 days ago