Hacker News new | ask | show | jobs
Yzma = embedding+inference on VLM/LLM/SLM/TLM in pure Go using llama.cpp (github.com)
1 points by deadprogram 247 days ago