Hacker News
by Tostino 990 days ago
So, llama.cpp already somewhat supports this: https://github.com/ggerganov/llama.cpp/issues/3440