Hacker News new | ask | show | jobs
by MacsHeadroom 790 days ago
Who is already using speculative decoding? I haven't seen anything about it in the llama.cpp or ollama docs.
1 comments