Hacker News new | ask | show | jobs
by genpfault 95 days ago
Speculative decoding[1]?

[1]: https://github.com/ggml-org/llama.cpp/blob/master/docs/specu...