Hacker News new | ask | show | jobs
Speculative decoding of llama2 models in pure C (github.com)
2 points by mscheong 787 days ago