by phonon 391 days ago
This is the state of the art for such a setup. Really good performance!

https://github.com/kvcache-ai/ktransformers