Hacker News
by hmartin 89 days ago
Taalas hardware implementation of Llama 3.1 8B. They claim 16k tok/s vs Cerebras at 2k. https://taalas.com/products/