Hacker News
by hmartin, 89 days ago
Taalas hardware implementation of Llama 3.1 8B
They claim 16k tok/s, vs. Cerebras at 2k tok/s.
https://taalas.com/products/