Y
Hacker News
new
|
ask
|
show
|
jobs
by
altruios
82 days ago
what tokens/s are you getting with a 122B MoE model in this setup? I didn't see any benchmarks in the benchmarks section on the readme.md
3 comments
aegis_camera
82 days ago
https://www.sharpai.org/benchmark/
The MLX part is what we've done with SwiftLM, the local result is still being verified more details are on-going.
link
aegis_camera
82 days ago
I'll add more details. We just wired up the pipeline on both MAC and IOS.
link
gigatexal
82 days ago
yeah this I'd like to see added to teh readme.
link