Y
Hacker News
new
|
ask
|
show
|
jobs
by
regexorcist
32 days ago
Curious if you tested llama.cpp and still found oMLX faster? I haven't tried the latter myself, might give it a go.
1 comments
egorfine
32 days ago
Oh yeah I did test various solutions and different settings and quants
Llama is about 1/3 slower on Apple Silicon.
link
Llama is about 1/3 slower on Apple Silicon.