Hacker News
by junrushao1994, 1141 days ago
TVM Unity, the compiler used by MLC-LLM, does support CPU targets, including the SIMD instructions of each CPU backend, via LLVM, but we haven't tried it out yet. I believe llama.cpp is the best option out of the box at the moment.