Hacker News new | ask | show | jobs
by computerex 101 days ago
You can use my new golang inference engine to run variants of Qwen 3.5 faster than llama.cpp: https://github.com/computerex/dlgo