Koboldcpp does this: https://github.com/LostRuins/koboldcpp/releases/tag/v1.44.2

They basically just ship separate executables for the different llama.cpp backends and select the correct one with a Python script, which is fine because the executables are really small.
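For illustration, here is a rough sketch of what that kind of selection script could look like. This is not Koboldcpp's actual code: the backend names, the file names, and the idea of probing for `nvidia-smi` and `clinfo` on the PATH are all assumptions made up for the example.

```python
import shutil
from pathlib import Path

# Hypothetical backend names and file names, most preferred first.
BACKENDS = [
    ("cuda", "backend_cuda"),
    ("opencl", "backend_opencl"),
    ("cpu", "backend_cpu"),
]


def detect_available(probe=shutil.which):
    """Return the set of backend names that look usable on this machine.

    `probe` takes a tool name and returns a path or None; it is a
    parameter so the detection can be tested without real hardware.
    """
    available = {"cpu"}  # assumption: the CPU build always works
    if probe("nvidia-smi"):  # assumption: this tool implies a usable CUDA setup
        available.add("cuda")
    if probe("clinfo"):  # assumption: this tool implies a usable OpenCL setup
        available.add("opencl")
    return available


def pick_backend(available, bin_dir=Path("bin")):
    """Return (name, path) for the first preferred backend that is available."""
    for name, filename in BACKENDS:
        if name in available:
            return name, bin_dir / filename
    raise RuntimeError("no usable backend found")


if __name__ == "__main__":
    name, path = pick_backend(detect_available())
    print(f"selected backend: {name} ({path})")
```

A real launcher would then start or load the chosen binary. Shipping every variant only works because each one is small, which is the point above.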