They basically just ship executables for different llama.cpp backends and select the correct one with a python script, which is fine, as the executables are really small.
They basically just ship executables for different llama.cpp backends and select the correct one with a python script, which is fine, as the executables are really small.