by mkl 969 days ago
Please consider LLaMA.cpp (https://github.com/ggerganov/llama.cpp), which supports a lot of models and doesn't need an expensive GPU.