Y
Hacker News
new
|
ask
|
show
|
jobs
by
lhl
1028 days ago
LocalAI
https://localai.io/
and LMStudio
https://lmstudio.ai/
both have fairly complete OpenAI compatibility layers. llama-cpp-python has a FastAPI server as well:
https://github.com/abetlen/llama-cpp-python/blob/main/llama_...
(as of this moment it hasn't merged GGUF update yet though)