Hacker News new | ask | show | jobs
by its_down_again 575 days ago
Could you explain more on how to do this? e.g if I am using the Claude API in my service, how would you suggest I go about setting up and controlling my own inference endpoint?
2 comments

You can't. He means by using the open source models.
Runa local LLM tuned for coding on LM Studio. It has a server and provides endpoints.