| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Eisenstein 641 days ago

Download koboldcpp and llama3.1 gguf weights, use it with the llama3 completions adapter.

Edit the 'background.js' file in the extension and replace the openAI endpoint with

Set anything you want as an API key. Now you have a truly local version.