Hacker News new | ask | show | jobs
by ianschmitz 859 days ago
I’m pretty new to running these locally, but here’s my understanding:

Best models currently: codellama or deepseek coder. 6.7B or 1B depending on how much latency you can tolerate

Treesittee: from looking at the logs of the chat completions requests for Continue or Twinny extensions for VS Code, they both appear to just send a chunk of the document as well as a special placeholder to indicate where the cursor currently is.