by abroadwin 460 days ago
Neat. It would be nice to provide an option to use an API endpoint without downloading an additional local model. I have several models downloaded via ollama and would prefer to use them without additional space being taken up by the default model.
1 comment

From the README:

Optionally, offload generation to speed up generation while extending the battery life of your MacBook.

The screenshot shows an example that mentions OpenAI and gpt-4o.

But it still forces you to download a local model before you can use that feature.