Hacker News new | ask | show | jobs
by gertop 785 days ago
Hugging face is a model repository.

Ollama allows you to run those models.

Different things.

1 comments

I run models using HF just fine. I mean I’m using HF transformers repo, which gets models from HF hub.

Or do you mean commercial deployment of models for inference?

Are you talking about the Hugging Face Python libraries, the Hugging Face hosted inference APIs, the Hugging Face web interfaces, the Hugging Face iPhone app, Hugging Face Spaces (hosted Docker environments with GPU access) or something else?
I updated my comment above: I’m using HF transformers repo, which gets models from HF hub.
Do you have an NVIDIA GPU? I have not had much luck with the transformers library on a Mac.
Of course. I thought Nvidia GPUs are pretty much a must have to play with DL models.
Well being able to run these models on CPU was pretty much the revolutionary part of llama.cpp.
Ollama supports many radeons now. And I guess llama.cpp does too, after all it's what ollama uses as backend.