by geraltofrivia 1173 days ago
HuggingFace also hosts models (CLI/code/API), has an extensive community-driven dataset store, and benefits from some great positive feedback loops.

Great Library -> Large Community -> Large Coverage of Models & Datasets -> Larger Community -> Larger Coverage of Models & Datasets -> Revenue -> More Engineers -> Greater Library ...

Not all the steps (->) here are trivial: Contrast HF to Explosion (great folks behind spacy).


> Not all the steps (->) here are trivial: Contrast HF to Explosion (great folks behind spacy).

Can you elaborate on this, specifically the contrast between HF and Explosion?

Explosion's core contribution (not its moneymaker) is the great spaCy library, which they first released in 2015. It was excellent work, far better designed (IMO) than NLTK and other offerings at the time. Of course, the library itself isn't monetized. spaCy, too, lets you train custom models and use them.

This never turned into a model hub, despite a lot of people using spaCy and probably building custom models.

Again in contrast, Explosion's other product, Prodigy (its revenue stream), is not a SaaS either. It's great software, and I presume it brings in a steady income. But in 2023, I would imagine that HF's LLM hosting and cloud training environment bring in more money than Explosion's data annotation software.

I'll also add that Explosion has a heavy "production-ready, quickly" bent, and even supports wrapping HF models with spaCy. Explosion is probably my favorite company in the AI space and has provided the most tangible value of all the NLP tools I've used.