Hacker News new | ask | show | jobs
by w1nk 1505 days ago
So this question can only come from a place where you have no idea what they do in their field. For every news article or arxiv post that you see talking about how this amazing new GPT-N model has broken all sorts of language benchmark scores, you'll notice that basically nobody can reproduce those results. That's mostly due to the barrier of entry with respect to hardware for training the models.

Huggingface is releasing APIs and model checkpoints that allow any random internet user to execute (almost) SOTA language models in production. FYI - that's an amazing leap forward and a strong piece of kit for MLEs to have access to.

So let me rephrase your question: Is general access to SOTA language models worth 100mm to the software market?

I suspect the answer is a resounding yes.

3 comments

> So let me rephrase your question: Is general access to SOTA language models worth 100mm to the software market?

That doesn't answer the original question. The question was:

"How will you extract $100M+ from the software market?"

Your answer was:

"I think that Huggingface will produce $100M worth of value."

Which may or may not be true, but just because something produces X amount of value doesn't mean the project will be able to extract that value. See any open source project.

So if you want someone to answer precisely how they'll extract hundreds of millions of dollars from an emerging market, I have to imagine this isn't the correct forum to expect such answers.

Enabling the general software community access to SOTA language models will absolutely unlock an order of magnitude more money (than 100mm) over time. At least for now their obvious strategy for capturing this value is providing these APIs to enable it, and I suspect they'll gladly host such versions for the orgs that don't have the capacity to fine tune / host their own LLMs.

Let me try to make the point clearer:

1. Investors expect Huggingface to extract more than $100M from the market. Otherwise they'd be called 'donors'.

2. If they openly publish models, then their APIs will be undercut by other providers who can take the published model and host it for cheaper. It would be cheaper for other companies because: they don't need to pay the cost of training the model, and they can specialize in simply hosting models.

3. Because of 2), Huggingface would need to avoid allowing other companies to host models, including internal APIs (because then providers would simply spin up to making hosting those internal APIs easy).

4) Because of 3), their policy of publishing trained models openly has to change.

So the question that the original poster was asking is: what Huggingface policies will change, given the need to make returns on this investment?

The original poster is likely thinking of OpenAI, which went down a similar route (starting training open models, took in a bunch of money, realized that openly publishing them wasn't sustainable, kept the models secret and created locked down APIs for accessing them).

> So if you want someone to answer precisely how they'll extract hundreds of millions of dollars from an emerging market, I have to imagine this isn't the correct forum to expect such answers.

This market isn't new; Google, AWS, OpenAI, etc. all have APIs they charge for. They also have services to host trained models for you. How will Huggingface make money without resorting to hiding its models?

"This market isn't new; Google, AWS, OpenAI, etc. all have APIs they charge for."

And if they were standalone businesses they'd be losing money, it's neither a big nor profitable market.

When the business model for a project is not 'really obvious' it's usually a bad sign.

AirBnB, Uber, Stripe etc. - 'how' they make money is obvious, it's intrinsic to the product.

I don't think OpenAI is a valid comparison. Huggingface's mission, unlike in the case of OpenAI, is not training models, but being the standard service for sharing them. The vast majority of models and datasets available at the Huggingface Hub have been provided by third-party companies or researchers. They aim to be the Github of ML models and data, not an AI-building startup.
Yep - "Huggingface Enterprise" just like there's "Github Enterprise" seems like a straightforward way to make money, at least to me? Does Microsoft make good money from Github Enterprise?
Hugging Face is selling CPU cycles. They're also letting you upload your own datasets that aren't "limited" like others. I'm not quite sure where you think their approach of "open models" is wrong, they still sell the CPU cycles.

The idea that restricting access to the data is the only way to profit is such an archaic way of thinking. Hugging Face, if they keep making a good user interface and a good front end, will very much be able to fill the niche it is designed for: people who can't afford a $10-20k rig to run a model but who need to run it for their backend project.

Also, it may be due to using HN, but when I think of "where can I run a model" or "get a dataset" I think Hugging Face. They are leveraging the democratization of the data.

Thanks for clarifying, I misunderstood what Huggingface's product was.

I see the niche. The risks are:

- the mid market is constantly churning; either players become too big and you can't meet their requirements or they go bankrupt. Customer acquisition becomes a pretty big expense.

- selling CPU cycles is a cutthroat business which competes pretty directly with AWS, Azure, and Google Cloud. Their edge will likely be ease of use, but at some scale, the larger providers will be able to undercut them hard.

- selling a solution for managing datasets and training models using cloud CPUs is a crowded market.

- not sure how trustworthy the company is with private datasets. Easier to trust an established vendor.

But it wouldn't be a startup if there weren't risks.

> So if you want someone to answer precisely how they'll extract hundreds of millions of dollars from an emerging market, I have to imagine this isn't the correct forum to expect such answers.

I don’t think the idea is to get a precise answer. I think the idea is to answer what their business model is. Like will they sell subscriptions? Or ads? Or patronage? Or enterprise support? Or conferences? Or what.

I don’t use huggingface, but am a little familiar with them and think they are a great group with great software. But since it’s OSS, I’m not sure how they would make such a huge amount as $100M. Not to mention that they probably need to make much more than that to have happy investors. So there’s probably a $1-2B plan for making money somewhere and knowing the general idea for their business model would be cool.

I’m a bit bitter over what happened with OpenAI, and many other great opensource projects that turned into crappy companies boxed into making way more than they naturally could make (eg, elastic).

It's expensive to hire NLP labor right now, and has been for awhile. Seems like one strategy could be: HF provides a cheaper & more scalable alternative to having to hire an in-house NLP team. Basically NLP becomes synonymous with HF.

And they amortize the cost of hiring their own NLP engineers by developing a few models/model-based services that lots of businesses would be willing to pay for. E.g. 'foundation models' for different verticals like healthcare etc. Then it'll also be a lot easier to either fully automate or at least scale up work that's specific to each paying customer (because fine-tuning should go much more quickly, just essentially be a hyperparameter tuning cycle in as many cases as they can get away with).

NLP engineers spend 10% of their time training models, and 90% preparing the dataset and learning about the specifics of the task. I don't think this scales like selling software.
It's a completely different thing to produce something of value and to get paid for it. There are plenty of examples. Immediately come to mind operating systems or compilers. Or a lot of completely underfunded but fundamental open source tools that everybody uses but nobody pays.
So this answer can only come from a place where you have no idea how business works.

There's a gaping difference between 'value creation' and 'value capture'.

Some products create incredible value for many parties, but don't have an easy way to capture value.

Some products create negative value for the system, but are oriented towards capturing a lot of money.

Wikipedia, Web Browsers a lot of Open Source libs. - examples of the kinds of things that can be invaluable, but whereupon it's difficult to capture value.