| HN Mirror



	by sharmajai 832 days ago
	Maybe not everything should be about business.

8 comments

smith7018 832 days ago

Agreed but this isn't the same as an open source library; it costs A LOT of money to constantly train these models. That money has to come from somewhere, unfortunately.


TehCorwiz 832 days ago

Yeah. The amount of compute required is pretty high. I wonder, is there enough distributed compute available to bootstrap a truly open model through a system like seti@home or folding@home?


Filligree 832 days ago

The compute exists, but we'd need some conceptual breakthroughs to make DNN training over high-latency internet links make sense.


altruios 832 days ago

Distributing the training data also opens up vectors of attack. Poisoning or biasing the dataset distributed to each computer needs to be guarded against... but I don't think that's actually possible in a distributed model (in principle?). If the compute is happening off-server, then trust is required (which is not {efficiently} enforceable?).


TehCorwiz 832 days ago

Trust is kinda a solved problem in distributed computing. The different "@Home" projects and Bitcoin handle this by requiring multiple validations of a block of work for just this reason.
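
As a rough illustration of the "multiple validations" idea, here is a minimal sketch in which the same work unit is handed to several volunteers and a result is accepted only if enough of them report the same value. The function name `validate_work_unit` and the `quorum` parameter are made up for this example; this is not the actual API of any "@Home" project or of Bitcoin.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def validate_work_unit(results: Sequence[Hashable], quorum: int = 2) -> Optional[Hashable]:
    """Return the result reported by at least `quorum` workers, or None.

    Hypothetical helper: `results` holds one reported value (for example a
    hash of the output) per worker that processed the same work unit.
    """
    if not results:
        return None
    # most_common(1) yields the value with the highest count; ties go to
    # whichever value was seen first.
    value, count = Counter(results).most_common(1)[0]
    return value if count >= quorum else None


if __name__ == "__main__":
    print(validate_work_unit(["abc123", "abc123", "ffff00"]))
    print(validate_work_unit(["abc123", "ffff00"]))
```

This only works when honest workers produce identical results, which is the assumption that gets harder with non-deterministic training.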


altruios 832 days ago

How do you verify the work of training without redoing the exact same work for training? (That's the neat part: you don't)

Bitcoin is trust-solved because of how each new block depends on previous blocks. With training data, there is no such verification (prompt/answer pairs do not depend at all on other prompt/answer pairs) (if there were, we wouldn't need to do the work of training on the data in the first place).

You can rely on multiplying the work and ignoring gross variations (as you suggest), but that takes a lot more compute overhead and is still susceptible to bad actors (though much more resistant).
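
One way to picture "multiplying the work and ignoring gross variations" is a coordinate-wise median over updates computed redundantly by several workers. This is only a sketch under assumptions: the name `robust_aggregate` is invented here, each update is a plain list of floats, and the median is one possible robust statistic, not something proposed in this thread.

```python
from statistics import median
from typing import List, Sequence


def robust_aggregate(updates: Sequence[Sequence[float]]) -> List[float]:
    """Combine redundant updates by taking the median of each coordinate.

    Hypothetical helper: every entry of `updates` is one worker's update
    vector for the same batch, and all vectors have the same length.
    """
    if not updates:
        raise ValueError("need at least one update")
    # zip(*updates) groups the i-th coordinate of every worker together.
    return [median(column) for column in zip(*updates)]


if __name__ == "__main__":
    updates = [
        [1.0, 2.0],
        [1.1, 2.1],
        [0.9, 1.9],
        [1.0, 2.0],
        [100.0, -100.0],  # a grossly deviating (possibly malicious) worker
    ]
    print(robust_aggregate(updates))  # [1.0, 2.0]
```

The outlier is ignored as long as fewer than half the workers misbehave, but every batch is computed several times, which is the overhead mentioned above.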

There is no solid/good solution - afaik - for distributed training of an AI (Open Assistant, I think, is working on open training data?). If there is, I'll sign up.


pksebben 832 days ago

Forward-Forward looked promising, but then Hinton got the AI-Doomer heebie-jeebies and bailed. Perhaps someone picks up the concept and runs with it - I'd love to myself but I don't have the skillz to build stuff at that depth, yet.


TehCorwiz 832 days ago

I agree, but Y-Combinator literally only exists to squeeze the most bizness out of young smart people. That's why you're not seeing so much agreement.


phkahler 832 days ago

>> but Y-Combinator literally only exists to squeeze the most bizness out of young smart people.

YC started out with the intent to give young smart people a shot at starting a business. IMHO it has shifted significantly over the years to more what you say. We see ads now seeking a "founding engineer" for YC startups, but it used to be the founders were engineers.


te_chris 832 days ago

Squeezed all the alpha out of the idealists, now it’s the business guys' turn.


bufferoverflow 832 days ago

If you agree, do you mind paying a few hundred thousand for my neural net training expenses?


mvkel 832 days ago

The choice facing many companies that insist on remaining "open" is:

Do you want to:

1. be right

2. stay in business

This is one of the reasons why OpenAI pivoted to be closed. Not bc of greedy value extractors; because it was the only way to survive.


bufferoverflow 832 days ago

Training these big models is very very expensive. If they don't make money, and they run out of their own money, there will be no more SDXL.


sandworm101 832 days ago

>> Training these big models is very very expensive.

Which is why they are not the future. A big model that can generate a picture about anything in response to any input makes for a great website. It generates lots of press. But it is not a reasonable tool for content generation. If you want to produce content in a specific area or genre, the best results come from a model trained or modified in that area. So the big generalized AI, if you use it, would only be the framework on which you build your specialized tool. Building that specialized tool, such as something dedicated to images of a particular politician, does not require huge amounts of computation. That sort of thing can be, and is being, done by individuals.

I am waiting for a tool trained on publicly-accessible mugshots. It wouldn't be a very big project but could yield a tool to generate very believable mugshots of politicians.


bufferoverflow 832 days ago

I think it's unreasonable to expect a model for every possible use case. You would need billions of models, if not trillions.

Big generalist models are the future.


mikkom 832 days ago

That was basically why OpenAI was founded.

Too bad they decided to get greedy :-(


probablynish 832 days ago

Most individuals like being able to acquire more goods and services. A lot follows from there


kelseyfrog 832 days ago

You're right, a lot follows from there. But I'm so tired of being a consumer. I just want to be me for a change. I'm so, so tired.


probablynish 832 days ago

Depending on your background and circumstances, there are ways to opt out of the race to a greater/lesser degree. Moving to a cheaper city in your country, or a cheaper country altogether, is one of them. Finding a less stressful way of making less money is another.

I don't know you but I hope things work out :)


kelseyfrog 832 days ago

Thank you, appreciate it.

It's just hard being reminded that there's no escape hatch - we've welded them all shut for eternity. Being reduced to choices within a system, while the choice horizon never extends to the system itself and won't within my lifetime, makes me feel trapped.


natebc 832 days ago

well, know that you're not alone in that feeling.


ben_w 832 days ago

Great, but aren't they simultaneously losing money and getting sued?


baq 832 days ago

Maybe. Paychecks help with not being hungry, though.

I’d be happy if my government or EU or whatever offered cash grants for open research and open weights in AI space.

The problem is, everyone wants to be a billionaire over there and it’s getting crowded.
