HN Mirror


by parentheses 668 days ago

So many things here smell funny...

I have never heard of any models trained on this hardware. How does a company IPO on the basis of having the "best tech" in this industry when all the top models are trained on other hardware?

It just doesn't add up.

3 comments

ClassyJacket 668 days ago

Plenty of companies IPO before releasing anything, or before building a large audience. That's how lots of things that require a long lead time and a large initial investment get made. It's just a bigger risk for the investors.

Tesla IPOed in 2010 after selling only a few hundred Roadsters.


txyx303 667 days ago

Seems like they support training on a bunch of industry-standard models. I think most of the customers in the training space tend to be there for fine-tuning, right? The P in GPT stands for pre-trained - then you tune for your actual specification. I don't think they will take over the insane computational effort of training Llama or GPT from scratch - those companies are using clusters that cost more than Cerebras' last valuation.


cootsnuck 668 days ago

I thought they were for inference, not training... Either way, it's kind of concerning that I've heard plenty about them from the hype bubble but apparently still don't really understand what they do.
