Hacker News new | ask | show | jobs
by parentheses 621 days ago
So many things here smell funny...

I have never heard of any models trained on this hardware. How does a company IPO on the basis of having the "best tech" in this industry, when all the top models are trained on other hardware.

It just doesn't add up.

3 comments

Plenty of companies IPO before releasing anything, or before building a large audience. That's how lots of things that requite a long lead time and large initial investment get made. It's just a bigger risk for the investors.

Tesla IPOed in 2010 after selling only a few hundred Roadsters.

Seems like they support training on a bunch of industry standard models. I think most of the customers in the training space tend to be for fine tuning right? The P and T in GPT stand for pre-trained - then you tune for your actual specification. I don't think they will take over the insane computational effort of training Llama or GPT from scratch - those companies are using clusters that cost more than Cerebras' last evaluation.
I thought they were fore inference not training...either way, kind of is concerning that I've heard about them plenty from the hype bubble but I apparently still don't really understand what they do.