|
|
|
|
|
by whimsicalism
551 days ago
|
|
> Afaik there is no commodity hardware that can run state of the art models like chatgpt-o1. Stack enough GPUs and any of them can run o1. Building a chip to infer LLMs is much easier than building a training chip. Just because one cost dwarfs another does not mean that this is where the most marginal value from developing a better chip will be, especially if other people are just doing it for you. Google gets a good model, inference providers will be begging to be able to run it on their platform, or to just sell google their chips - and as I said, inference chips are much easier. |
|