|
|
|
|
|
by leoedin
36 days ago
|
|
Yeah, I think the important part is the process to convert the model to silicon, not the actual implementation itself. Whether it succeeds now depends a lot on the rate of improvement of model architecture. They're betting on model design and capability improvements slowing down - and then wiping the floor with everyone else with their inference economics. |
|
Harnesses can keep improving with a fixed model and the throughput opens up new possibilities like doing 10x more "thinking" or exploring parallel paths and picking the best.