by jdietrich
514 days ago
We need GPUs for inference, not just training. The Jevons Paradox suggests that reducing the cost per token will increase the overall demand for inference. Also, everything we know about LLMs points to an entirely predictable correlation between training compute and performance.