|
|
|
|
|
by fleventynine
8 days ago
|
|
If local models are good enough, doesn't that increase demand for DRAM as everyone buys
DRAM for their poorly utilized local machines? Surely it is a more efficient use of DRAM to run inference on shared hardware with large batch sizes and more utilization. |
|