|
|
|
|
|
by tomrod
59 days ago
|
|
Does the cost scale linearly/superlinearly? What does the $300-$400 price data point tell us with relationship to the parameter density? No gotchas here. I genuinely don't know that 8B parameters is in a zone with significant decreasing marginal returns -- too far out of my knowledge area but genuinely curious. |
|
I expect that this kind of burned-in model is also very difficult to verify (how do you know if some of the weights are off), and not amenable to partial disablement to increase yield. For CPUs, you just laser disable bad cores. Can't forego part of a neural net.