| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by avidiax 59 days ago
	Die size increases cost exponentially, by decreasing chips per wafer and decreasing yield. I expect that this kind of burned-in model is also very difficult to verify (how do you know if some of the weights are off), and not amenable to partial disablement to increase yield. For CPUs, you just laser disable bad cores. Can't forego part of a neural net.

2 comments

robkop 58 days ago

You can ablate surprisingly large chunks of a model with near to no effect, you can try this easily - download an open weight model in torch.

Obviously it’s not ideal but you could likely have single digit % of all weights affected and still have a useful model (many caveats here: e.g. locality of damaged weights matters, distribution of errors matters, fail high/low matters, …)

link

hdndjsbbs 58 days ago

I mean, you probably can just turn off defective parts of the network. You better believe if this becomes popular they would salvage yields by selling "dumber" chips at a discount.

link

vrighter 58 days ago

except that if you do, you've just implemented a different model, with no way to tell which part of it is wrong

link

hdndjsbbs 51 days ago

Could you tell that the original model was "right"?

link