Hacker News new | ask | show | jobs
by orbifold 1240 days ago
Visible light and near ultra-violet light has a wavelength ~400-800 nm, current gen transistors have a pitch of ~40 nm. This gets worse because scaling is actually either quadratic (2d) or cubic (future 3d integration). So we are talking about 100x to 1000x worse spatial scaling disadvantage at the moment. The only redeeming quality of light is wavelength multiplexing, but that is only useful for a subset of applications, like optical communication and (maybe) convolutions (see below).

Moreover even in a hypothetical scenario where we somehow were able to find materials applicable to smaller wavelength, the deBroglie wavelength of an electron is ~1000x smaller, than that of a Photon at the same energy. So in terms of integration density electrons will always have a 10^6 - 10^9 (2d - 3d) theoretical advantage over photons, which means that investment in electron based computation will have a much more likely eventual payoff.

Take for example https://www.nature.com/articles/s41586-020-03070-1, they have a bunch of projections for what they hope to achieve over time. The most fantastical figure they give is 50 Peta MAC / s, but this doesn't take into account the PCM programming time.

If you take a look at the supplementary material https://static-content.springer.com/esm/art%3A10.1038%2Fs415... it becomes clear that they currently have a much lower TOPS/Watt figure than current generation ML ASIC like the TPU and this neglects all the expensive experimental optical equipment they would need to miniaturise. So even in their most favourable comparison they are 5x worse. Most of these papers unfortunately are full of hype and claims like that.

1 comments

100x loss in density isn't actually necessarily a problem. Optical systems have potential for much higher clocks (since there's so much less heat), and the actual logic on a cpu is tiny. If you can get 50ghz clocks, you can lose a lot of density and still win out (I'd take 10x single core perf over 10 cores any day of the week).
Even optical computing needs lots of standard electronics for storage etc. as you point out these consume most of the area, so we are talking about something that consumes 100x more area for a function which is not the main energy expensive thing in traditional computers anyways, memory and communication make up at least 1/3 of the energy budget, which doesn't go away by making the rest optical.

In fact reprogramming the optical non-linearities typically is much slower and energy intensive than retrieving and flipping some bits. Which makes non-static non time-multiplexed computation extremely slow compared to whatever the "best" case static scenario is.