| At this rate they'll be manufacturing 0nm chips soon, and in a decade they'll be on -1nm. But despite the weird naming scheme, it's clear from transistor density [1] and GPU prices [2] that foundries are still making progress in transistors per dollar. That progress is just barely beginning to make large neural networks (Stable Diffusion, vision and speech systems, language model AIs) deployable in consumer applications. It might not matter whether your cell phone renders this page in 1ms or 10ms, but the difference between talking to a 20B parameter language model and a 200B net is night and day [3]. If TSMC/Samsung/Intel can squeeze out just one or two more nodes, then by the middle of the century we might have limited general-purpose AI in every home and office. [1] https://en.wikipedia.org/wiki/Transistor_count#GPUs [2] https://pcpartpicker.com/trends/price/video-card/ [3] https://textsynth.com/playground.html |
I understand roughly why this shift is happening (machine language proving to solve a whole raft of hard problems) and how it's happening (specialized chip designs for matrix math). But I don't understand where it's all going, or how I can plug into it.
It feels like a fundamentally different landscape than what I'm used to. There's more alchemy, perhaps. Or maybe it's that the truly important models are trained using tools and data that are out of reach for individuals.
Does anyone else feel this way? Better yet, has anyone felt this way and overcome it by getting up to speed in the ML world?