by londons_explore
1247 days ago
How big is your hoard of GPUs? I'm gonna guess that a single instance of ChatGPT is ~2400GB of weights, so you're gonna need 105 RTX 4090s costing a whopping $210k to run the thing... And swapping data between SSD and video RAM to run it on smaller, cheaper hardware probably isn't gonna be possible, because transformer architectures typically need to read every weight once for each token emitted, so even the fastest SSDs would be too slow. And if you have 105 RTX 4090s, you're probably gonna run afoul of Nvidia's 'no datacenter use' terms. So you'll have to splash out for GPU models allowed in datacenters, which are 5x the price or so for the same amount of compute.
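For anyone who wants to poke at the numbers, here is a rough back-of-envelope sketch. The 2400GB figure is the guess from the comment above, 24GB is the RTX 4090's VRAM, and the 7 GB/s SSD read speed is my own assumption for a fast consumer NVMe drive, not a measured value. The function names are made up for illustration. Plain division gives about 100 cards, so the 105 above presumably leaves a little headroom, and streaming from SSD works out to several minutes per token.

```python
import math

def gpus_needed(weights_gb, vram_per_gpu_gb):
    """Minimum number of cards whose combined VRAM holds all the weights."""
    return math.ceil(weights_gb / vram_per_gpu_gb)

def seconds_per_token_from_ssd(weights_gb, ssd_gb_per_s):
    """Lower bound on time per token if every weight is read from SSD once per token."""
    return weights_gb / ssd_gb_per_s

WEIGHTS_GB = 2400      # guess from the comment, not a confirmed figure
VRAM_GB = 24           # RTX 4090 memory
SSD_GB_PER_S = 7.0     # ASSUMPTION: sequential read speed of a fast NVMe SSD

print("GPUs needed:", gpus_needed(WEIGHTS_GB, VRAM_GB))
print("Seconds per token from SSD:",
      round(seconds_per_token_from_ssd(WEIGHTS_GB, SSD_GB_PER_S)))
```

This ignores activations, KV cache, and any overlap of compute with I/O, so treat it as an order-of-magnitude estimate only.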
|
Pretty sure it is just the GPT-3 DaVinci model, which is 175B parameters, so approximately 700GB at 32-bit precision, or about half that at half precision.
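The arithmetic behind that estimate, as a quick sketch: weight memory is just parameter count times bytes per parameter. The 175B figure is from the comment; the helper name is hypothetical, and this counts weights only, not activations or any runtime overhead.

```python
def weight_memory_gb(num_params, bytes_per_param):
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9

PARAMS = 175_000_000_000  # 175B parameters, per the comment

fp32_gb = weight_memory_gb(PARAMS, 4)  # 32-bit floats: 4 bytes each
fp16_gb = weight_memory_gb(PARAMS, 2)  # half precision: 2 bytes each

print(f"fp32: {fp32_gb:.0f} GB, fp16: {fp16_gb:.0f} GB")
```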