|
|
|
|
|
by rightbyte
1280 days ago
|
|
How much disk space does 175B parameters use? A float or half precision float per parameter or does it need pointers to connections too? Given how responses are generated in seconds and for free I am fairly sure it could run on a desktop computer. |
|
OpenAI generates responses so fast by doing the generation in parallel across something like 8x80GB A100s (I don't know the exact details of their hardware setup, but NVIDIA's open FasterTransformer library achieves low latency for large models this way).