|
|
|
|
|
by superpope99
1136 days ago
|
|
I'm always curious about the cost of these training runs. Some back of the envelope calculations: > Overall we reach a throughput of over 1900 tokens / second / TPU-v4 chip in our training run 1 trillion / 1900 = 526315789 chip seconds ~= 150000 chip hours. Assuming "on-demand" pricing [1] that's about $500,000 training cost. [1] https://cloud.google.com/tpu/pricing |
|
Considering I could negotiate A100 for under a dollar/hr - 8 months ago, when they were in high demand, I wouldn't be surprised if the cost was close to 100k for this training run.