|
|
|
|
|
by p1esk
2337 days ago
|
|
In the link I posted: tpu v2-8 has 64GB of total memory, v2-32 has 256GB. As for the beefy vm - can you do heavy data preprocessing on tpus? For example elastic distortions or scaling for images? Probably not, because usually it involves OpenCV or similar libraries. |
|
(If a TPUv2-8 has 64GB memory, how can it fine tune GPT-2 1.5B using Adam with batch size 4? That requires almost 300GB.)