Hacker News new | ask | show | jobs
by krembo 373 days ago
How does that compare to spinning up some ec2s with amazon trainium gpus?
1 comments

Depending on your model, you may spend a lot of time trying to get it to work with Trainium
Why is that? Can you explain?
The trainium toolchain is not as mature as GPU. Your model may fail to compile out of the box, and even if it does it may be slow and require you to dig into details for reasonable training/inference performance