Hacker News new | ask | show | jobs
by mgraczyk 379 days ago
Depending on your model, you may spend a lot of time trying to get it to work with Trainium
1 comments

Why is that? Can you explain?
The trainium toolchain is not as mature as GPU. Your model may fail to compile out of the box, and even if it does it may be slow and require you to dig into details for reasonable training/inference performance