Hacker News
by
mgraczyk
379 days ago
Depending on your model, you may spend a lot of time trying to get it to work with Trainium.
1 comment
krembo
366 days ago
Why is that? Can you explain?
mgraczyk
366 days ago
The Trainium toolchain is not as mature as the GPU toolchain. Your model may fail to compile out of the box, and even if it does compile, it may be slow and require you to dig into the details to get reasonable training/inference performance.