Hacker News new | ask | show | jobs
by _0ffh 621 days ago
The trick is to make sure the recursive dependency stays linear, that's how you enable parallel training.