Hacker News new | ask | show | jobs
by apaszke 3197 days ago
It's not like you have to give up a lot - the graphs are simple data structures and creating them is not the expensive part of the training. The computation has to be re-done at every step in a static framework too, and this is the part that matters.