Hacker News new | ask | show | jobs
by binarymax 1713 days ago
Yes, verbatim from the paper: "Moreover, training with PET can be performed in several hours on a single GPU without requiring expensive hyperparameter optimization."
1 comments

Nice. Still reading it.