Hacker News new | ask | show | jobs
by jakobov 728 days ago
How much faster (in terms of the number of iterations to a given performance) is training from distillation?