Hacker News
by itemize123 48 days ago
Actually, distillation doesn't require the teacher's weights. You basically just need a black-box teacher model whose outputs you can query.
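To make that concrete, here's a rough sketch of black-box distillation, not anyone's actual pipeline. Everything in it is made up for illustration: `teacher` is a hypothetical stand-in for a model you can only call (think of an API endpoint), and the student is a tiny logistic model. The point is that `distill` only ever sees the teacher's outputs, never its parameters.

```python
import math
import random


def teacher(x):
    """Hypothetical black-box teacher: callers only see the returned probability."""
    return 1.0 / (1.0 + math.exp(-(3.0 * x - 1.0)))


def student_predict(w, b, x):
    """Student model: a one-feature logistic regression."""
    return 1.0 / (1.0 + math.exp(-(w * x + b)))


def distill(query, n_samples=200, epochs=1000, lr=1.0, seed=0):
    """Fit the student to soft labels obtained purely by querying the teacher."""
    rng = random.Random(seed)
    xs = [rng.uniform(-2.0, 2.0) for _ in range(n_samples)]
    soft_labels = [query(x) for x in xs]  # the only access to the teacher
    w, b = 0.0, 0.0
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, target in zip(xs, soft_labels):
            p = student_predict(w, b, x)
            # Gradient of cross-entropy against the teacher's soft label.
            grad_w += (p - target) * x
            grad_b += p - target
        w -= lr * grad_w / n_samples
        b -= lr * grad_b / n_samples
    return w, b


w, b = distill(teacher)
```

After training, `student_predict(w, b, x)` should track `teacher(x)` closely on the sampled range, even though the student never touched the teacher's internals. Real setups differ mostly in scale: the queries are prompts, the soft labels are generated text or probabilities, and the student is a smaller network.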