Hacker News new | ask | show | jobs
by harisec 441 days ago
Yes, during training multiple checkpoints are created, you can distill from any checkpoint you want.