Hacker News new | ask | show | jobs
by samatdav 531 days ago
Thank you for the idea! We are also considering upsampling and distillation. But on high level, correctly setting up the data for simple fine-tuning can already produce great results.