Hacker News new | ask | show | jobs
by pongogogo 281 days ago
The post mentions an approach of using a large model to generate labels and then distilling this into a smaller model to lower cost (though it doesn't provide an example)