Hacker News new | ask | show | jobs
by siliconc0w 462 days ago
It likely makes sense to use more expensive frontier models as teachers or architects for smaller fine-tuned ones that generate the majority of tokens (though possibly against the ToS).