Hacker News new | ask | show | jobs
BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-Scale Pretraining (arxiv.org)
4 points by circuithunter 294 days ago