Hacker News new | ask | show | jobs
FineWeb2: Adapting Pre-Training Data Processing to Every Language (arxiv.org)
7 points by hynky 364 days ago