Hacker News
FineWeb2 dataset: A sparkling update with 1000s of languages (huggingface.co)
2 points by hynky 565 days ago