Hacker News new | ask | show | jobs
The Stack v2 – dataset with 900B tokens of code (huggingface.co)
3 points by osanseviero 847 days ago