Hacker News
RedPajama-Data: Code for preparing large datasets (github.com)
2 points by harrisonpowers 1159 days ago