Hacker News new | ask | show | jobs
by alwayslikethis 1201 days ago
Realistically the most difficult part of training a LLM is curating a good data set. OpenAI never published their training data.