Hacker News new | ask | show | jobs
by Faint 1183 days ago
In that vein, how about stackoverflow? That should give at least straightforward ask-and-answer format, and there's plenty on material to work with.
1 comments

LlaMa was trained on 78 GB of StackExchange (I assume StackOverflow was included in that).
But was it parsed and reformatted specifically in the "chat format" (i.e. the same as inputs later fed to the model when used as a chatbot)? It can make a surprisingly big difference.