Hacker News new | ask | show | jobs
by dchest 1183 days ago
LlaMa was trained on 78 GB of StackExchange (I assume StackOverflow was included in that).
1 comments

But was it parsed and reformatted specifically in the "chat format" (i.e. the same as inputs later fed to the model when used as a chatbot)? It can make a surprisingly big difference.