Hacker News new | ask | show | jobs
by tyre 1023 days ago
They said 7b llama which I read as the base LLaMa model, not this one specifically. All of these LLMs are trained on Stack Overflow so it makes sense that they’d be good out of the box.
1 comments

The top level comment is specifically citing performance of code llama against codex.