Hacker News new | ask | show | jobs
by sp332 1029 days ago
It does say this: Note that all Code Llama models were initialized with Llama 2 weights before they were further trained on code.