Hacker News new | ask | show | jobs
by 0xDEADFED5 698 days ago
Salesforce produced one of the best Llama-3(8B) finetunes, IMO: SFR-Iterative-DPO-LLaMA-3-8B-R

Hopefully they do something with Llama-3.1