Hacker News new | ask | show | jobs
Fine-tune Llama 2 with DPO (huggingface.co)
3 points by kashifr 1053 days ago