Hacker News
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning (arxiviq.substack.com)
2 points by che_shr_cat 250 days ago