Hacker News new | ask | show | jobs
Tips for LLM Pretraining and Evaluating Reward Models (sebastianraschka.com)
2 points by rasbt 813 days ago