Hacker News new | ask | show | jobs
Language Modeling, Part 2: Training Dynamics (connorjdavis.substack.com)
1 points by cjamsonhn 159 days ago