Hacker News
by swordsmith 176 days ago
Seems very oriented toward model architecture and inference engineering. Maybe add some more on model training flow, distillation, data generation, SFT and RL techniques?