Hacker News
by dominotw 14 days ago
Do you mean pre-training? So 4.8 is just post-training of an old pretrained model?

Btw, where do they tell you how they trained the model?