Hacker News new | ask | show | jobs
by AIsore 768 days ago
My experience - many are far off and most of the time published tables of different papers are hard to compare. If you make the assertion here of these results to be flawed, I would like to see more substance (code, reproduction,...). And for balance, for the same reason, hard to verify the accuracy of these results without further insight.
1 comments

So many papers play tricks with the learning rate schedule: https://arxiv.org/abs/2307.06440