Hacker News new | ask | show | jobs
by p1esk 296 days ago
They experimented with gpt-2 scale models. Hard to make any meaningful conclusions in the gpt-5 era.