Hacker News new | ask | show | jobs
by jedharris 253 days ago
"Our results show that simple algorithmic improvements can enable significantly more data-efficient pre-training in a compute-rich future."