| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by JimDabell 387 days ago

There’s a related paper that Meta published a couple of days ago that is worth looking at:

> How much do language models memorize?

— https://arxiv.org/abs/2505.24832

— https://news.ycombinator.com/item?id=44171363

It shows that models are limited in how much they can memorise (~3.6 bits per parameter), and once that threshold is reached, the model starts to generalise instead of memorise.