Hacker News new | ask | show | jobs
by pessimizer 344 days ago
Really? I'd assume that an LLM would deduplicate Wikipedia into something much smaller than 25GB. That's its only job.
1 comments

> That's its only job.

The vast, vast majority of LLM knowledge is not found in Wikipedia. It is definitely not its only job.

When trained on next word prediction with the standard loss function, by definition it is it's only job.