| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Jensson 129 days ago
	You don't need a source for that, an LLM with such little data is barely able to form proper sentences.

1 comments

> an LLM with such little data

There is a mountain of data pre-1905. Certainly enough to train a decent 30B parameter model.

Now, digitizing & OCRing all of that data... THAT is a challenge.