| HN Mirror


	by ekianjo 978 days ago
	A Roman-era GPT would be fantastic.

Dorialexander 978 days ago

Already in the works. Just had a meeting today with two Latinists about it.

LightMachine 978 days ago

Why don't you restrict the training to old texts only? That would be awesome.

Dorialexander 978 days ago

Not feasible to go with pretraining only.

What is possible is to use a larger learning rate, but that is a hard trade-off against conversational ability. Fine-tuning is currently based on original texts paired with a synthetic prompt. The issues people have noticed (repetitions, not remembering what was in the prompt) will become more pronounced if the learning rate is higher.

Maybe a solution will be to provide two variants of the same model: one less immersive and more workable, the other more immersive and buggier.

isoprophlex 978 days ago

I'm not sure you'd be able to scrape together enough data for that to be viable...

iopq 978 days ago

https://en.wikipedia.org/wiki/Corpus_Inscriptionum_Latinarum

approximately 180,000 inscriptions

xcv123 978 days ago

That only contains a few million tokens. Useless for pre-training an LLM from scratch. You would need to find billions of tokens.
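
For a rough sense of scale, here is a back-of-envelope sketch of that estimate. Only the 180,000 inscription count comes from the thread; the words-per-inscription and tokens-per-word figures are illustrative assumptions, not measurements, and the one-billion-token target simply stands in for "billions of tokens".

```python
# Back-of-envelope token estimate for the Corpus Inscriptionum Latinarum.
INSCRIPTIONS = 180_000        # figure quoted in the thread
WORDS_PER_INSCRIPTION = 15    # ASSUMPTION: most inscriptions are short
TOKENS_PER_WORD = 2.0         # ASSUMPTION: Latin splits into several subword tokens
PRETRAINING_TARGET = 1_000_000_000  # ASSUMPTION: lower bound for "billions of tokens"


def estimate_tokens(n_docs: int, words_per_doc: float, tokens_per_word: float) -> int:
    """Multiply document count by average length to get a rough token count."""
    return int(n_docs * words_per_doc * tokens_per_word)


corpus_tokens = estimate_tokens(INSCRIPTIONS, WORDS_PER_INSCRIPTION, TOKENS_PER_WORD)
shortfall = PRETRAINING_TARGET / corpus_tokens  # how many times too small the corpus is

print(f"Estimated corpus size: {corpus_tokens:,} tokens")
print(f"Short of the target by a factor of about {shortfall:.0f}")
```

Under these assumptions the corpus lands in the single-digit millions of tokens, a couple of orders of magnitude below the target. Changing the assumed averages moves the number, but not by enough to close that gap.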

iopq 978 days ago

It should work like this 1700s English model: probably trained on modern data to start, and then fitted to the smaller data set at the end.

ekianjo 978 days ago

Awesome!!
