| HN Mirror

	by cec 1045 days ago
	We use the same architecture as other LLMs, but we include no natural language in our pretraining. We figured a single-domain training corpus would make evaluation easier. We’ll be looking at layering this on top of something like Code Llama next.