| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rspoerri 690 days ago
	192GByte of RAM are not enough to train 405B models. Reflection 70B requires 140GByte of RAM in fp16, 405 would need ~810Gbyte of RAM.

1 comments

Pretty sure he said he’s inferencing llama3 405 and training his own custom model from scratch. He didn’t say how big his custom model will be.