| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by yarri 401 days ago
	Not sure what “official” means but would direct you to the GCP MaxText [0] framework which is not what this GDM paper is referring to but rather this repo contains various attention implementations in MaxText/layers/attentions.py [0] https://github.com/AI-Hypercomputer/maxtext