


	by atgctg 1068 days ago
	As an example, INT8 support in WebGPU would enable running quantized models, which need less memory per weight and so would allow larger LLMs to run locally in the browser. See the Limitations section here: https://fleetwood.dev/posts/running-llms-in-the-browser
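
	To make the memory argument concrete, here is a rough sketch of symmetric per-tensor INT8 quantization in plain Python. It is only an illustration of the general idea, not what the linked post or any WebGPU implementation does. The helper names (`quantize_int8`, `dequantize_int8`, `weight_bytes`) and the 7-billion-parameter model size are made up for the example.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # One shared scale for the whole tensor; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize_int8(quantized, scale):
    """Recover approximate float weights from the INT8 values."""
    return [q * scale for q in quantized]


def weight_bytes(num_params, bits_per_weight):
    """Storage needed for the weights alone (ignores activations and scales)."""
    return num_params * bits_per_weight // 8


weights = [0.5, -1.0, 0.25, 0.0]
quantized, scale = quantize_int8(weights)
restored = dequantize_int8(quantized, scale)

# Hypothetical model size, chosen only to show the 4x ratio.
num_params = 7_000_000_000
fp32_bytes = weight_bytes(num_params, 32)
int8_bytes = weight_bytes(num_params, 8)
print(f"fp32: {fp32_bytes / 1e9:.0f} GB, int8: {int8_bytes / 1e9:.0f} GB")
```

	Each weight goes from 4 bytes to 1 byte, so the same memory budget holds roughly four times as many parameters, at the cost of a rounding error of at most half the scale per weight.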