| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by holtkam2 37 days ago
	I wish I could upvote this twice. We (devs) really REALLY need to consider on-device compute before going to the cloud for LLM inference.