| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by imtringued 800 days ago
	The problem is that you need two GPUs and the AI one can't be from AMD. We aren't 15 years away. More like two or three. NPUs are coming and DDR6 plus quad channel memory would get you decent performance on small LLMs like llama3. You're also forgetting that batch performance is already an order of magnitude better than single session inference.