| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by gsliepen 1842 days ago
	The main trick GPUs use is having a massive amount of hardware threads per actual core. If an instruction in one thread is stalled on a load operation, the core will just switch to another thread. If you have more runnable threads at all times than your memory latency in cycles, the latency will not affect your throughput anymore.