| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by m12k 84 days ago
	So we've basically taken the concept of branch prediction from CPUs and applied it to LLMs?

4 comments

c7b 84 days ago

The concept of predicting future elements in a series is not specific to CS. It's older than computers.

link

kpw94 84 days ago

Speculative execution techniques in software & hardware exist everywhere,

- Speculative multi threading

- Data Value Speculation

- Speculative Memory Disambiguation

- Runahead Execution

- Speculative Prefetching

- Multi-path (Dual-path) Execution (goes beyond branch prediction by computing both paths)

- Optimistic Concurrency Control (for database transactions etc)

link

mike_hearn 84 days ago

Maybe at very high level of abstraction, but there's no branching involved.

link

lossolo 84 days ago

Well, there are multiple token proposals processed in parallel, from which only one is picked, seems like branching to me. The only difference is that in case of CPU there is always only one possible branch that is correct.

link

monster_truck 84 days ago

Well, not exactly, but that was the dream we were sold (here be dragons)

link

fragmede 84 days ago

Well, the TPUs they're running on don't have branch prediction, so that had to end up somewhere in the stack.

link