Hacker News
Recurrence-Duplication: Deterministic Parallelisation of Non-Affine Scalar Loops (deviantabstraction.com)
1 point by top256 376 days ago
1 comment

TL;DR: A loop that carries any pure scalar state can be strip-mined across p threads by having each thread privately replay ≤ p(p-1)/2 “warm-up” updates before its first public iteration. The scheme needs no closed-form skip-ahead and no speculation, and it costs only a few extra machine instructions in code-gen.
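
To make the idea concrete, here is a minimal Python sketch of one possible reading of the TL;DR. It is not taken from the linked article. It assumes a cyclic (round-robin) split of iterations, so thread t replays t warm-up updates before its first public iteration (at most p-1 per thread, within the bound quoted above) and then privately replays the p-1 updates that belong to other threads between its own iterations. The names `step`, `worker`, `sequential` and `parallel`, the recurrence itself, and the loop body `s ^ i` are all hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

MOD = 2**31 - 1


def step(s):
    # Hypothetical non-affine, pure scalar update (squaring makes it non-affine).
    return (s * s + 12345) % MOD


def sequential(seed, n):
    # Reference loop: the scalar s is carried from one iteration to the next.
    out, s = [0] * n, seed
    for i in range(n):
        s = step(s)
        out[i] = s ^ i  # stand-in for the real loop body
    return out


def worker(t, p, seed, n, out):
    # Private warm-up: replay the t updates that precede iteration t.
    s = seed
    for _ in range(t):
        s = step(s)
    # Public iterations t, t+p, t+2p, ...
    for i in range(t, n, p):
        s = step(s)
        out[i] = s ^ i
        # Privately replay the updates owned by the other p-1 threads.
        for _ in range(p - 1):
            s = step(s)


def parallel(seed, n, p):
    out = [0] * n  # each index is written by exactly one worker
    with ThreadPoolExecutor(max_workers=p) as pool:
        futures = [pool.submit(worker, t, p, seed, n, out) for t in range(p)]
        for f in futures:
            f.result()  # re-raise any worker exception
    return out
```

In this sketch every thread recomputes the whole scalar recurrence, so only the loop body is actually divided among threads. The result is identical to the sequential loop for any p, with no shared state and no speculation. CPython threads will not show a speed-up here because of the global interpreter lock; the sketch only illustrates the determinism argument.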