But only if we use some sort of attention optimization. For the quadratic attention algo it shouldn’t matter where the needle is, right?