Flash attention, which is widely used, is no longer parallel. The attention matrix is solved batch by batch.