I don't think anyone has pretrained a backwards model at anywhere close to SOTA size.
We are continually adding more benchmarks to the paper with UT Austin.