from https://github.com/E-xyza/Exonerate/blob/master/bench/report...
(I believe the author is significantly underestimating the pace of progress)
Specific numbers are at https://github.com/E-xyza/Exonerate/blob/master/bench/report.... GPT-4 does significantly better.