We knew that LolCDE was a vulnerability to e coli since well before 2016 and knew inhibitors of the complex, globomycin being one of them, which they knew about since 1978
From what I understand they used a diffusion model (diffdock) to predict the mechanism. These types of models are not LLMs that need to be trained on text