Also as an aside I believe there's a current trend to generate chemical compounds by creating SMILES strings using BERT which is a cool way to incorporate language and chemistry (An example of a team doing that https://www.cell.com/iscience/fulltext/S2589-0042(21)00237-6)