| Shameless self promotion: I wrote one of the more cited papers in the field [0], back in 2016. A key challenge: very few labs have enough data. Something I view as a key insight: a lot of labs are doing absurdly labor intensive exploratory synthesis without clear hypotheses guiding their work. One of our more useful tasks turned out to be interactively helping scientists refine their experiments before running them. Another was helping scientists develop hypotheses for _why_ reactions were occuring, because they hadn't been able to build principled models that predicted which properties were predictive of reaction formation. Going all the way to synthesis is nice, but there's a lot of lower hanging fruit involved in making scientists more effective. [0] https://www.nature.com/articles/nature17439 |
Also shameless plug: I started a company to do just that, anchored to generating custom million-to-billion point datasets and using ML to interpret and design new experiments at scale.