Hacker News new | ask | show | jobs
by malux85 1014 days ago
I am in computational chemistry which has some crossover with Materials Science. AI is useful for speeding up simulations, doing molecular interaction predictions, doing property predictions, doing molecular docking. LLMs are helpful as research assistants and helping design and run experiments. https://atomictessellator.com/

For example, my latest module (I am putting the finishing touches on this as we speak) uses LLMs to review catalysis literature and then summarize that and control another coding LLM that has been trained to run the simulation tools I created to try and reproduce the works in the papers. Yes, it works, the first catalyst discoveries were made just a few days ago.

2 comments

Kind of mind blowing. A light in the tunnel for the reproduction crisis?

What specifically have you trained your coding LLM on? Is it lora or something more advanced? Have you created a corpus by hand specifically for training?

Yes, created by hand, lots of techniques are required to get a well running system, GoT (graph of thought) RAG (retrieval augmented generation), Ko detection, dynamic problem decomposition, and a few more techniques I have invented but dont really have names for. Its also quite a bit more complicated than the simplistic answer I gave before because you have to do things like experiments in abstraction laddering to get good interface composability.

This space is moving so quick I run many experiments every day.

I would love to work on this full time, I applied to y-combinator but I didn't get in :(

Random question for you: How feasible is it for someone with a software/ML background to get into this space? I'd like to get into learning a new domain through ML, but it feels very daunting.
It sounds cliche but the old advice “do something you’re passionate about” applies. I have loved chemistry since I was a teenager because I had an inspiring teacher, without this passion I would have given up when I got to the very hard parts of math.

Personally I had to level up my math game a lot, when I started I had to keep pausing looking up terms, then write tiny prototypes, then keep learning. At the start it took me a month to read and understand a paper, now I can skim read them in minutes, without a passion for chemistry and the self discipline to keep at it I would have given up.

Take extensive notes, actually review them frequently, and refractor them as you learn.

Find mentors in the space, while reading papers I took names and reached out, I offered free programming to anyone who I found inspiring, and this built relationships with some pretty amazing professors and students around the world.

Hope that helps! <3

That’s too bad, did YC give you a reason?
YC doesn't give reasons.
Are you reproducing the results of other simulation papers by running simulations, or results from other experimental papers?

Is interpreting the other papers and translating them into simulation parameters a rate limiting step in catalysis research? Or is this like, fitting your simulation package’s parameters to get the same output as someone else’s?