Hacker News new | ask | show | jobs
by solresol 411 days ago
Trying to measure how well LLMs can make scientific hypotheses, and more generally, execute on the scientific process (as part of a pivot in my PhD).