|
Not to be a luddite, but large language models are fundamentally not meant for tasks of this nature. And listen to this: > Most notably, it provides confidence levels in its findings, which Cheeseman emphasizes is crucial. These 'confidence levels' are suspect. You can ask Claude today, "What is your confidence in __" and it will, unsurprisingly, give a 'confidence interval'. I'd like to better understand the system implemented by Cheeseman. Otherwise I find the whole thing, heh, cheesy! |
When asked about their confidence, these things are almost entirely useless. If the Magic Disruption Box is incapabele of knowing whether or not it read "42/A" correctly, I'm not convinced it's gonna revolutionize science by doing autonomous research.