|
|
|
|
|
by data_maan
129 days ago
|
|
On the website https://1stproof.org/#about they claim: "This project represents our preliminary efforts to develop an objective and realistic methodology for assessing the capabilities of AI systems to autonomously solve research-level math questions." Sounds to me to be a benchmark in all but a name. And they failed pretty terribly at achieving what they set out to do. |
|
Why the angst ? If the ai can autonomously solve these problems, isnt that a huge step forward for the field.