So, I'm not really a mathematician, but the first 3-8 pages reads like nonsense and a bunch of unrelated facts. A bit surreal may be, but if this the norm for this kind of thing, I'm amazed it arrives at any useful result at all.
It didn't seem like nonsense to me. (Recently graduated undergrad with a math degree; probably could have gone to PhD). It seemed like the AI was cycling through a bunch of different possible approaches to tackle the problem. Eventually it finds one and makes more progress there until it reaches the solution
I'm disappointed only that the chain of thought needed to be rewritten. Need to train these LLMs to natively communicate in LaTeX research paper format.
I believe they rewrite the chain of thought to protect their IP, i.e. the chain of thought reveals information about how the model works in a manner that may aid replication