|
|
|
|
|
by artninja1988
888 days ago
|
|
>To train AlphaGeometry's language model, the researchers had to create their own training data to compensate for the scarcity of existing geometric data. They generated nearly half a billion random geometric diagrams and fed them to the symbolic engine. This engine analyzed each diagram and produced statements about their properties. These statements were organized into 100 million synthetic proofs to train the language model. With all the bickering about copyright, could something similar be used for coding llms? Would kill the ip issues, at least for coding |
|
https://arxiv.org/abs/2207.14502