by itronitron 609 days ago
So, in conclusion, the 'math' in the training data that LLMs have access to is predominantly written as software code rather than as mathematical notation.
1 comment

It would be quite exciting to train an LLM on (OCR scans of) all mathematical journals, preprints, time series, etc.