Hacker News new | ask | show | jobs
by sadboi31 546 days ago
https://huggingface.co/datasets/TIGER-Lab/MathInstruct

Works with +700 year old books w some tweaks. took like $400 to train. can't share more because i don't know more.

1 comments

That seems to be just for LLMs, not visual. I'm wanting to go from images of maths notation (photos, scans, digital handwriting) to formulas in Latex or MathML or something. Qwen2-VL can do it, but it's pretty heavyweight for just that.