We were aware of Mathpix and we think it’s an awesome tool! We didn’t mention it because there aren’t any publicly available performance results for the model they use, so we weren’t able to compare it to ours quantitatively :)
The evaluation sets involve ~20k samples, and they only let you run about 100 for free. I’m also not sure there’s an API that would let me run it automatically at that scale.