Hacker News new | ask | show | jobs
by ilikebigcutts 64 days ago
This is cool. Does it consider frontier VLMs?
1 comments

Yes, it evaluate using frontier model for parsing from all 3 major provider (google, anthropic and openai). It is also easy to extend to evaluaye new model (code/dataset is available)