by dr_dshiv 270 days ago
This is really beautiful content. I’m assuming it comes from the fact that there are Google teams tasked with digitizing old manuscripts?

I work with a library (Biblioteca Philosophica Hermetica) in Amsterdam that has thousands of manuscripts from the Renaissance to the early modern period… all very esoteric. We really want to get the Renaissance into model training! Over 75% of books (1450-1700) are unscanned, and the manuscripts are in even worse shape.

Curious if anyone knows whether there are any new handwriting recognition benchmarks? I’ve noticed the main model providers have plateaued in the past year in their ability to read manuscripts / modern handwriting… I think the lack of well-designed competitive benchmarks is the issue…

I love positive examples of the intersection of AI and the humanities.