Hacker News
by sudosteph 2347 days ago
In case anyone else is interested, I found that the org that handles the papyri has a really cool write-up about its digitization process.

http://www.papyrology.ox.ac.uk/POxy/imaging/imaging.html

According to the article: Over the past century, just over 5,000 of the half-million Oxyrhynchus papyri have been published.

So between the large data set and scanning process, I'm hopeful that all of these (and other) ancient manuscripts will be shared publicly. I love imagining all the potential studies we can do with proper machine learning once we have the data set.

1 comment

The system is run by a "power Macintosh G4". It's not clear when that web page was written, but the G4 was introduced in 1999 and phased out in 2003.