Computational Papyrology: Using Data Science to Understand Ancient Manuscripts
In the 1920s, archeologists from Oxford University discovered over 600 crates of documents buried outside the Egyptian city of Oxyrhynchus. Since they were found, less than 10% of these manuscripts have been analyzed and published. Using a combination of crowdsourcing and data science, they are developing the software infrastructure to speed up this process. Thus far, they have created algorithms to find the consensus of human transcriptions, developed a training/testing sample of characters, applied neural networks to automatically identify characters, and used sequence alignment to determine if the text belongs to the known literature of the ancient Greeks. The most difficult questions faced, involve the cases where the crowdsourced identifications disagree with the neural network’s classification. In this talk, Dr. John Wallin will review the project's status and talk about the role of ambiguity in our data set.
This seminar was held on February 1.2022 via Zoom. Click here to watch the program.
- CDS Seminar: "Environmental Data Science: Recent projects from the Data Science Institute" Feb 6, 2023
- CDS Seminar "The Broader Implications of AI in Basic and Applied Sciences: Featuring ChatGPT, GitHub Copilot, and DALL-E 2*" Jan 25, 2023
- A quick note on predictive modeling process May 10, 2022
- Computational Papyrology: Using Data Science to Understand Ancient Manuscripts Apr 27, 2022
- Getting a job after graduation Apr 27, 2022