skip to navigation skip to content
Wed 3 Jun 2020
11:00 - 12:30


Provided by: Cambridge Digital Humanities


Bookings cannot be made on this event (Event is cancelled).

Other dates:

No more events

Register interest
Register your interest - if you would be interested in additional dates being scheduled.

Booking / availability

Sources to Data

Wed 3 Jun 2020


We are currently reformatting our Learning programme for remote teaching; this will require some rescheduling so bookings will reopen and new sessions will be created for online courses as soon as possible. In the interim we would encourage you to register your interest so as to be notified of the new schedule. Please be aware that we hope to run many of our courses online, but that this is dependent on staff availability and resources so please be aware we may have to postpone or cancel some sessions

Archives typically hold records containing enormous quantities of data presented in a variety of scribal and print formats. Extracting this information has traditionally involved long hours of expensive manual data-entry work. Nowadays this work can be automated to a large degree and could soon open archives and allow for unprecedentedly large structured data sets for curators, researchers, and the public alike. This workshop will examine new methods for collecting historical data from manuscript and printed documents. We will look at archival photography, OCR, page structure recognition, and new handwritten text recognition systems. Cutting-edge Cambridge research in this field will be demonstrated.

Target audience

Post-graduate researchers and staff at the University of Cambridge


Number of sessions: 1

# Date Time Venue Trainer
1 Wed 3 Jun 2020   11:00 - 12:30 11:00 - 12:30 Cambridge Digital Humanities Online Dr Oliver Dunn

Presentation and group discussion

Machine Reading the Archive

Booking / availability