CDH Basics: Bulk data capture New
This CDH Basics session investigates three different methods for accessing digital data ‘in bulk’: using an API (Application Programme Interface), web scraping and direct access (via download or on a hard drive). We will explore the importance of good practice in documenting the provenance of data that others have created and discuss the practical steps in research data management essential to ensuring that you are able to make legal and ethical use of this type of data in your research. No knowledge of programming languages is required, however, there will be a demonstration of a Python web scraper during the session and references to more in-depth tutorials on web scraping will be provided.
CDH Basics sessions are open to staff and graduate students who want to learn and apply digital methods and use digital tools in their research.
Number of sessions: 1
# | Date | Time | Venue | Trainers |
---|---|---|---|---|
1 | Tue 8 Feb 2022 10:00 - 11:00 | 10:00 - 11:00 | Cambridge Digital Humanities Online | Dr Anne Alexander, H.E. Jones |
This session will be delivered using Zoom so please ensure you have it installed ahead of the session. You will also be registered onto our Moodle page for any communications and materials relating to this session. A joining link will be sent out as part of the booking confirmation process.
If you require any help before the session, such as accessibility support, please email the CDH Learning team (learning@cdh.cam.ac.uk), for further assistance.
Booking / availability