All Cambridge Digital Humanities courses

Showing courses 1-100 of 133

Analysing and Visualising Social Media Data (Workshop) new Mon 11 Feb 2019   14:00 Finished

This session introduces a variety of analytical strategies, with a focus on Social Network Analysis, the most widely used and abused method for analysing and visualising digital and social media data. At the end of this session, you will be familiar with the basic concepts, techniques and measures of social network analysis.

Archival Photography: An Introduction new Wed 12 Jun 2019   11:00 Finished

This session focusses on providing photography skills for those undertaking archival research. Dr Oliver Dunn has experience spanning a decade filming documents for major academic research projects. He will go over practical approaches to finding and ordering materials in the archive, methods of handling and filming them, digital file storage, and transcription strategies. The focus is very much on low-tech approaches and small budgets. We’ll consider best uses of smartphones, digital cameras and tripods. The session is held at the Digital Content Unit at the University Library.

Automated writing in the age of Machine Learning new Mon 7 Dec 2020   11:30 Finished

Computer programmes which predict the likely next words in sentences are a familiar part of everyday life for billions of people who encounter them in auto-complete tools for search engines and the predictive keyboards used by mobile phones and word processing software. These tools rely on “language models” developed by researchers in fields such as natural language processing (NLP) and information retrieval which assign probabilities to words in a sequence based on a specific set of “training data” (in this case a collection of texts where the frequencies of word pairings or three-word phrases have been calculated in advance).
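The frequency-based approach described above can be sketched in a few lines of Python. This is only a minimal illustration of counting word pairings, not the method used by any particular product: the toy `corpus` and the function names `train_bigrams` and `next_word_probabilities` are assumptions made up for this example.

```python
from collections import Counter, defaultdict


def train_bigrams(text):
    """Count how often each word is followed by each other word."""
    words = text.lower().split()
    counts = defaultdict(Counter)
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts


def next_word_probabilities(counts, word):
    """Turn the raw counts for one word into probabilities that sum to 1."""
    followers = counts.get(word.lower(), Counter())
    total = sum(followers.values())
    return {w: n / total for w, n in followers.items()} if total else {}


# Toy "training data" (hypothetical); real models use far larger collections.
corpus = "the cat sat on the mat and the cat slept"
model = train_bigrams(corpus)
probabilities = next_word_probabilities(model, "the")
print(probabilities)
```

In this toy corpus "the" is followed twice by "cat" and once by "mat", so a predictive keyboard built on these counts would suggest "cat" first.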

Recent developments in machine learning have led to the creation of general language models trained on extremely large datasets which can now produce ‘synthetic’ texts, answer questions and summarise information without the need for lengthy or costly processes of training for each new task. The difficulty of distinguishing the outputs of these language models from texts written by humans has provoked widespread interest in the media. Researchers have experimented with prompting GPT-3, a language model developed by OpenAI, to write short stories, answer philosophical questions and apparently propose potential medical treatments, although GPT-3 did have some difficulty with the question “how many eyes does a horse have?”. Meanwhile, The Guardian ‘commissioned’ an op-ed from GPT-3.

This Methods Workshop will explore the generation of ‘synthetic’ texts through presentations, discussion and demonstrations of text generation techniques which participants will be encouraged to try out for themselves during the sessions. We will also report back from the Ghost Fictions Guided Project, organised by Cambridge Digital Humanities Learning Programme in October and November this year. The project looks at how ideas about the distinction between ‘fact’, ‘fiction’ and ‘nonfiction’ are shaping the reception of text generation methods and aims to stimulate deeper critical engagement with machine learning by humanities researchers.

Prior knowledge of programming, computer science or Machine Learning is not required. In order to try out the text generation techniques demonstrated during the course you will need access to Google Drive (accessible via Raven login for University of Cambridge users).

Beginner's Filmmaking Workshop new Mon 17 Feb 2020   10:00 Finished

Tutors: Sarah McEvoy / Kostas Chondros

Are you curious about making a short documentary film?

This beginner’s filmmaking workshop will help you to start thinking visually and communicate using sound and film. Over two days you will be introduced to different camera shot types, how to construct a basic story, use digital video cameras and sound recorders to shoot your own footage, and then edit a short sequence for export.

The workshop assumes no or very little prior knowledge of filmmaking and no prior preparation is required for the workshop. This is a hands-on practical workshop, working in small teams of two or three people. We expect a willingness to be open to ideas and work in a team to jointly create a short film clip.

The workshop will give you the foundational skills to incorporate film and sound in your own future projects, for example short clips for social media, publicity about research projects as a way to engage wider audiences etc.

During the workshop you will work with dedicated video equipment, but the techniques you will learn can be adapted to film making with smartphones, tablets and other readily available personal electronic devices.

COURSE PROGRAMME

Day 1 – Monday 17th February

  • 10.00 Welcome and introductions
  • 10.30 Aims of the session
  • 10.45 Introduction to shot types, camera movements, framing, telling a story, basic rules of camera use, rules of recording sound
  • 11.45 Splitting into groups – interactive demonstration of how to use the cameras
  • 13.00 Lunch
  • 14.00 Filming around Cambridge, practical exercise working in groups
  • 16.00 Return to room to look at footage from all groups
  • 17.00 Feedback session, summary of day 1 and introduction to day 2

Day 2 – Tuesday 18th February

We will be working on Apple Macs and Final Cut X; however, we do not expect any prior knowledge of working with either the computers or the software.

  • 10.00 Importing footage onto computers
  • 10.15 Basic editing, creating a 2-minute clip, summary of creating a sequence
  • 10.45 Adding clips to timeline, tools for manipulating clips, using second video track, transitions and filters, syncing audio
  • 13.00 Lunch
  • 14.00 Credits, titles, adjusting audio levels, adding music or narration, exporting footage, saving files
  • 16.00 Looking at each other’s edited clips
  • 16.45 Evaluation
  • 17.00 Finish

Handouts will be emailed after the workshop, and include:

  • Presentation – shot types, how to construct a sequence
  • Editing on Final Cut X
  • Camera functions, audio recording, info about equipment and editing software and model release forms

What you need to take with you

Headphones – preferably the kind you can plug in rather than Bluetooth headphones

Storage device – if you want to take footage you shoot with you after the workshop, you will need a hard drive, USB or SD card that can hold at least 8GB. Video files are large. Please make sure that the device is formatted to FAT32 if you use it on a PC, as we will be using Macs. You can check this by right clicking the device and checking the properties. Alternatively, you don’t need to save the footage that you film and can instead upload the exported film to Dropbox.

Upon booking this workshop a questionnaire will be issued to participants which must be completed in order to satisfy the booking.

The workshop is led by:

Sarah McEvoy holds a BA Hons in Fine Art and an MA in Visual Anthropology from Goldsmiths, University of London, and recently completed an MA in Art and Design in Education at UCL Institute of Education. Sarah has worked with arts organisations and charities creating short documentaries and has most recently filmed and edited a film working with a socially engaged artist in the community of South East London. As an artist-educator, Sarah works with youth groups and adults with learning disabilities in the community and in museums and galleries.

Kostas Chondros holds an MA in Visual Anthropology from Goldsmiths College, University of London. He also holds an MA in Social Exclusion, Minorities & Gender from Panteion University and a BA in Social Anthropology & History from the University of the Aegean, Greece. Since joining the Personal Histories film production team in 2011, Kostas has filmed several events and taught camera & film production skills. Additionally, as a freelance filmmaker, Kostas documents improvised music performances and collaborates on film projects with other artists and performers. He is also a musician, poet and translator.

Find out how to use blogging in your research. The first of two sessions on research blogging will explore the benefits and limitations of blogging for public engagement.

The second of two sessions on research blogging will explore how social media can enable public engagement with your blog, show you how to set up a Twitter chat and look at other methods to get people talking about your research.

Bug Hunt 2020 [cancelled - Covid 19] new Tue 21 Apr 2020   13:00 CANCELLED

This programme is an opportunity to learn, through practical experience and shared investigation, how to apply digital methods for exploring and analysing a body of archival texts. The core of the programme will be five two-hour classroom-based sessions, supplemented by group and individual work between sessions on tasks related to the project design, delivery and documentation. In addition to attending all five face-to-face sessions, participants should set aside an additional 8-10 hours over the duration of the course for work on project-related tasks.

During the programme we’ll work together on a particular topic: how insects were represented in books created for children in the 19th century. This question will help us to think about how children’s encounters with the natural world might have been framed and shaped by their reading. We’ll work on digital collections of 19th century children’s books exploring how such collections are built and how they can be used for machine reading. We’ll develop specific research questions and you’ll learn how to explore them using different tools for textual stylistic analysis. At the end, we’ll present findings and consider the implications of what we’ve discovered.

Topics covered include:

  • The development of methods for machine reading the archive – ideas, motivations and ethics
  • Children’s books of the long 19th century – a beginner’s guide
  • Designing a small-scale investigation
  • Building a collection of digital texts
  • Transforming texts into searchable data
  • Analysing stylistic patterns in the data

Bulk Data Capture: an overview new Tue 23 Feb 2021   10:00 Finished

This CDH Basics session provides a brief introduction to different methods for capturing bulk data from online sources or via agreement with data collection holders, including Application Programme Interfaces (APIs). We will address issues of data provenance, exceptions to copyright for text and data-mining, and discuss good practice in managing and working with data that others have created.

Data which other people have created is often either unstructured or structured in the wrong way for the questions that you want to answer. Rather than reinventing the wheel and collecting it all over again, this CDH Basics session introduces participants to OpenRefine, a free ‘power tool’ for dealing with messy data. In order to work with OpenRefine you will need administrator privileges to install software on your laptop. 

CDH Basics: Acquiring data for your project new Wed 1 Nov 2023   09:30 Finished

This session provides a brief introduction to different methods for capturing bulk data from online sources or via agreement with data collection holders, including Application Programme Interfaces (APIs). We will address issues of data provenance, exceptions to copyright for text and data-mining, and discuss good practice in managing and working with data that others have created.

  • Data collection methods
  • Introduction to working with APIs
  • Data brokerage
  • Provenance and integrity
  • Assessing intellectual property, copyright and Data Protection issues
  • Documentation of collection methods
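
One way to document collection methods and check integrity is to store a small provenance record alongside each captured file. The sketch below is an assumption about what such a record might contain, not a format prescribed by the session; the field names, the `provenance_record` function and the example URL are all hypothetical.

```python
import hashlib
import json
from datetime import datetime, timezone


def provenance_record(data, source_url, method, licence):
    """Describe where captured bytes came from and fingerprint them."""
    return {
        "source_url": source_url,
        "collection_method": method,
        "licence": licence,
        "retrieved_at": datetime.now(timezone.utc).isoformat(),
        # A checksum lets you verify later that the file is unchanged.
        "sha256": hashlib.sha256(data).hexdigest(),
        "size_bytes": len(data),
    }


# Hypothetical captured data and source.
captured = b"id,title\n1,Example record\n"
record = provenance_record(
    captured,
    source_url="https://example.org/api/records",
    method="API request",
    licence="unknown - check before reuse",
)
print(json.dumps(record, indent=2))
```

Recomputing the SHA-256 checksum at a later date and comparing it with the stored value is a simple way to confirm that a dataset has not been altered since it was collected.
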

CDH Basics: Analysing and presenting your data new Wed 15 Nov 2023   09:30 Finished

The impact of well-crafted data visualisations has been well-documented historically. Florence Nightingale famously used charts to make her case for hospital hygiene in the Crimean War, while Dr John Snow’s map of cholera deaths in London helped convince the authorities of the water-borne nature of the disease. However, as information designer Alberto Cairo notes, charts can also lie. This introductory Basics session presents the basic principles of data visualisation for researchers who are new to working with quantitative data.

  • Principles and good practice in data visualisation
  • Basic introduction to quantitative methods of data analysis

CDH Basics: Bulk data capture new Tue 8 Feb 2022   10:00 Finished

This CDH Basics session investigates three different methods for accessing digital data ‘in bulk’: using an API (Application Programme Interface), web scraping and direct access (via download or on a hard drive). We will explore the importance of good practice in documenting the provenance of data that others have created and discuss the practical steps in research data management essential to ensuring that you are able to make legal and ethical use of this type of data in your research. No knowledge of programming languages is required; however, there will be a demonstration of a Python web scraper during the session, and references to more in-depth tutorials on web scraping will be provided.
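
The session's own demonstration is not reproduced here. As a rough idea of what the parsing step of a web scraper involves, the following is a minimal sketch using only Python's standard library. The HTML snippet, the `course-title` class and the `TitleScraper` name are hypothetical; in practice the page would first be downloaded (for example with `urllib.request`), subject to the website's terms of service.

```python
from html.parser import HTMLParser


class TitleScraper(HTMLParser):
    """Collect the text of every <h2 class="course-title"> element."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._inside_title = False

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "h2" and ("class", "course-title") in attrs:
            self._inside_title = True
            self.titles.append("")

    def handle_data(self, data):
        if self._inside_title:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        if tag == "h2":
            self._inside_title = False


# Hypothetical page content standing in for a downloaded web page.
SAMPLE_HTML = """
<html><body>
  <h2 class="course-title">Bulk data capture</h2>
  <p>Finished</p>
  <h2 class="course-title">Foundations of data visualisation</h2>
  <h2 class="other">Not a course</h2>
</body></html>
"""

scraper = TitleScraper()
scraper.feed(SAMPLE_HTML)
titles = [title.strip() for title in scraper.titles]
print(titles)
```

The scraper keeps only the headings marked with the assumed `course-title` class and ignores everything else on the page, which is the basic pattern behind turning a web page into structured data.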

CDH Basics: Computer vision: a critical introduction new Tue 24 May 2022   10:00 Finished

Machine learning-driven systems for seeing and sorting still and moving images are increasingly common in many contexts. This CDH Basics session explores the technical fundamentals of machine vision and discusses the societal and cultural impact of these systems, including the challenges and opportunities faced by humanities and social science researchers using computer vision systems as research tools.

In this CDH Basics session, we will discuss how to assess the impact of relevant legal frameworks, including data protection, intellectual property and media law, on your digital research project and consider what approach researchers should take to the terms of service of third-party digital platforms. We will explore the challenge of informed consent in a highly networked world and look at a range of strategies for dealing with this problem. 

CDH Basics: Designing a digital research project new Wed 25 Oct 2023   09:30 Finished

This CDH Basics session explores the lifecycle of a digital research project across the stages of design, data capture, transformation, analysis, presentation and preservation. It introduces tactics for embedding ethical research principles and practices at each stage of the research process.

  • Introduction to the digital project life cycle
  • Ethics by design and EDI-informed data processing
  • Data and metadata - definitions
  • Basics of data curation (good practice in file naming, version control)
  • Understanding files and folders

Ensuring long-term access to digital data is often a difficult task: both hardware and code decay much more rapidly than many other means of information storage. Digital data created in the 1980s is frequently unreadable, whereas books and manuscripts written in the 980s are still legible. This CDH Basics session explores good practice in data preservation and software sustainability and looks at what you need to do to ensure that the data you don’t want to keep is destroyed.

CDH Basics: Digital research design and data ethics new Tue 9 Nov 2021   10:00 Finished

This CDH Basics session explores the lifecycle of a digital research project, across the stages of design, data capture, transformation, analysis, presentation and preservation, and introduces tactics for embedding ethical research principles and practices at each stage of the research process.

CDH Basics: First steps in coding with Python new Tue 15 Mar 2022   10:00 Finished

This CDH Basics session is aimed at researchers who have never done any coding before. We will explore basic principles and approaches to writing and adapting code, using the popular programming language Python as a case study. Participants will also gain familiarity with using Jupyter Notebooks, an open-source web application that allows users to create and share documents containing live code alongside visualisations and narrative text.

CDH Basics: Foundations of data visualisation new Tue 8 Mar 2022   10:00 Finished

The impact of well-crafted data visualisations has been well-documented historically. Florence Nightingale famously used charts to make her case for hospital hygiene in the Crimean War, while Dr John Snow’s map of cholera deaths in London helped convince the authorities of the water-borne nature of the disease. However, as information designer Alberto Cairo notes, charts can also lie. This introductory CDH Basics session presents the basic principles of data visualisation for researchers who are new to working with quantitative data.

CDH Basics: Re:search new Tue 26 Oct 2021   10:00 Finished

In this CDH Basics session, participants will explore how searching and finding technologies structure scholarship, through an introduction to search engines both for web search and custom search functions within collections. We will discuss how errors introduced by digitisation technologies create blind spots for digital search in historical collections, interacting with social and legal processes to structure bias and discrimination into search processes. The session will provide a brief introduction to the importance of machine-learning-driven systems for digital search and suggest strategies for researchers to critically engage with, rather than passively accept, search engine results.

CDH Basics: Sustaining your data new Wed 29 Nov 2023   09:30 Finished

Ensuring long-term access to digital data is often a difficult task: both hardware and code decay much more rapidly than many other means of information storage. Digital data created in the 1980s is frequently unreadable, whereas books and manuscripts written in the 980s are still legible. This session explores good practice in data preservation and software sustainability and looks at what you need to do to ensure that the data you don’t want to keep is destroyed.

  • Data and code sustainability
  • Retention, archiving and re-use
  • Data destruction
  • Recap on the project life-cycle

CDH Basics: Transforming your data new Wed 8 Nov 2023   09:30 Finished

Data which you have captured rather than created yourself is likely to need cleaning up before you can use it effectively. This short session will introduce you to the basic principles of creating structured datasets and walk you through some case studies in data cleaning with OpenRefine, a powerful open source tool for working with messy data.

  • Structuring your data
  • Cleaning messy textual data with OpenRefine
  • Batch processing file names
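
OpenRefine is a graphical tool, so no code is needed to use it. As an illustration of the kind of cleaning it automates, the sketch below groups spelling variants by a normalised "fingerprint" key, loosely modelled on OpenRefine's key-collision clustering. The `fingerprint` and `cluster` functions and the sample names are assumptions for this example, not OpenRefine's actual implementation.

```python
import string
import unicodedata
from collections import defaultdict


def fingerprint(value):
    """Reduce a string to a normalised key so near-duplicates collide."""
    # Strip accents: decompose characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", value)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    # Lowercase and remove punctuation.
    cleaned = ascii_only.lower().translate(
        str.maketrans("", "", string.punctuation)
    )
    # Sort the unique tokens so word order no longer matters.
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group values that share a fingerprint; keep groups with variants."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]


# Hypothetical messy column of names.
names = [
    "Brontë, Charlotte",
    "charlotte bronte",
    "Charlotte  Brontë.",
    "Emily Brontë",
]
clusters = cluster(names)
print(clusters)
```

The three spellings of Charlotte Brontë end up in one cluster because they share the key `bronte charlotte`, while "Emily Brontë" is left alone. A researcher would still review each cluster by hand before merging values.
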

CDH Basics: Understanding data and metadata new Tue 12 Oct 2021   10:00 Finished

This CDH Basics session provides a basic introduction to good practice around understanding file formats, version control and the principles of data curation for individual researchers. We will examine the importance of metadata (‘data about data’), exploring the crucial role played by classification systems and standards in shaping how scholars interact with historical and cultural records. Rather than accepting data as a ‘given’, we will discuss the creation and curation of data as interpretative practices and analyse their relationship to other traditions of scholarship in the humanities and social sciences.

This CDH Basics session introduces the IIIF image data framework, which has been developed by a consortium of the world’s leading research libraries and image repositories, and demonstrates a range of different machine learning-based methods for exploring digital image collections.

Places are limited, and participants must complete this form to participate in addition to booking online. We will write to confirm your participation by email. Bookings will remain open until 10 am, Wednesday 20 October; however, participants are encouraged to apply early as demand is likely to be high, and we will not be able to guarantee that your ArcGIS Online account will be activated for the first session.

This CDH Guided Project series will offer an overview of GIS techniques applied to digitising historical material, from basic manual digitisation to using platforms for crowd-sourced digitisation. It will introduce GIS best practices and terminology and enable participants to design and launch their own projects. Each session will offer a 20-minute presentation, followed by 10 minutes of Q&A and one hour of practice, using ArcGIS Online and a range of other GIS solutions. The teaching will be delivered by a team composed of a geospatial analyst, an architect and a historian, giving participants from all fields a broad range of views and expertise to draw on.

Participation in this guided project will also contribute to an ongoing research project led by Dr Alexis Litvine and Dr Isabelle Séguy (anrcommunes.fr), which is (among other things) reconstructing historical transport networks for France. During the sessions, participants will help digitise nineteenth-century French roads using military maps. The work will ultimately be part of a journey planner (aka a "Google Maps") of the past for France.

Applications are invited from early career researchers and others at the University of Cambridge to join this project for four online sessions during the Guided Project phase in October-November. The project concludes with a live “mapathon” session on International GIS Day, 17 November. On this day, participants will all meet (preferably in person, but online will be possible) for a friendly but competitive digitisation challenge against participants in a similar guided project held in France. Pizza and refreshments will be provided.

Participants will need to commit to joining the live sessions and to set aside at least 3-4 hours of individual digitisation work. Participation in the final “mapathon” (online or in-person) is also expected, but no prior GIS knowledge is required.

Chris Houghton (Head of Digital Scholarship for Gale) joins us to deliver this suite of CDH Labs sessions. Chris collaborates globally with scholars in the digital humanities community, ensuring that the development of Gale Digital Scholar Lab continues to meet their needs.

Are you interested in looking at primary sources in new ways? Would you like to learn how to analyse large sets of historical and contemporary materials to provide a different perspective on your research?

In this session we will introduce Gale Digital Scholar Lab, a cloud-hosted text and data mining platform available to the University. The Lab combines the text from Gale’s archive collections available at Cambridge, including Times Digital Archive and Eighteenth Century Collections Online (ECCO), with powerful text mining tools that enable sophisticated, wide-ranging analysis.

You don’t need any previous experience in text and data mining, and you don’t have to have any interest in coding or algorithms – this session will explain how absolutely anyone can run these analyses and enhance their research accordingly.

CDH Labs: Digital Scholar Lab sessions: Tools in Depth new Thu 13 May 2021   15:00 Finished

Chris Houghton (Head of Digital Scholarship for Gale) joins us to deliver this suite of CDH Labs sessions. Chris collaborates globally with scholars in the digital humanities community, ensuring that the development of Gale Digital Scholar Lab continues to meet their needs.

This in-person workshop will provide an accessible, non-technical introduction to Machine Learning systems, aimed primarily at graduate students and researchers in the humanities, arts and social sciences. No prior knowledge of programming is required.

We will focus on the technical, ethical and societal implications of embedding Machine Learning systems for classifying and generating texts and images into the world of work, with a particular emphasis on the impact of Large Language Models such as ChatGPT. We will explore these text generation systems in the context of longer histories of AI, including the ‘deep learning revolution’ in image-based Machine Learning systems which laid the foundations for popular text-to-image generation models such as Stable Diffusion.

Participants will have the chance both to learn more about how AI works and to discuss what the embedding of such systems into labour processes, management structures and resource allocation systems may mean for how society works.

Convenor: Mary Chester-Kadwell (Lead Research Software Engineer, Cambridge Digital Humanities)

Please note this workshop has limited spaces and an application process in place. Application forms should be completed by noon on Sunday, 12 March 2023. Successful applicants will be notified by the end of the day on Tuesday, 14 March 2023.

This course introduces best practices and techniques to help you better manage your code and data, and develop your project into a usable, sustainable, and reproducible workflow for research.

Developing your coding practice is an ongoing process throughout your career. This intermediate course is aimed at students and staff who use coding in research, or plan on starting such a project soon. We present an introduction to a range of best practices and techniques to help you better manage your code and data, and develop your project into a usable, sustainable, and reproducible workflow. All the examples and exercises will be in Python.

If you are interested in attending this course, please fill in the application form. Please ensure you are logged in to your University Google account to access the form.

Convenor: Dita N. Love (CDH Methods Fellow)

In the introduction to the special issue Testimonial Cultures, Sara Ahmed and Jackie Stacey wrote that “speaking out about injustice, trauma, pain and grief have become crucial aspects of contemporary life which have transformed notions of what it means to be a subject, what it means to speak, and how we can understand the formation of communities and collectives” (p.2, 2001). These workshops therefore ask: what does it mean to centre survivor-knowledge, and witness together the aftermath of intersecting violence, when language and traditional methods often fail to re-present the experience of trauma? How can we avoid tokenising creative-digital research under the pressures of a precarious academy and creative sector?

CDH Methods | Digital Archival Photography new Mon 14 Nov 2022   10:30 Finished

This Methods Workshop will introduce advanced techniques used for the digitisation and preservation of archival material. The first workshop will introduce the following topics:

  • Copyright and sensitive data considerations
  • Understanding Photography basics
  • Digitisation Imaging Standards
  • Scene and capture calibration
  • Image post-processing
  • Taking usable images in any conditions
  • Digital preservation principles and good practice

Completing the workshop will give participants a good understanding of archival photography best practices. You will gain a strong professional vocabulary to discuss imaging and a toolkit to assess image quality.

A second session, bookable separately, will focus on how to apply those principles to the projects chosen by the participants. This will cover learning a practical approach to taking images fit for purpose in any conditions with available resources. It may also address more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are of interest to the participants. It will also be an opportunity to visit the Digital Content Unit at Cambridge University Library.

CDH Methods | Digital Archival Photography new Fri 3 Mar 2023   10:30 Finished

This Methods Workshop will introduce advanced techniques used for the digitisation and preservation of archival material. The first workshop will introduce the following topics:

  • Copyright and sensitive data considerations
  • Understanding Photography basics
  • Digitisation Imaging Standards
  • Scene and capture calibration
  • Image post-processing
  • Taking usable images in any conditions
  • Digital preservation principles and good practice

Completing the workshop will give participants a good understanding of archival photography best practices. You will gain a strong professional vocabulary to discuss imaging and a toolkit to assess image quality.

A second session, bookable separately, will focus on how to apply those principles to the projects chosen by the participants. This will cover learning a practical approach to taking images fit for purpose in any conditions with available resources. It may also address more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are of interest to the participants. It will also be an opportunity to visit the Digital Content Unit at Cambridge University Library.

CDH Methods | Digital Archival Photography in-depth new Mon 13 Mar 2023   10:30 Finished

Following the introductory session, this second session will focus on how to adapt the principles to the projects chosen by the participants. This will cover a practical approach to taking images fit for purpose in any conditions with available resources. It may also address more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are of interest to the participants. It will also be an opportunity to visit the Digital Content Unit at Cambridge University Library.

CDH Methods: First Steps in Coding with Python new Mon 6 Nov 2023   14:30 Finished

Convenor: Dr Estara Arrant (Cambridge University Library)

This session is aimed at researchers who have never done any coding before. We will explore basic principles and approaches to navigating and working with code, using the popular programming language Python. Participants will use the Jupyter Notebooks platform to learn how to analyse texts. This will provide participants with a working foundation in the fundamentals of coding in Humanities research. The software we will use is free to download and compatible with most computers, and we will provide support in installation and setup before the class.
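
As a flavour of the kind of first text-analysis step a notebook cell might contain, here is a minimal sketch that counts word frequencies in a short passage. The sample sentence and the function name `word_frequencies` are made up for illustration and are not taken from the course materials.

```python
from collections import Counter
import re


def word_frequencies(text):
    """Count how often each word appears, ignoring case and punctuation."""
    # Lower-case the text and keep only runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


# A made-up sample sentence, used only for illustration.
sample = "The archive holds letters. The letters describe the archive."
frequencies = word_frequencies(sample)

# Print the three most common words with their counts.
for word, count in frequencies.most_common(3):
    print(word, count)
```

Only Python's standard library is used here, so the cell should run in any Jupyter Notebook without extra installation.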

Convenors: Leah Brainerd & Alex Gushurst-Moore (CDH Methods Fellow)

Centuries of ceramics. Millennia of maquettes. How do we grapple with large datasets? Join archaeologist Leah Brainerd and art historian Alex Gushurst-Moore to increase your computational literacy, learn how to scrape data from collections databases, and interpret that data through visual means.

Over two, two-hour sessions, you will be introduced to:

  • Collections databases: what they are, how they are built, and how to navigate them
  • Web-scraping: how do you go from a webpage on the internet to a dataset on your computer? A basic introduction to how web-scraping with R works, including a worked example, the ethics of data, and how to evaluate a website for future data collection
  • Data visualisation software: what options are available and how to use the open-source, online system mapping tool, Kumu
  • Cultural evolutionary theory: cultural evolution is the change of culture over time; explore a theoretical perspective that views cultural information as an evolutionary process which teaches us, through cultural transmission, more about human decision making
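
To give a sense of what going from a webpage to a dataset involves, the sketch below pulls record titles out of a small HTML fragment. The workshop itself uses R; Python's standard `html.parser` module is swapped in here purely for illustration. The HTML fragment, the `TitleScraper` class and the assumption that titles sit in `<h2>` tags are all hypothetical, and a real collections website would need its own inspection (and a check of its terms of use) first.

```python
from html.parser import HTMLParser


class TitleScraper(HTMLParser):
    """Collect the text of every <h2> element on a page."""

    def __init__(self):
        super().__init__()
        self.in_heading = False
        self.titles = []

    def handle_starttag(self, tag, attrs):
        # Start a new, empty title whenever an <h2> opens.
        if tag == "h2":
            self.in_heading = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_heading = False

    def handle_data(self, data):
        # Only keep text that appears inside an <h2>.
        if self.in_heading:
            self.titles[-1] += data


# Hypothetical page fragment standing in for a collections database listing.
page = """
<div class="record"><h2>Glazed bowl</h2><p>Ceramic, c. 1650</p></div>
<div class="record"><h2>Plaster maquette</h2><p>Sculpture model, 1921</p></div>
"""

scraper = TitleScraper()
scraper.feed(page)
titles = [title.strip() for title in scraper.titles]
print(titles)
```

The example works on a string held in memory rather than a live URL, so it can be run without making any requests to a real website.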

The workshop will take place over two sessions. The first session (30 January) will cover collections databases and web-scraping. The second session (6 February) will cover data visualisation and cultural evolutionary theory. These sessions will consist of practical tutorials and discussion with the course leads. After each session, participants will be given an optional task to try out new skills acquired, on which they can receive feedback from the course organisers.

CDH Methods | Introduction to R Studio and R Markdown new Mon 21 Nov 2022   13:00 Finished

Convenor: Giulia Grisot (CDH Methods Fellow and a Visiting Academic)

This Methods Workshop will deliver an introduction to R Studio and R Markdown; the workshop will run through the functionalities and advantages of using R Studio and related tools for organising and analysing data, as well as for writing and referencing.

About the convenor: Giulia has a mixed background in Literary Linguistics, Psycholinguistics and Digital Humanities and has gained experience in both qualitative and quantitative approaches to texts and language in general, becoming familiar with several coding languages (R, Python) essential for statistical as well as corpus investigations.

Giulia is currently working with large corpora of Swiss German fictional texts, looking at sentiments in relation to represented spatial locations, using both lexicon-based methods and machine learning.

This in-person workshop will provide an accessible, non-technical introduction to Machine Learning systems, aimed primarily at graduate students and researchers in the humanities, arts and social sciences.

Key topics covered in the sessions will include:

  • Situating Machine Learning in the longer history of Artificial Intelligence
  • Machine Learning system architectures
  • The challenges of dimension reduction, classification and generalisation
  • Sources of bias and problems of interpretation
  • Machine Learning applications and their societal consequences

During the session participants will be encouraged to work through practical exercises in image classification. No prior knowledge of programming is required. Participants wishing to run the experiments for themselves will need access to a laptop, but no special software is required, just an up-to-date web browser and an internet connection. We will be using Google Colab for the text generation experiments, which you have access to via your Raven log-in. The image classification experiments will require a GitHub account (sign up here: https://github.com/).

Convenor: Estara Arrant (CDH Methods Fellow)

This methods workshop will teach students three powerful machine learning algorithms appropriate for Humanities research projects. These algorithms are designed to help you identify and explore meaningful patterns and correlations in your research material and are appropriate for descriptive, qualitative data sets of almost any size. These algorithms are applicable to virtually any Humanities field or research question.

  • Multiple Correspondence Analysis: automatically identifies correlations and differences between specific data elements. This helps one to understand how different features (‘variables’ or ‘characteristics’) of one’s data are related to each other, and how strong their relationships are. This can be useful in almost any research project. For example, in a sociological dataset, this analysis could help clarify relationships between specific demographic characteristics (race, gender, political affiliation) and socioeconomic status (working class, education level, income bracket).
  • K-modes clustering and hierarchical clustering: finding groups of similarity and relationship within the entirety of your data. Clustering helps one to identify which variables/characteristics group together, and which do not, and the degree of difference between groups. For example, such clustering could allow an art historian to see how paintings from one decade are characterised by style and artist, as contrasted to paintings from another decade (thus tracking shifts in artistic trends over time)
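
The clustering idea described above can be sketched in a few lines. The workshop teaches these methods in R; the example below uses plain Python only to show the logic of k-modes, which groups categorical records by counting mismatched attributes and summarising each group by its most frequent values. The painting records, the function names and the choice of starting modes are all made up for illustration.

```python
from collections import Counter


def mismatch(a, b):
    """Simple matching dissimilarity: number of attributes that differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """Most frequent value in each column of a group of records."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def k_modes(records, initial_modes, max_rounds=10):
    """Alternate between assigning records to modes and updating the modes."""
    modes = list(initial_modes)
    labels = None
    for _ in range(max_rounds):
        # Assign each record to the mode it differs from least.
        new_labels = [
            min(range(len(modes)), key=lambda j: mismatch(record, modes[j]))
            for record in records
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Recompute each mode from the records assigned to it.
        for j in range(len(modes)):
            members = [r for r, label in zip(records, labels) if label == j]
            if members:
                modes[j] = column_modes(members)
    return labels, modes


# Hypothetical records: (decade, style, medium).
paintings = [
    ("1890s", "impressionist", "oil"),
    ("1890s", "impressionist", "pastel"),
    ("1890s", "realist", "oil"),
    ("1920s", "cubist", "oil"),
    ("1920s", "cubist", "collage"),
    ("1920s", "surrealist", "oil"),
]

# Starting modes are picked by hand here; real analyses usually try several starts.
labels, modes = k_modes(paintings, initial_modes=[paintings[0], paintings[3]])
print(labels)
print(modes)
```

Each resulting mode reads as a "typical" record for its group, which is what makes the output easy to interpret for descriptive, qualitative data.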

This workshop will specifically cover the following:

  • Determining when your research could benefit from machine learning analysis
  • Designing a good methodology and running the analysis
  • Interpreting the results and determining if they are meaningful
  • Producing a useful visualisation (graphic) of the results
  • Communicating the findings to other scholars in the Humanities in an accessible way

Students will actively implement a small research project using a practice dataset and are encouraged to try out the methods in their current research. They will learn the basics of running the analysis in the R programming language.

This Methods Workshop explores primary data collection using digital and online qualitative methods. It teaches methods for detailed assessment of the suitability of online platforms for the collection of research data, considering not only general ethical issues, privacy, encryption, and terms and conditions, but also inclusivity for neurodivergent and vulnerable participants.

Convenor: Orla Delaney (CDH Methods Fellow)

What does it mean to prioritise small data over big data?

Cultural heritage datasets, such as museum databases and digital archives, seem to resist the quantitative methods we usually associate with data science work, asking to be read and explored rather than aggregated and analysed. This workshop provides participants with a non-statistical toolkit that will enable them to approach, critique, and tell the story of a cultural heritage dataset.

Together we will consider approaches to the database from the history of science and technology, media archaeology, and digital ethnography. This will be done alongside an overview of practical considerations relevant to databasing in the sector, such as standards like FAIR (Findable, Accessible, Interoperable, Reusable) and CARE (Collective Benefit, Authority to Control, Responsibility, Ethics), specific technologies like linked data, and the results of recent projects aiming to criticise and diversify the underpinning technologies of cultural heritage databases. This workshop is aimed both at cultural heritage professionals and students, and at data science researchers interested in introducing a qualitative approach to their work.

This project begins from the premise that ‘transparency’ is not clear at all. Transparency is historically mediated, culturally constructed, and ideologically complex. Understood expansively, transparency is enmeshed with a variety of functions and associations, having been mobilised as a political call to action; a design methodology; a radical practice of digital disruption; an ideological tool of surveillance; a corporate strategy of diversion; an aesthetics of obfuscation; a cultural paradigm; a programming protocol; a celebration of Enlightenment rationality; a tactic for spatialising data; an antidote to computational black boxing; an ethical cliché; and more.

Across two workshops, we will explore the multidimensionality and intractability of transparency and investigate how the demand for more of it—in our algorithms, computational systems, and culture more broadly—can encode assumptions about the liberational capacity of restoring representation to the invisible. As a group we will conduct a survey of transparency and its political ramifications to digital culture by learning about its conceptual genealogies; interrogating its relevance to art and architecture; questioning its limits as an ethical imperative; and mapping it as a contemporary strategy of anti/mediation. Drawing on a combination of artworks, historical texts, cultural touchstones, and moving images, these workshops will give participants an opportunity to attend to transparency’s complex configurations within contemporary culture through a media theoretical lens. This project is designed to facilitate collaborative study; foster inter-disciplinary discourse; promote experimental learning; and develop a more theoretically nuanced and historically grounded starting point for critiquing transparency and its operations within digital culture.

Convenor: Tom Kissock (CDH Methods Fellow)

This Methods Workshop will offer Video Data Analysis for Social Science and Humanities students. It’s a relatively new, broad, and innovative multi-disciplinary methodology that helps students understand how video fits into modern research both inside and outside academia. For example, Cisco has estimated that video will make up 80% of internet traffic and 17.1% of it will be live video which is a 15-fold increase since 2017; therefore, it’s a tool that cannot be overlooked when conducting research.

Tom will address how to use video ethically, for example:

  • Informed consent
  • Storage
  • Privacy

and also practically:

  • Building timelines
  • Coding schemes
  • Presenting research findings

Tom also plans to include a lesson focussed on viewing livestreams in a reflexive manner, as this is a huge topic in the TikTok era.

About the convenor: Tom has fifteen years’ experience as a Director, Executive Producer, and Livestream expert for the BBC, YouTube, NBC, and Cisco; coupled with seven years’ experience researching video witnessing and human rights abuses. In 2020 he received his MSc in Globalization and Latin American Development from UCL where his research used Video Data Analysis as a research methodology. He tracked how populist politicians in Brazil built misinformation campaigns by strategically cross-sharing videos to avoid journalistic questioning as a symbolic accountability mechanism during the 2018 presidential elections.

His PhD in Sociology at the University of Cambridge is a loose extension of his MSc, but explores positive aspects of streaming advocacy, such as how Indigenous video activists in Brazil use live video on platforms like Instagram, TikTok, and Kwai to reach audiences to discuss climate change, the environment, and land rights. He is interested in how video can produce knowledge and, subsequently how societies value different knowledge through the process of video witnessing. In his spare time, he serves as the Executive Producer of Declarations: Human Rights Podcast (part of Cambridge’s Centre for Governance and Human Rights), has given lectures on live streaming and human rights at MIT, UCL, and the University of Essex, and has written pieces for LatAM Dialogue and the Latin American Bureau.

Convenor: Dr Eleanor Dare (CDH Methods Fellow)

This Methods Workshop will invite participants to originate innovative research methods using virtual and augmented reality technologies underpinned by theoretical and pedagogic understandings. The session is conceived in recognition of an increasing interest in using virtual and extended reality (VR and XR) to create collaborative research spaces that span different locations, time zones, and spatiality. Such spaces might be used to investigate the impact of design, architecture and location on education or new ways to teach an array of subjects, from language to mathematics to performance, AI ethics and music.

About the convenor: Eleanor is currently the Co-Convenor for Arts, Creativity and Education at the University of Cambridge, Faculty of Education. They are also the Senior Teaching Associate: Educational Technologies, Arts and Creativity, lecturing and supervising on MPhil Arts, Creativities and Education, MPhil Knowledge, Power and Politics, and MEd Transforming Practice. Eleanor is module lead for AI and Education, a Personal and Professional Development course at Cambridge.

Eleanor Dare’s research addresses the implications of digital technology and virtuality as a material for collaboration, critical-educational games development, performance, worldbuilding and pedagogic experimentation. Eleanor has been involved in several AHRC/EPSRC/ESRC/Arts Council/British Council funded projects investigating aspects of virtual and extended reality as well as projects with the Mozilla Foundation (AI-Musement/Monstrous 2022-2023), Theatre in the Mill Bradford (Bussing Out, 2022) and the Big Telly Theatre Company (via the Arts Council of Northern Ireland) for Rear Windows, forthcoming.

Dr Anne Alexander, Cambridge Digital Humanities

Places are limited and participants must complete this form in order to participate in addition to booking online. We will write and confirm your participation by email. Bookings will remain open until 10am, 11 October 2021; however, participants are encouraged to apply early as demand is likely to be high.

This online workshop will provide an accessible, non-technical introduction to Machine Learning systems, aimed primarily at graduate students and researchers in the humanities, arts and social sciences. It is designed as a preparatory session for potential applicants to our Interaction with Machine Learning Guided Project which will run in Lent Term 2022 in collaboration with the Department of Computer Science and Technology. However, it can also be booked as a standalone session.

CDH Methods | Writing Interactive Fiction new Mon 27 Nov 2023   13:00 Finished

Interactive Fiction (IF) stories let readers decide which paths the story should follow, featuring non-linear narrative design. The discipline combines the excitement of post-structuralist narratives with the power of creative coding, making it a perfect introduction for participants more familiar with one field than the other. In this workshop, led by Methods Fellow Claire Carroll, we’ll explore both parser-based (rooted in reader instructions and terminal response) and choice-based (hyperlink or multiple choice-driven) IF and work together to write our own interactive fiction. The workshop will also introduce participants to the passionate IF community, which offers advice and support to experienced writers and newcomers alike.

This CDH Basics session explores how data which you have captured rather than created yourself is likely to need cleaning up before you can use it effectively. This short session will introduce you to the basic principles of creating structured datasets and walk through some case studies in data cleaning with OpenRefine, a powerful open source tool for working with messy data.

Computer Vision: A critical introduction new Tue 25 May 2021   10:00 Finished

Machine vision systems can potentially help humanities researchers see historical and cultural image collections differently, and could provide tools to answer new research questions. This CDH Basics session provides an introductory overview of basic tasks in machine vision, such as Image Classification, Object Detection and Image Captioning within a critical framework highlighting the challenges of algorithmic bias and the limits of automation as a method for humanistic enquiry.

Creating Databases from Historical Sources (Workshop) Mon 25 Feb 2019   11:00 Finished

This workshop will examine strategies for transforming a variety of sources into structured digital data, ranging from crumbling manuscripts to printed documents and books.

Leonardo Impett, Cambridge Digital Humanities

Application forms should be returned to CDH Learning (learning@cdh.cam.ac.uk) by Friday 22 May 2020. Successful applicants will be notified by 26 May 2020.

This course will introduce graduate students, early-career researchers, and professionals in the humanities to the technologies of image recognition and machine vision, including recent developments in machine vision research in the past half-decade. The course will seek to combine a technical understanding of how machine vision systems work, with a detailed understanding of the possibilities they open to research and study in the humanities, and with a critical exploration of the social, political and ideological dimensions of machine vision.

Learning outcomes

By the end of the course, students should be able to:

  • Understand the basic tasks of machine vision, such as Image Classification, Object Detection, Image-to-Image Translation, Image Captioning, Image Segmentation etc.
  • Understand the fundamental technical operations of image processing and machine vision: the pixel encoding of images, Gaussian and convolutional filters,
  • Explore critical aspects of machine vision in a technically-informed way: e.g. the problems in algorithmic bias brought about by featureless convolutional networks
  • Develop and run their own simple machine vision and image processing pipelines, in a visual programming language compiling to Python
  • Understand the potential synergies and limitations of machine vision applications in humanities research and cultural heritage institutions
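
The pixel encoding and filtering operations named in the outcomes above can be illustrated with a very small example. This is a sketch in plain Python rather than the visual programming environment used on the course: the image is a made-up 4x4 grid of brightness values, and the kernel is a common 3x3 approximation of a Gaussian blur.

```python
def convolve(image, kernel):
    """Slide a square kernel over a greyscale image (no padding at the edges).

    The kernel is not flipped, which is strictly cross-correlation; for a
    symmetric kernel such as the one below the result is the same.
    """
    size = len(kernel)
    out_height = len(image) - size + 1
    out_width = len(image[0]) - size + 1
    result = []
    for row in range(out_height):
        out_row = []
        for col in range(out_width):
            # Weighted sum of the pixels currently under the kernel.
            total = 0.0
            for i in range(size):
                for j in range(size):
                    total += image[row + i][col + j] * kernel[i][j]
            out_row.append(total)
        result.append(out_row)
    return result


# 3x3 approximation of a Gaussian blur; the weights sum to 1.
gaussian = [
    [1 / 16, 2 / 16, 1 / 16],
    [2 / 16, 4 / 16, 2 / 16],
    [1 / 16, 2 / 16, 1 / 16],
]

# Made-up image: a single bright pixel on a dark background.
image = [
    [0, 0, 0, 0],
    [0, 16, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]

blurred = convolve(image, gaussian)
print(blurred)
```

The bright pixel is spread across its neighbours in proportion to the kernel weights, which is the same operation a convolutional network layer performs with learned rather than fixed weights.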

Data Presentation and Preservation new Tue 28 Jan 2020   11:30 Finished

The afterlife of your research data forms a vitally important part of your research project. Research funders and academic journal publishers are often strongly committed to the re-use of data and are reluctant to fund or publish research where datasets are not accessible for the purposes of peer review or further use. Yet the push for open data exists in tension with the expectations of data protection law which requires transparency from researchers about how long they will retain personal data. This session will explore good practice in data sharing and archiving as well as introducing sources of further information and advice within the University on this topic.

Data Wrangling (Workshop) new Mon 4 Feb 2019   14:00 Finished

Garbage in, garbage out! Your output is as good or as bad as your input. Data collected from online sources is often dirty and messy. Discover how to clean and organise your data. After transforming raw data into a structured dataset, you will be ready to perform data analysis.
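
A minimal sketch of what cleaning messy data can look like in practice is shown below. It assumes a tiny made-up CSV with stray spaces, inconsistent capitalisation, a duplicate and an empty row; the column names and the `clean_rows` function are hypothetical and not taken from the workshop materials.

```python
import csv
import io

# Made-up raw data with typical problems: stray spaces, mixed case,
# a duplicate entry and a completely empty row.
raw = """name,city,year
 Ada Lovelace ,london,1843
ada lovelace,London,1843
Mary Somerville, EDINBURGH ,1831
,,
"""


def clean_rows(text):
    """Trim, normalise and de-duplicate rows from CSV text."""
    reader = csv.DictReader(io.StringIO(text))
    seen = set()
    cleaned = []
    for row in reader:
        # Trim whitespace and normalise capitalisation.
        record = {
            "name": row["name"].strip().title(),
            "city": row["city"].strip().title(),
            "year": row["year"].strip(),
        }
        # Skip rows where every field is empty.
        if not any(record.values()):
            continue
        # Skip rows that repeat an earlier record after normalisation.
        key = tuple(record.values())
        if key in seen:
            continue
        seen.add(key)
        # Store the year as a number, or None when it is missing.
        record["year"] = int(record["year"]) if record["year"] else None
        cleaned.append(record)
    return cleaned


cleaned = clean_rows(raw)
for record in cleaned:
    print(record)
```

Normalising before checking for duplicates matters: the two Ada Lovelace rows only match once whitespace and capitalisation have been made consistent.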

Application forms (https://www.cdh.cam.ac.uk/file/cdhdelvingintomassivedaapplicationdocx) should be returned to CDH Learning (learning@cdh.cam.ac.uk) by Tuesday 6 October 2020. Successful applicants will be notified by Thursday 8 October 2020.

Massive digital archives such as the Internet Archive offer researchers tantalising possibilities for the recovery of lost, forgotten and neglected literary texts. Yet the reality can be very frustrating due to limitations in the design of the archives and the tools available for exploring them. This programme supports researchers in understanding the issues they are likely to encounter and developing practical methods for delving into massive digital archives.

Digital Archival Photography | in-depth new Tue 9 Jan 2024   10:30 Finished

Following the introductory Methods Workshop, held on 21st November 2023, this session will focus on how to adapt the principles to the projects chosen by the participants. This will cover a practical approach to taking images fit for purpose in any conditions with available resources. It may also address more advanced imaging topics such as image stitching, Optical Character Recognition, Multispectral Imaging, or photogrammetry if these are of interest to the participants. It will also be an opportunity to visit the Digital Content Unit at Cambridge University Library.

Digital Data Collection and Wrangling new Tue 14 Jan 2020   11:30 Finished

This session addresses the technical and ethical aspects of digital data collection and wrangling – two fundamental stages in the lifecycle of a digital research project. Participants will be introduced to online data sources and practices of internet-mediated data collection, including retrieving data from social media platforms. As data collected from online sources is often dirty and messy, we will also provide a short practical introduction to the process of transforming raw data into a clean and structured dataset using free and open-source software.

Digital Data Collection (Workshop) new Mon 28 Jan 2019   14:00 Finished

This session is a primer on digital data collection. The goal is to become familiar with online data sources and practices of internet-mediated data collection, including retrieving data from social media platforms.

The shelf-life of your dataset dictates the longevity of your findings. Sharing your data and assuring its integrity is a fundamental part of a digital research project. In this session we will discuss the principles of open data, channels for data dissemination and the fundamentals of data preservation.

Digital Mapping for Historians new Wed 26 Jun 2019   09:30 Finished

This intensive workshop will provide an overview of a range of applications of digital mapping in historical research projects and introduce GIS tools and software.

Digital Research Design and Data Ethics new Tue 24 Nov 2020   10:00 Finished

This CDH Basics session explores the lifecycle of a digital research project across the following stages:

  • design
  • data capture
  • transformation
  • analysis
  • presentation and preservation

It also introduces tactics for embedding ethical research principles and practices at each stage of the research process.

Digital Research Design, Methods and Ethics (Workshop) new Mon 21 Jan 2019   14:00 Finished

Find out how to shape a digital research project from scratch. This session will introduce the building blocks of online research design, from the several methodologies available to conduct the research to the ethical guidelines that should underpin our projects.

Doing Qualitative Research Online new Mon 1 Feb 2021   14:00 Finished

What happens to practices of qualitative research when interactions between researcher and research subject are largely mediated? From observations of users’ interactions on social media platforms, to interviews conducted through WhatsApp or Skype, digital communications offer both opportunities and challenges for qualitative research in a wide range of disciplines across the Social Sciences and Humanities. This methods workshop will explore a wide range of topics including:

  • Establishing trust and credibility
  • Engaging with digital gatekeepers
  • Navigating blurred boundaries between ‘private’ and ‘public’
  • Re-conceptualising ‘researchers’, the ‘research field’ and ‘research subjects’
  • Identity, anonymity and visibility - implications for research practice
  • Mitigation strategies: from data parsimony to creative obfuscation
  • Self-care for researchers in online research
  • Embedding ethical research practice across the project lifecycle

The workshop will take place over two sessions, an introductory seminar and discussion led by Dr Anne Alexander on 1 February, after which participants will be asked to complete a short reflective piece of work assessing their own research design and identifying areas where they feel they need further help and advice. The second session on 8 February will be participant led including small group and plenary discussions exploring strategies for dealing with challenges identified by participants.

Participants should set aside around 1 hour between the two sessions to complete and submit their self-assessment.

Participants are strongly encouraged to attend the CDH Basics session Privacy, information security and consent: a guide for researchers with Dr Anne Alexander on 26 January in advance of the Methods Workshop.

We are currently reformatting our Learning programme for remote teaching; this will require some rescheduling, so bookings will reopen and new sessions will be created for online courses as soon as possible. In the interim we would encourage you to register your interest so as to be notified of the new schedule. We hope to run many of our courses online, but this is dependent on staff availability and resources, so please be aware that we may have to postpone or cancel some sessions.

This workshop will develop your coding practice from testing ideas to creating an efficient workflow for your code, data and analysis. If you are using Jupyter Notebooks (but even if you’re not) this workshop will demonstrate how to better manage your code using good programming practices, and package your code into a program that is easier and quicker to run for lots of data and more reliable.

Required preparation (instructions provided): Python 3 installed on laptop; a text editor or IDE installed on laptop; git installed on laptop and signed up for GitHub; a short internet-based exercise in working with the command line.

Dr Nathan Crilly and Chih-Chun Chen explore the challenges of communicating complex ideas to diverse audiences through a variety of digital media formats. Three case studies will be reported from an EPSRC-funded research project which sought to clarify and communicate the nature of complex system design and its relationship to emerging technologies. For example, the project studied the way in which technologists working in Synthetic Biology and Swarm Robotics conceptualise and address the complexity of the systems they are designing. Outputs from the project include:

  • A 35-page ‘primer’ on the subject of complexity (now with over 6000 downloads)
  • A three-minute animated movie discussing the subjectivity of complexity (now with 2500 views)
  • An interactive website (implemented by Dr Chen since she has programming skills) that generates annotated bibliographies for complexity resources tailored to a user’s interests (launched in March 2019)

Dr Crilly and Dr Chen will discuss the process of engaging with media partners, including working with science communication agencies, animators and film-makers, and reflect on what they learned from the process and what they would do differently in future.

Film-making for Beginners new Sat 1 Dec 2018   09:30 Finished

Learn to think visually and communicate using sound and film: participants will be introduced to the language of film, shot types, camera movements, framing, basic rules of camera use, how to tell a story, and editing in the Phoenix Training Suite.

Film-making for Beginners (Level 2) new Mon 24 Jun 2019   09:30 Finished

Learn to think visually and to communicate using sound and film. Participants will be introduced to the language of film, shot types, camera movements, framing, basic rules of camera use, how to tell a story, and editing. Some prior knowledge of filming is required. Please see the CDH website for more details (www.cdh.cam.ac.uk).

First steps in coding with Jupyter Notebooks new Tue 9 Feb 2021   10:00 Finished

This CDH Basics session is aimed at researchers who have never done any coding before. We will explore basic principles and approaches to writing and adapting code, using the popular programming language Python as a case study. Participants will also gain familiarity with using Jupyter Notebooks, an open-source web application which allows users to create and share documents containing live code alongside visualisations and narrative text.

First Steps in Coding with R new Mon 19 Feb 2024   14:00 Finished

Convenor: Dr Estara Arrant (Cambridge University Library)

This session is aimed at researchers with minimal coding experience or who have not done any coding but have data they want to explore and visualise. However, you do not need to have a full set of data to benefit from this class. You will learn the fundamentals of conducting a basic analysis of Humanities-related data in the R language, including prepping and tidying data and generating useful graphs which communicate information about your research to others. You will also gain a basic overview of the R programming language, which will provide you with principles that you can take forward to learn more advanced data analysis methods. The software we will use (RStudio) is free to download and is compatible with most computers. We will provide installation support and guidance. You will need your own laptop.

First Steps in Version Control with GitHub new Mon 26 Feb 2024   14:00 Finished

Please note this workshop has limited spaces, and a pre-course questionnaire is in place. Please complete it before the session.

Version control helps you to write code for your research more sustainably and collaboratively, in line with best practices for open research. You might use code for collecting, analysing or visualising your data or something else. Everyone who codes in some way can benefit from learning about version control for their daily workflow.

This workshop will cover the importance of version control when developing code and foster a culture of best practices in FAIR (Findable, Accessible, Interoperable, Reusable) code development. We will take you through the basic use of GitHub to help you store, manage, and track changes to your code and develop code collaboratively with others.

Designed with beginners in mind, this workshop caters to those who have not yet delved into Git or GitHub. While prior knowledge of a programming language (e.g., R or Python) would be beneficial, it is not a prerequisite.

From Blog to Book new Thu 10 Oct 2019   14:00 Finished

Blogging as a digital means of research communication seems so simple: with free, easy-to-use platforms we’re all just a few clicks away from setting one up. But having set a blog up, the difficult work begins. Who are you talking to? What are you trying to achieve? How will you generate your content? How will the people you want to talk to find it? How are you going to keep it going alongside your research and teaching commitments? Will it make any difference to anything? And will you ever be able to transform any of this work into a scholarly publication that ‘counts’?

This session will be an interactive conversation between Julie Blake, Cambridge Digital Humanities Methods Fellow and Connie Ruzich, University Professor of English at Robert Morris University, Pittsburgh, USA. Connie’s Behind Their Lines blog started in 2014 during a Fulbright Scholarship at Exeter University to research First World War poetry in the context of the Centenary Commemorations. She became interested in the lost and neglected poetry of the First World War and began blogging about her ‘finds’. Five years later, she has had almost 400,000 visits to her blog, she maintains a lively dialogue with public and academic audiences including via Twitter and she is in the final stages of completing a monograph about this material with Bloomsbury Academic.

We’ll discuss the highs and lows of Connie’s research blogging experience, the surprises, the pitfalls and the lessons learned by hard won experience. We’ll try to answer all the questions listed above, and participants will be invited to join in with their own questions.

Speaker: Mark Algee-Hewitt, Associate Professor of English and Director of the Stanford Literary Lab.

About this Methods workshop

At the heart of many of the current computational models of language usage, from generative A.I. to recommendation engines, are large language models that relate hundreds of thousands, or millions, of words to each other based on shared contexts. Mysterious products of complex modelling algorithms, these objects raise a number of practical (and ethical) questions for Humanities scholars: How are these language models created? What kinds of relationships does their math encode? How do biases in the corpus affect the model? And how can we effectively use them to answer humanities-based questions?

In this workshop, we will explore these questions using a medium-sized language embedding model trained on a corpus of novels. Using approachable code in the R software environment, participants will learn how to manipulate a model, assess similarities and differences within it, visualise relationships between words and even train their own embeddings.
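
As a rough illustration of what "assessing similarity" within an embedding model can mean, the sketch below ranks words by cosine similarity. It is written in Python rather than the R used in the workshop, and the three-dimensional vectors and the function names are invented for this example; they do not come from the workshop's model or materials.

```python
import math

# Toy 3-dimensional "embeddings", invented for illustration only.
# A trained model would have many more words and dimensions.
embeddings = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.9, 0.7, 0.2],
    "apple": [0.1, 0.2, 0.9],
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def most_similar(word, vectors):
    """Rank every other word by cosine similarity to `word`, highest first."""
    target = vectors[word]
    scores = [
        (other, cosine_similarity(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


for other, score in most_similar("king", embeddings):
    print(f"{other}: {score:.3f}")
```

Cosine similarity compares the direction of two vectors and ignores their length, which is one common way of comparing word vectors.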

Emma Reay is a third-year PhD researcher at the University of Cambridge and an associate lecturer at Anglia Ruskin University. Her current project explores depictions of children in videogames, and her research interests include representation studies, children's digital media, gaming and education, and playful activism.

Adam Dixon is a game designer and writer who makes both physical and digital games. He has worked on everything from big public games that involve running around cities to narrative video games about learning scientific skills. Much of his work has involved working with museums and research organisations such as the Wellcome Trust, Science Museum, Nottingham Trent University and the V&A. This has included designing games, using play for public research engagement and most recently, teaching teenagers to create digital games for Wellcome Collection’s Play Well exhibition. Outside of that he works and releases his own games including roleplaying games, LARPs and interactive fiction.

Applications https://www.cdh.cam.ac.uk/file/cdhgamedesign201920applicationdocx-0 should be returned to CDH Learning (learning@cdh.cam.ac.uk) by Wednesday 10 June 2020. Successful applicants will be notified by 15 June 2020.

This online course will introduce participants to the practice of game design. It will explore the different ways that digital and analogue games are designed, particularly how you can design with intent to communicate a mood, theme or message. Participants will learn game design skills - such as boxing-in, design documents and prototyping – alongside opportunities to test them out by creating their own short games. Examples will focus on game design in research-related contexts, including using games as part of your research process and to communicate research outcomes to diverse audiences.

The sessions focus on game design, how to shape mechanics and play experiences, so no technical skills are needed. Participants will create their short games using both non-digital tools and simple, free software that will be taught in the sessions.

Topics covered:

  • Game design basics
  • A chance to play and consider thoughtful games
  • Boxing in
  • Planning games
  • Making games
  • Bitsy and Twine
  • Playtesting and iteration

Format

The course will be delivered online, with live teaching sessions taking place on Zoom.

  • Weds 17 June, 4pm BST: Introduction (45 minutes)
  • Weds 24 June, 4pm BST: Game play feedback (45 minutes)
  • Weds 1 July, 4pm BST: Game design seminar (45 minutes)
  • Weds 15 July, 4pm BST: Final session (60 minutes with break)

A CRASSH blog post was created for the originally scheduled session which may be of interest to read and can be found here: http://www.crassh.cam.ac.uk/blog/post/Play-as-Research-Practice

Game Design Workshop [cancelled - industrial action] new Mon 2 Dec 2019   09:30 CANCELLED

This two-day intensive workshop will introduce participants to the practice of game design. It will explore the different ways that digital and analogue games are designed, particularly how you can design with intent to communicate a mood, theme or message. Participants will learn game design skills - such as boxing-in, design documents and prototyping – alongside opportunities to test them out by creating their own short games.

The sessions focus on game design, how to shape mechanics and play experiences, so no technical skills are needed. Participants will create their short games using both non-digital tools and simple, free software that will be taught in the session.

Course participants will be selected via an application process. Once a provisional place is booked, an application form will be issued for completion and return by 1 November 2019. Once the applications have been reviewed, places will be confirmed directly in the week beginning 18 November 2019.

Generative Adversarial Networks Experimentation Lab new Tue 11 Dec 2018   11:30 Finished

This workshop will discuss prospective methods and approaches for critically engaging with the images of people created through Generative Adversarial Networks, using design experiments as provocations to expand debate about notions of ‘realism’ and ‘authenticity’ in an era where human and machine vision are ever more systematically intertwined.

Ghost fictions (Guided project) new Mon 26 Oct 2020   14:00 Finished

Application forms should be returned to CDH Learning (learning@cdh.cam.ac.uk) by Tuesday 13 October 2020. Successful applicants will be notified by 15 October 2020.

This CDH Guided Project series, which also includes a Methods Workshop, will explore the generation of ‘synthetic’ texts using neural networks.

The release of OpenAI’s GPT-2 and GPT-3 language models in 2019 and 2020 has shown that predictive algorithms trained on very large general datasets can generate ‘synthetic’ texts, perform machine translation tasks, rudimentary reading comprehension, question answering and summarisation automatically without needing large amounts of task-specific training. These ‘ghostwritten’ texts have provoked wide attention in the media.

Researchers have experimented with prompting GPT-3 to write short stories, answer philosophical questions and apparently propose potential medical treatments, although GPT-3 had some difficulty with the question “how many eyes does a horse have?”. The Guardian ‘commissioned’ an op-ed from GPT-3.

Through interactive hands-on sessions and demonstrations we will explore synthetic text production and look at how ideas about the distinction between ‘fact’, ‘fiction’ and ‘non-fiction’ are shaping the reception of this emerging technology. Our aim is to stimulate deeper critical engagement with machine learning by humanities researchers and to encourage more public debate about the role of AI in culture and society.

We invite applications from early career researchers and others at the University of Cambridge to join a small project team for four online sessions during the Guided Project phase in October-November. Participants will need to commit to joining the live sessions and to set aside at least 3-4 hours' work on a small-scale individual project during the course. We are interested in assembling an interdisciplinary group of researchers drawing on insights from across humanities, social science and technology disciplines. Prior knowledge of programming, computer science or Machine Learning is not required.

Humanities Data: a basic introduction new Tue 13 Oct 2020   10:00 Finished

This CDHBasics session will explain what data is, and what ‘humanities data’ looks like (via a behind-the-scenes tour of the Digital Library). This session covers good practice around file formats, version control and the principles of data curation for individual researchers.

Interaction with Machine Learning new Mon 1 Feb 2021   10:00 Finished

Application forms should be returned to CDH Learning (learning@cdh.cam.ac.uk) by Thursday 7 January 2021. We will review applications on a rolling basis and applicants will be notified at the latest by the end of Monday 11 January.

This CDH Guided Project aims to provide humanities, arts and social science researchers with an overview of current theory and practice in the design of human-computer interaction in the age of AI and equip the participants with analytical tools necessary for a critical investigation of contemporary design with AI/ML. Looking closely at interactions between humans and emerging AI systems, the workshop will also explore the potential for interaction between humanities scholars and computer scientists in the process of development and assessment of new solutions.

Lectures and practical research design sessions in Interaction with Machine Learning taught by Professor Alan Blackwell and Advait Sarkar (Microsoft Research) as part of an optional course for Part III and MPhil Computer Science students will form the anchoring element of the Project. These will allow researchers without a Computer Science background to explore how key challenges in AI design are being addressed within the field of interaction design, as well as identify areas in which humanities methodologies and approaches could be adopted to improve the production process, by making it more fair, critical, and socially-aware.

Participants will also take part in three workshops specifically tailored to humanities and social science researchers and will be supported in developing a mini research project investigating how humans interact with systems based on computational models. The projects may include:

  • probing an already existing dataset, system, or user interface from a critical perspective
  • developing an idea for new interaction design based on critical applications of ML/AI.

Please note: no prior practical experience or knowledge of programming is required to take part in the Project, however some awareness of how AI systems work will be beneficial.

Minimum time commitment:

  • 8 weekly online lectures led by Professor Alan Blackwell (Computer Science and Technology) and Advait Sarkar (Microsoft Research). Weekly from 26 January, 2-4pm (with the last hour as an optional session for Guided Project participants).
  • 3 x 1.5 hour specialist workshops for humanities and social science participants led by Tomasz Hollanek and Anne Alexander (CDH)
  • 1.5 hour project showcase and final discussion

Participants are encouraged to set aside additional time to work on their projects between sessions. A Moodle email forum and drop-in ‘clinic’ style support sessions will be available during the Guided Project.

Lecture topics and dates

  • Current research themes in intelligent user interfaces (26 January, 2pm)
  • Program synthesis (2 February, 2pm)
  • Mixed initiative interaction (9 February, 2pm)
  • Interpretability / explainable AI (16 February, 2pm)
  • Labelling as a fundamental problem (23 February, 2pm)
  • Machine learning risks and bias (2 March, 2pm)
  • Visualisation and visual analytics (9 March, 2pm)
  • Research presentations by Computer Science Students (16 March, 2pm)

Workshop themes

  • AI critique, humanities methodologies and user interface design (1 February, 10-11.30am)
  • Recommender systems (1 March 10-11.30am)
  • Machine vision (8 March 10-11.30am)
  • Project presentations and discussion (15 March 10-11.30am)

Objectives

By the end of the course participants should:

  • be familiar with the current state of the art in intelligent interactive systems
  • understand the human factors that are most critical in the design of such systems
  • be able to evaluate evidence for and against the utility of novel systems
  • be able to apply critical methodologies to current interaction design practices
  • understand the interplay between ML/AI research and humanities approaches

We are currently reformatting our Learning programme for remote teaching; this will require some rescheduling, so bookings will reopen and new sessions will be created for online courses as soon as possible. In the interim we would encourage you to register your interest so as to be notified of the new schedule. We hope to run many of our courses online, but this depends on staff availability and resources, so we may have to postpone or cancel some sessions.

This session focusses on providing photography skills for those undertaking archival research. Dr Oliver Dunn has experience spanning more than 10 years digitising written and printed historical sources for major university research projects in the humanities and social sciences. The focus is very much on low-tech approaches and small budgets. We’ll consider best uses of smartphones, digital cameras and tripods.

Introduction to Exhibit.so platform new Thu 28 Jul 2022   10:00 Finished

In this workshop, led by Ed Silverton from Mnemoscene and introduced by Andy Corrigan from Cambridge Digital Library, you will learn about the various features of the exhibit.so platform.

Cambridge Digital Humanities (CDH) is working with Mnemoscene to develop a local instance of the Exhibit tool that will be available to University of Cambridge users.

Exhibit is a tool for visual storytelling developed by Mnemoscene and supported by the Esmée Fairbairn Collections Fund. It is an easy-to-use tool for creating captivating interactive stories and quizzes with Cultural Heritage content, and is now also publicly available at https://www.exhibit.so/. Built using the Universal Viewer, it enables users to load images or 3D objects from any IIIF-supporting online catalogue to tell stories within and across collections.

No prior knowledge of IIIF or Exhibit required!

Outcomes

At the end of the workshop attendees will be able to:

  • Identify the key features of Exhibit
  • Identify how to source existing IIIF manifests or add new ones to Exhibit
  • Create stories, quizzes, and kiosks in Exhibit
  • Embed your Exhibit on your website

Introduction to MorphoSource new Thu 28 Jul 2022   14:00 Finished

Cambridge Digital Humanities is working with MorphoSource to offer an introduction to its platform. In this workshop you will be introduced to the MorphoSource platform, which is a repository for researchers, curators, and everybody to find, view, download, and upload 3D scans and data of natural history, scientific specimens, and cultural objects.

Contributions come from museums, researchers, scholars and specialists to share findings, increase impact, and improve access to material for scientific discovery, sharing, and the advancement of human knowledge.

The workshop will cover:

  • The main features of the platform
  • Usage most relevant to the cultural heritage sector
  • Using the site - searching, exploring, referencing
  • Contributing data
  • Embedding content

The workshop has a GLAM focus and is more about safely storing and providing access to complex visual data than about storytelling, although it still has aspects of engagement. It may also be of interest to those in STEM areas working with 3D or complex visual data, or in scholarly communications and data repositories.

Introduction to Text-Mining with Python 1 new Tue 30 Apr 2019   11:00 Finished

This session will introduce basic methods for reading and processing text files in Python. We will walk through an example that reads in a large text corpus, splits it into tokens (words) and sentences, removes unwanted words (stopwords), counts the words (frequency analysis), and visualises results. We will talk about the 5 steps of text mining and what resources to use when learning text mining for your research in your own time. No prior knowledge of Python is required, and no installations will be needed. We will use web services available in your browser to follow along.
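
The walk-through described above (read text, split it into sentences and tokens, remove stopwords, count words, visualise) can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the session's own notebook: the sample text, the stopword list, the naive sentence splitter and the text-based bar chart are all assumptions made for this example.

```python
import re
from collections import Counter

# A tiny stand-in corpus and stopword list, both invented for this sketch.
text = "The cat sat on the mat. The dog sat on the log."
stopwords = {"the", "on", "a", "an", "and", "of"}


def split_sentences(raw):
    """Naive splitter: break after ., ! or ? when followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", raw) if s.strip()]


def tokenise(raw):
    """Lower-case the text and keep runs of letters (and apostrophes)."""
    return re.findall(r"[a-z']+", raw.lower())


sentences = split_sentences(text)
tokens = tokenise(text)

# Remove stopwords, then count what is left (frequency analysis).
content_words = [t for t in tokens if t not in stopwords]
frequencies = Counter(content_words)

# A very simple "visualisation": one row of # marks per word.
for word, count in frequencies.most_common(3):
    print(f"{word:<5} {'#' * count} ({count})")
```

For real corpora, a dedicated library would usually handle sentence splitting and stopword lists more robustly than the regular expressions used here.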

Introduction to Text-Mining with Python 2 new Tue 7 May 2019   11:00 Finished

This session will introduce topic modelling. Topic modelling is looking for clusters of words that summarise the meaning of documents. We will talk about how to choose what sort of text mining you might want for your research. Some knowledge of Python is required, as gained from 'Introduction to Text-Mining with Python 1', or equivalent. No installations will be needed; we will use web services available in your browser to follow along with the examples.

This online session will introduce basic methods for reading and processing text files in Python with Jupyter Notebooks. We'll discuss why you might wish to do text-mining, and whether coding with Python is the right choice for you. We'll run through the 5 steps of text-mining, and start to walk through an example that reads in a text corpus, splits it into words and sentences (tokens), removes unwanted words (stopwords), counts the tokens (frequency analysis), and visualises results.

This initial session is one hour long and will be delivered remotely by video conferencing. During the session we will cover the essentials of working with the Jupyter Notebooks provided so that you can carry on working through the materials in your own time. The first session will be followed by a second, optional Q&A session for troubleshooting issues and recapping essentials.

Required preparation: A short internet-based exercise in working with variables and text in Python will be sent out one week prior to the session. You will also get instructions on how to find the materials we will be using and how to log onto the video conferencing platform. Please make sure you have some time to prepare properly so that we can concentrate on teaching during the remote session.

Introduction to the Command Line new Tue 5 Dec 2023   11:00 Finished

This session introduces the command line, sometimes also known as the shell or the terminal, to humanities researchers. No prior knowledge of the command line or programming of any kind is required or expected from attendees.

A basic understanding of how to use the command line provides a step change in how productive you can be when working with data or text files, particularly a large number of files or very large files, which can be hard to manipulate in a graphical interface. Some tools and programs can only be used from the command line, and this session aims to give you the confidence to work with them. In the session we primarily look at seven George Eliot novels and a comparative set of seven Dickens novels (about 3.4 million words in total), but the session should be of use to any humanities researchers working with text collections, and the principles have far broader applicability.

We'll focus on running programs which come pre-installed on Mac and Linux, and which can be easily added to Windows. We'll combine these programs in productive ways, discuss how to discover and use the options for each, how to send results to files, and how to work efficiently on the command line so you don't have to retype or remember everything you've done.

This public workshop will mark the end of the 2020 programme of Machine Reading the Archive, a digital methods development programme organised by Cambridge Digital Humanities with the support of the Researcher Development Fund.

It will showcase the digital archive projects created by our cohort of project participants as well as invited contributions from leading experts in the field.

Mapping the Past [remote delivery] new Fri 22 May 2020   11:00 Finished

This intensive workshop is split into two online chats and two 1-hour sessions. Participants will first learn to collect geospatial data from historical sources and then process it using geographical information systems, from Google Earth to QGIS.

The first online session introduces research techniques for collecting, arranging and mapping geospatial data from historical sources, and is taught by Dr Oliver Dunn. His session is split into two parts: Part A will introduce both online sessions by showing some of our own research that makes use of Google Earth, 3D Maps in Excel, and historical GIS. In Part B you will be asked to locate a set of Scotland’s historical lighthouses on historical maps online and map their location and other attributes in Google Earth and 3D Maps.

The second online session introduces students to mapping humanities data using QGIS, a free GIS (Geographical Information System) software platform. Course participants will need to download and install QGIS on their laptops before 5 June. On 1 June there will be further details on downloading QGIS, a chat forum where we can discuss why you might wish to use GIS and whether GIS is the right choice for you, and a release of course teaching materials. On 5 June you will be taken through the map creation process step-by-step. This session will be taught by Max Satchell.

Do you need a database for your data? Or could you store the data in standalone files? Which database paradigm should you consider? What are the consequences of these choices for your work routine? How do you navigate all of this with minimal or no programming experience?

These and more are the questions we will address in the course. We aim to provide a gentle introduction to databases and database paradigms, with examples that help explain the differences between the most common database packages and guide researchers to design suitable solutions for their data problems.

These workshops will offer participants the ability to re-think the graphic design of a musical score and will work with a novel set of principles to modify the spacing, layout, and position of its notes and signs for intelligibility purposes and/or artistic purposes.

In previous experimental research, Arild has found that musical scores with modified engraving, spacing, and layout rules can —at least in certain practices and for certain repertoires— elicit more fluent and precise readings than conventional scores. The abstraction of informational units and of discourse structure from a score seems to be enhanced by his approach of separating and redistributing notation symbols and other visual materials using a digital (quantifiable, taxonomic) hierarchy of divisions comparable to what is nowadays conventionally applied in (Western) language texts. This seems to be facilitating the decoding and apprehension of information, affecting the conversion of notation into performance; it is also being investigated at present in terms of academic and artistic impact.

Participants will be able to use the flexibility and manageability of digital production to introduce a radically new conception of the visual structuring of a musical score: Arild proposes to go beyond the mere reproduction of analogical models with digital tools; for that, participants will be experimenting with novel flexible spacing, layout and visual structuring cues that could be enhancing, in music reading, the integrative and abstractive processes that fluent readers already use in language (we do not read sequentially letter by letter; good readers group, prioritise and predict the symbols presented to them). This approach is intrinsically digital, as it is based on being able to use the symbols of a score in a modular, movable, and experimental manner —and in this context 'experimental' would naturally include heuristic or intuitive manipulations by the score users. Arild's view is that a novel conception of music notation should include the possibility of re-organising the materials, allowing the user at either end (creator or reader) to group, separate, highlight and grade visually the symbols present in a score.

Isabelle Higgins, Methods Fellow - Cambridge Digital Humanities

This Methods Fellows' Workshop Series event aims to encourage participants to think critically and reflexively about the nature of digital humanities research. It will explore (both individually and collectively) the function and effect of critical, intersectional and decolonial research methods and their impact on research fields, participants and research outputs.

For each seminar, participants will be provided with a reading list that will contain both core introductory texts and additional readings. They will be expected to do 30 minutes of reading ahead of each seminar. The seminars themselves will be a mix of presentations, small group discussion and the study of specific empirical cases.

Throughout the seminars we will collectively assemble a shared bibliography of academic texts and other digital resources. Participants will also be encouraged to bring and share examples and challenges from their own research.

To increase space for discussion and critical reflection, participants will be encouraged to form small working groups, focused on the seminar theme they find most productive, and to connect with their working group for a 30-minute call to reflect on their chosen seminar outside of the scheduled four hours of teaching. There will be the option to feed back on these discussions to the wider group, deepening our shared understanding of the content covered in the course. Isabelle will also hold virtual office hours following the seminar series. In these ways and others, the series will aim to cater for those new to this area of research, as well as for scholars who are already working in digital humanities.

Key topics covered in the sessions will include:

  • Seminar 1: Digital Humanities in Social and Historical Context: Considering what and how we research

We will focus on placing digital humanities, as a discipline, in the context of its emergence. Disciplinary Sociology, for example, is increasingly grappling with its colonial past (Meghji, 2020). What happens when we examine the history and context of digital humanities? McIlwain (2020) reminds us of the historical ties between the development of computational technology and the surveillance of Black bodies. Yet digital humanities research has also sought to challenge the legal, social and political power exercised through digital systems (Selwyn, 2019). Does contextualising our methods change how we approach them?

  • Seminar 2: Critical approaches to Digital Environments: Affordances, Interfaces, AI, Algorithms

We will draw on the vast range of work produced by race critical code scholars, which helps us to explore the assumptions and inequalities that are coded into the software we study (or use to conduct our studies). Ruha Benjamin (2016a:150) reminds us to ask of digital technology: 'who and what is fixed in place – classified, corralled, and/or coerced, to enable innovation?' How does a consideration of encoded digital inequalities affect our methodologies?

  • Seminar 3: Critical Engagement with User Generated Content: Beyond content & discourse analysis

We will draw on critical theories that draw attention to the digital and social constructs and conventions that shape the production of user-generated content, with Brock's (2018) Critical Techno-Cultural Discourse Analysis as one such methodological contribution. We'll explore what happens to our research when we broaden our methodological framing, considering the type of content produced by users and how it is produced, who is producing it, and what governs this production.

  • Seminar 4: Looking forward: Our roles as researchers in Digital Humanities

We will pay attention to the growing calls from a range of cross-disciplinary scholars who invite us to actively consider the impact of our methods on the future. We'll explore different notions of methodological responsibility and innovation, from the speculative (Benjamin, 2016b), to the caring (de la Bellacasa, 2011), to the adaptive and inductive (Markham & Buchanan, 2012). What happens when we place our research into its broader context and consider how our methods will shape the future of our discipline?

This course demystifies the principles of data visualisation and the practice of creating graphs in Python. It helps trainees understand and reflect on how good data visualisation can be achieved under “5 Principles”, and extends their use of Python from analysis to visualisation. It is aimed at students and staff who are interested in or use data visualisation in research or outreach, and who have basic Python knowledge. The course consists of a 4-hour workshop (on Zoom), about 2 hours of self-paced preparation and post-class exercises, and 1 hour of asynchronous Q&A, combining theory, case studies, peer interaction and practical work. We first introduce key concepts of and problems in data visualisation, then move to case studies and group discussion of data visualisation principles and how to visualise data better in practice; finally, following a demonstration, we use Python to visualise data and go through types of graphs.

Itamar Shatz - Methods Fellow CDH

This course will introduce participants to key concepts in statistical analyses, including statistical significance, effect sizes, and linear models. The goal is to give participants the basic tools that they need in order to understand the use of statistical methods by others and to use these methods effectively in their own research. We will focus on an intuitive and practical understanding of statistical analyses, rather than on the mathematical details underlying them. As such, the course will be accessible for those without a quantitative background, although it will help to have knowledge of basic descriptive statistics (e.g., mean and standard deviation).

The course will cover (approximately) the following topics:

  • Session 1: statistical significance and statistical tests (including hypothesis testing, p-values, statistical power, t-test, and chi-square test).
  • Session 2: effect sizes, correlation, confidence intervals, and outliers.
  • Session 3: linear regression (including simple/multiple regression, residuals, beta coefficients, and R-Squared).
  • Session 4: linear regression continued (including test statistics, standard errors, centering, interaction, categorical predictors, linear models, and assumption testing).
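
To give a sense of what the regression sessions cover, the sketch below fits a simple linear regression by least squares and reports the slope, intercept and R-squared. It is an illustrative Python example rather than course material: the data are invented, and the course description does not say which software is used.

```python
from statistics import mean

# Invented data for illustration: a predictor x and an outcome y.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]


def simple_linear_regression(xs, ys):
    """Fit y = intercept + slope * x by least squares.

    Returns (slope, intercept, r_squared).
    """
    x_bar, y_bar = mean(xs), mean(ys)
    # Slope = covariation of x and y divided by the variation of x.
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    sxx = sum((xi - x_bar) ** 2 for xi in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # Residuals are the gaps between observed and predicted values.
    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(xs, ys)]
    ss_res = sum(r ** 2 for r in residuals)
    ss_tot = sum((yi - y_bar) ** 2 for yi in ys)
    # R-squared: the share of variation in y accounted for by the line.
    r_squared = 1 - ss_res / ss_tot
    return slope, intercept, r_squared


slope, intercept, r_squared = simple_linear_regression(x, y)
print(f"slope={slope:.2f} intercept={intercept:.2f} R^2={r_squared:.2f}")
```

The function assumes at least two distinct x values and some variation in y; with constant inputs the divisions would fail.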

This course looks at how modern computational techniques in logic can be used to approach historical questions in the history of logic while also reflecting on the differences and similarities between historical and modern approaches to logic.

Historically, the course will focus on two authors’ approaches to modal logic, the branch of logic that deals with possibility, necessity, and contingency: Ibn Sina (10th–11th century) and John Buridan (14th century). Using these two authors and their discussions of logic as a starting place, we will look at how their logical systems can be represented and formalised using contemporary computational methods. We will also reflect on the similarities and differences between historical approaches to analysing validity and on their relationship to modern notions of algorithms.

The overarching aim of the course is to develop the framework that allows us to computationally show that Buridan and Ibn Sina are working with the same modal logic under two different presentations.

This course will be of interest to academics at all levels (including PhD students) who travel to remote locations (including small libraries worldwide) to access their primary material (often pamphlets and hand-written ephemera) which they are interested in digitising not only for their own scholarly appraisal, but also as a means of enabling access to the wider academic community. We will go step-by-step through preparation of materials, cataloguing systems, rigs and illumination, tethered photography using Lightroom, smartphone lenses and Halide, and packaging and checksums. We will also be discussing theoretical and ethical questions around decolonisation, reparation, and handling of Black and Indigenous heritage.

Methods Fellows Series | Social Network Analysis new Tue 8 Mar 2022   14:00 Finished

Thomas Cowhitt, Methods Fellow - Cambridge Digital Humanities

This Methods Fellow's Workshop Series event will introduce users to social network analysis in R. Participants will be asked to generate their own relational dataset. We will then use several R packages to visualize and interpret relational data. By the conclusion of this course, users will be able to construct a relational dataset, load and clean this dataset in R, and generate static network diagrams and reports on descriptive network statistics.

Methods Fellows Series | Visualising Data Clearly new Wed 4 May 2022   14:00 Finished

If you've ever collected some data but weren't sure how to go about visualising it in a way that could help you uncover new insights, or if you've struggled to present data in a way that helped others understand your findings, this course is intended for you.

We'll talk about how to select the right visualisation for your data, discuss the pros and cons of different approaches, and get hands-on experience displaying information in clear and compelling ways. We'll also discuss broader issues surrounding visualisation science, such as common ways that visualisations are misinterpreted and how to avoid them, and controversies around what counts as best practice in visual communication.

In addition to the weekly online sessions, participants are expected to spend around two hours per week applying the skills learnt to gain greater fluency and enable us to 'workshop' each other's visualisations.

Your participation will also benefit if you have the chance to take our "Give me 5! Principles of Data Visualisation", which is scheduled for 23rd & 30th March. However, attending this workshop is not a prerequisite, so please do not be deterred if you miss the dates.

Methods Fellow Workshop: Audible knowledge: soundscapes, podcasts and digital audio scholarship

Dr Peter McMurray (CDH Methods Fellow)

With the rise of web-based scholarship and affordable digital audio equipment, artists and researchers are increasingly turning to audio formats as a way to share their work with a larger audience and to cultivate new forms of knowledge rooted in listening. This workshop will offer an introduction to digital audio recording and editing (using Reaper, a digital audio workstation which can be downloaded/used for free on an extended trial basis). We will focus particularly on the editing choices for soundscape composition and podcasting, and participants will have the opportunity to produce a short audio piece over the course of the workshop.

Applications for this workshop have now closed.

As religious services and communities have shifted online, so too have scholars of religion. But at what cost? These sessions raise some of the epistemological and ethical issues of doing fieldwork in a digital environment from an inclusive anthropological perspective, with a close-up on a particular case study in each session.

The first session considers virtual ethnography, what is gained and what is lost, with a focus on ethnography with Orthodox Jewish populations. The second session assesses digital surveys of religious communities and their attitudes, e.g. what the 'bean-counters' might miss (and strategies for not missing it). Finally, the third session problematizes the ethical tensions in online studies of community media, with a particular focus on French Muslim media, which are already heavily surveilled.

The sessions are intended to develop researcher knowledge and explore cross-cutting issues that concern a broad spectrum of humanities and social science-based scholarship, serving as:

  • a forum for the critical discussion of digital methods and epistemologies,
  • a place to learn more about specific case studies particularly in the UK and France, and
  • an assembly of early research minds who are themselves in the throes of a related or relevant project and who wish to share and learn from one another.

Applications for this workshop have now closed.

The corpus linguistic approach to language is based on collections of electronic texts. It uses software to search for and quantify the linguistic phenomena that make up patterns, which it then compares within and across texts based on their frequency. Corpus stylistics applies tools and methods from corpus linguistics to stylistic research. It mainly focuses on literary texts, either individual texts or corpora. Corpora here are usually principled collections of texts, for example a collection of texts by one author, or texts from a specific period. Corpus stylistics focuses both on more general patterns and meanings that are observable across corpora and on patterns and meanings in one individual text. In terms of the quantitative approaches it employs, corpus stylistics is in many ways similar to work that is referred to as ‘distant reading’ and also ‘cultural analytics’. These approaches emphasise the gains that we get from looking at texts from a “distance”, i.e., in large quantities. For corpus stylistics, it is the relationship between the quantitative and the qualitative that is central. Therefore, research in corpus stylistics often deals with much smaller, “cleaner” data sets, so that the qualitative step in the analysis is more manageable.
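
The frequency comparison described above can be sketched in a few lines of Python. This is a minimal illustration only, not the workshop's own tooling: the function names `relative_frequencies` and `compare`, the simple tokenisation rule, and the two invented sample sentences are all assumptions introduced here.

```python
import re
from collections import Counter


def relative_frequencies(text, per=1000):
    """Return each word's frequency per `per` tokens of `text`."""
    # Assumed tokenisation: lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    total = len(tokens)
    if total == 0:
        return {}
    counts = Counter(tokens)
    return {word: count * per / total for word, count in counts.items()}


def compare(word, text_a, text_b):
    """Return the normalised frequency of `word` in each of two texts."""
    freq_a = relative_frequencies(text_a).get(word, 0.0)
    freq_b = relative_frequencies(text_b).get(word, 0.0)
    return freq_a, freq_b


# Two invented sentences standing in for real corpora (hypothetical data).
sample_a = "The fog crept over the river and the fog lay on the marshes."
sample_b = "She walked to the village and spoke to no one on the way."

print(compare("fog", sample_a, sample_b))
```

Normalising counts per 1,000 tokens is what makes texts of different lengths comparable; deciding whether a difference in frequency is meaningful is the qualitative step that follows.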

This workshop aims to introduce the basic corpus linguistic techniques and methods for working with literary and other texts. It aims:

  • To provide an introduction to corpus linguistics in relation to digital humanities approaches;
  • To develop a critical understanding of how the representativeness of data used in quantitative research may influence results;
  • To critically examine the relationship between quantitative and qualitative textual analyses;
  • To provide a practical toolkit for computational textual analysis.

The aim of this course is to support students, researchers, and professionals interested in exploring the changing nature of the English vocabulary in historical texts at scale, and to reflect critically on the limitations of these computational analyses. We will focus on computational methods for representing word meaning and word meaning change from large-scale historical text corpora. The corpus used will consist of Darwin’s letters from the Darwin Project (https://www.darwinproject.ac.uk/) at Cambridge University Library. All code will be in online Python notebooks.

If you are interested in attending this course, please fill in the application form
