Cambridge Centre for Research Informatics Training course timetable
June 2025
Mon 23 |
Generalised linear models are the kind of models we would use if we had to deal with non-continuous response variables. For example, this happens if you have count data or a binary outcome. This course aims to introduce generalised linear models, using the R software environment. Similar to Core statistics using R this course addresses the practical aspects of using these models, so you can explore real-life issues in the biological sciences. The Generalised linear models course builds heavily on the knowledge gained in the core statistics sessions, which means that the Core statistics using R course is a firm prerequisite for joining. There are several aims to this course: 1. Be able to distinguish between linear models and generalised linear models 2. Analyse binary outcome and count data using R 3. Critically assess model fit R is an open-source programming language so all of the software we will use in the course is free. We will be using the R Studio interface throughout the course. Most of the code will be focussed around the tidyverse and tidymodels packages, so a basic understanding of the tidyverse syntax is essential. If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Tue 24 |
This workshop focuses on expression proteomics, which aims to characterise the protein diversity and abundance in a particular system. You will learn about the bioinformatic analysis steps involved when working with these kind of data, in particular several dedicated proteomics Bioconductor packages, part of the R programming language. We will use real-world datasets obtained from label free quantitation (LFQ) as well as tandem mass tag (TMT) mass spectrometry. We cover the basic data structures used to store and manipulate protein abundance data, how to do quality control and filtering of the data, as well as several visualisations. Finally, we include statistical analysis of differential abundance across sample groups (e.g. control vs. treated) and further evaluation and biological interpretation of the results via gene ontology analysis. By the end of this workshop you should have the skills to make sense of expression proteomics data, from start to finish.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Wed 25 |
This workshop focuses on expression proteomics, which aims to characterise the protein diversity and abundance in a particular system. You will learn about the bioinformatic analysis steps involved when working with these kind of data, in particular several dedicated proteomics Bioconductor packages, part of the R programming language. We will use real-world datasets obtained from label free quantitation (LFQ) as well as tandem mass tag (TMT) mass spectrometry. We cover the basic data structures used to store and manipulate protein abundance data, how to do quality control and filtering of the data, as well as several visualisations. Finally, we include statistical analysis of differential abundance across sample groups (e.g. control vs. treated) and further evaluation and biological interpretation of the results via gene ontology analysis. By the end of this workshop you should have the skills to make sense of expression proteomics data, from start to finish.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Knowing how to use High Performance Computing (HPC) systems is crucial for fields such as bioinformatics, big data analysis, image processing, machine learning, parallel task execution, and other high-throughput applications. In this introductory course, you will learn the fundamentals of HPC, including what it is and how to effectively utilise it. We will cover best practices for working with HPC systems, explain the roles of "login" and "compute" nodes, outline the typical filesystem organization on HPC clusters, and cover job scheduling with the widely-used SLURM scheduler. This hands-on workshop is designed to be accessible to researchers from various backgrounds, providing numerous opportunities to practice and apply the skills you acquire. As an optional session for those interested, we will also introduce the (free) HPC facilities available at Cambridge University (the course is not otherwise Cambridge-specific).
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
|
Thu 26 |
Knowing how to use High Performance Computing (HPC) systems is crucial for fields such as bioinformatics, big data analysis, image processing, machine learning, parallel task execution, and other high-throughput applications. In this introductory course, you will learn the fundamentals of HPC, including what it is and how to effectively utilise it. We will cover best practices for working with HPC systems, explain the roles of "login" and "compute" nodes, outline the typical filesystem organization on HPC clusters, and cover job scheduling with the widely-used SLURM scheduler. This hands-on workshop is designed to be accessible to researchers from various backgrounds, providing numerous opportunities to practice and apply the skills you acquire. As an optional session for those interested, we will also introduce the (free) HPC facilities available at Cambridge University (the course is not otherwise Cambridge-specific).
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Introduction to R (IN-PERSON)
[Places]
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
|
Fri 27 |
Knowing how to use High Performance Computing (HPC) systems is crucial for fields such as bioinformatics, big data analysis, image processing, machine learning, parallel task execution, and other high-throughput applications. In this introductory course, you will learn the fundamentals of HPC, including what it is and how to effectively utilise it. We will cover best practices for working with HPC systems, explain the roles of "login" and "compute" nodes, outline the typical filesystem organization on HPC clusters, and cover job scheduling with the widely-used SLURM scheduler. This hands-on workshop is designed to be accessible to researchers from various backgrounds, providing numerous opportunities to practice and apply the skills you acquire. As an optional session for those interested, we will also introduce the (free) HPC facilities available at Cambridge University (the course is not otherwise Cambridge-specific).
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Introduction to R (IN-PERSON)
[Places]
R is one of the leading programming languages in Data Science. It is widely used to perform statistics, machine learning, visualisations and data analyses. It is an open source programming language so all the software we will use in the course is free. This course is an introduction to R designed for participants with no programming experience. We will start from scratch by introducing how to start programming in R and progress our way and learn how to read and write to files, manipulate data and visualise it by creating different plots - all the fundamental tasks you need to get you started analysing your data. During the course we will be working with one of the most popular packages in R; tidyverse that will allow you to manipulate your data effectively and visualise it to a publication level standard.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
|
Mon 30 |
This comprehensive course equips you with essential skills and knowledge in bacterial genomics analysis, primarily using Illumina-sequenced samples. You'll gain an understanding of how to select the most appropriate analysis workflow, tailored to the genome diversity of a given bacterial species. Through hands-on training, you'll apply both de novo assembly and reference-based mapping approaches to obtain bacterial genomes for your isolates. You will apply standardised workflows for genome assembly and annotation, including quality assessment criteria to ensure the reliability of your results. Along with typing bacteria using methods such as MLST, you'll learn how to construct phylogenetic trees using whole genome and core genome alignments, enabling you to explore the evolutionary relationships among bacterial isolates. You’ll extend this to estimate a time-scaled phylogeny using a starting phylogenetic tree. Lastly, you'll apply methods to detect antimicrobial resistance genes. As examples we will use Mycobacterium tuberculosis, Staphylococcus aureus and Streptococcus pneumoniae, allowing you to become well-equipped to conduct bacterial genomics analyses on a range of species.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
July 2025
Tue 1 |
This comprehensive course equips you with essential skills and knowledge in bacterial genomics analysis, primarily using Illumina-sequenced samples. You'll gain an understanding of how to select the most appropriate analysis workflow, tailored to the genome diversity of a given bacterial species. Through hands-on training, you'll apply both de novo assembly and reference-based mapping approaches to obtain bacterial genomes for your isolates. You will apply standardised workflows for genome assembly and annotation, including quality assessment criteria to ensure the reliability of your results. Along with typing bacteria using methods such as MLST, you'll learn how to construct phylogenetic trees using whole genome and core genome alignments, enabling you to explore the evolutionary relationships among bacterial isolates. You’ll extend this to estimate a time-scaled phylogeny using a starting phylogenetic tree. Lastly, you'll apply methods to detect antimicrobial resistance genes. As examples we will use Mycobacterium tuberculosis, Staphylococcus aureus and Streptococcus pneumoniae, allowing you to become well-equipped to conduct bacterial genomics analyses on a range of species.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Wed 2 |
This comprehensive course equips you with essential skills and knowledge in bacterial genomics analysis, primarily using Illumina-sequenced samples. You'll gain an understanding of how to select the most appropriate analysis workflow, tailored to the genome diversity of a given bacterial species. Through hands-on training, you'll apply both de novo assembly and reference-based mapping approaches to obtain bacterial genomes for your isolates. You will apply standardised workflows for genome assembly and annotation, including quality assessment criteria to ensure the reliability of your results. Along with typing bacteria using methods such as MLST, you'll learn how to construct phylogenetic trees using whole genome and core genome alignments, enabling you to explore the evolutionary relationships among bacterial isolates. You’ll extend this to estimate a time-scaled phylogeny using a starting phylogenetic tree. Lastly, you'll apply methods to detect antimicrobial resistance genes. As examples we will use Mycobacterium tuberculosis, Staphylococcus aureus and Streptococcus pneumoniae, allowing you to become well-equipped to conduct bacterial genomics analyses on a range of species.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Thu 3 |
This comprehensive course equips you with essential skills and knowledge in bacterial genomics analysis, primarily using Illumina-sequenced samples. You'll gain an understanding of how to select the most appropriate analysis workflow, tailored to the genome diversity of a given bacterial species. Through hands-on training, you'll apply both de novo assembly and reference-based mapping approaches to obtain bacterial genomes for your isolates. You will apply standardised workflows for genome assembly and annotation, including quality assessment criteria to ensure the reliability of your results. Along with typing bacteria using methods such as MLST, you'll learn how to construct phylogenetic trees using whole genome and core genome alignments, enabling you to explore the evolutionary relationships among bacterial isolates. You’ll extend this to estimate a time-scaled phylogeny using a starting phylogenetic tree. Lastly, you'll apply methods to detect antimicrobial resistance genes. As examples we will use Mycobacterium tuberculosis, Staphylococcus aureus and Streptococcus pneumoniae, allowing you to become well-equipped to conduct bacterial genomics analyses on a range of species.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Fri 4 |
As the use of AI chatbots continues to rise, it is crucial to understand what they are, how they work, and how to make the most of them. Prompting is the method of interacting with AI, and as AI chatbots become more openly and widely available, a good and effective prompt could make all the difference. In this course, we will provide background on the history of AI chatbots as well as an understanding of how they work. We will provide hands-on use cases of how to prompt like a bioinformatician/software engineer as well as providing strategies and tactics of prompting to unleash the full potential of AI chatbots in biological data analysis. This course is aimed at researchers with no computational background.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Thu 10 |
This course provides a practical introduction to the writing of Python programs for the complete novice. Participants are lead through the core concepts of Python including Python syntax, data structures and reading/writing files. These are illustrated by a series of example programs. Upon completion of the course, participants will be able to write simple Python programs.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing. This course offers an introduction to single-cell RNA sequencing (scRNA-seq) analysis. Participants will gain hands-on experience with key software packages and methodologies for processing, analyzing, and interpreting scRNA-seq data. Key topics include data preprocessing, quality control, normalization, dimensionality reduction, batch correction and data integration, cell clustering and differential expression and abundance analysis. By the end of the course, students will be equipped with the skills to independently conduct and critically analyse data from scRNA-seq experiments.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
|
Fri 11 |
This course provides a practical introduction to the writing of Python programs for the complete novice. Participants are lead through the core concepts of Python including Python syntax, data structures and reading/writing files. These are illustrated by a series of example programs. Upon completion of the course, participants will be able to write simple Python programs.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing. This course offers an introduction to single-cell RNA sequencing (scRNA-seq) analysis. Participants will gain hands-on experience with key software packages and methodologies for processing, analyzing, and interpreting scRNA-seq data. Key topics include data preprocessing, quality control, normalization, dimensionality reduction, batch correction and data integration, cell clustering and differential expression and abundance analysis. By the end of the course, students will be equipped with the skills to independently conduct and critically analyse data from scRNA-seq experiments.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
|
Mon 14 |
Recent technological advances have made it possible to obtain genome-wide transcriptome data from single cells using high-throughput sequencing. This course offers an introduction to single-cell RNA sequencing (scRNA-seq) analysis. Participants will gain hands-on experience with key software packages and methodologies for processing, analyzing, and interpreting scRNA-seq data. Key topics include data preprocessing, quality control, normalization, dimensionality reduction, batch correction and data integration, cell clustering and differential expression and abundance analysis. By the end of the course, students will be equipped with the skills to independently conduct and critically analyse data from scRNA-seq experiments.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Tue 15 |
This course gives an introduction to linear mixed effects models, also called multi-level models or hierarchical models, for the purposes of using them in your own research or studies. We emphasise the practical skills and key concepts needed to work with these models, using applied examples and real datasets. After completing the course, you should have:
Please note that this course builds on knowledge of linear modelling, therefore should not be considered a general introduction to statistical modelling.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|
Wed 16 |
This course gives an introduction to linear mixed effects models, also called multi-level models or hierarchical models, for the purposes of using them in your own research or studies. We emphasise the practical skills and key concepts needed to work with these models, using applied examples and real datasets. After completing the course, you should have:
Please note that this course builds on knowledge of linear modelling, therefore should not be considered a general introduction to statistical modelling.
If you do not have a University of Cambridge Raven account please book or register your interest here. If for any reason the above links do not work, please email Research Informatics Training Team with details of your course enquiry. Additional information
|