Date: 16 - 19 July 2018

Loading map...

This workshop will focus on the core steps involved in calling variants with the Broad’s Genome Analysis Toolkit, using the “Best Practices” developed by the GATK team. You will learn why each step is essential to the variant discovery process, what are the operations performed on the data at each step, and how to use the GATK tools to get the most accurate and reliable results out of your dataset.

In the course of this workshop, we highlight key functionalities such as the germline GVCF workflow for joint variant discovery in cohorts, somatic variant discovery using MuTect2, and copy number variation discovery using GATK-CNV. All analyses are demonstrated using GATK version 4. Finally, we demonstrate the use of pipelining tools to assemble and execute GATK workflows.

The workshop covers basic genomics, all currently supported Best Practices pipelines as well as pipelining with WDL/Cromwell/FireCloud. This includes the logic of the major pipelines, file formats and data transformations involved, and hands-on operation of the tools using goal-oriented exercises.

*Day 1: Introduction to Genomics, GATK Best Practices and Pipelining

*Day 2: Germline short variant discovery (SNPs + Indels)

*Day 3: Somatic variant discovery (SNVs + Indels + CNVs)

*Day 4: Writing pipelines with WDL and running them in FireCloud

Please note that this workshop is focused on human data analysis. The majority of the materials presented does apply equally to non-human data, and we will address some questions regarding adaptations that are needed for analysis of non-human data, but we will not go into much detail on those points.

Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here.''

Keywords: HDRUK

Venue: Craik-Marshall Building

City: Cambridge

Country: United Kingdom

Postcode: CB2 3AR

Organizer: University of Cambridge

Host institutions: University of Cambridge Bioinformatics Training

Target audience: The course is aimed primarily at mid-career scientists – especially those whose formal education likely included statistics, but who have not perhaps put this into practice since., Graduate students, Postdocs and Staff members from the University of Cambridge, Institutions and other external Institutions or individuals

Event types:

  • Workshops and courses

Scientific topics: Bioinformatics, Data mining, Data visualisation, Genomics


Activity log