e-learning
Preparing genomic data for phylogeny reconstruction
Abstract
A robust and well-resolved phylogenetic classification is essential to understand genetic relationships within and between species and the evolution of their phenotypic diversity. In the last decade the genomic revolution has represented a drastic change in the amount of data used for phylogenetic inference. The single-gene approach using universal phylogenetic markers for the different lineages across the tree of life, is now being replaced by the assembly of taxon-rich and genome-scale data matrices, the so called phylogenomic approach.
About This Material
This is a Hands-on Tutorial from the GTN which is usable either for individual self-study, or as a teaching material in a classroom.
Questions this will address
- How do I find a set of common proteins (orthologs) across related species or strains?
- How do I organize a set of orthologs to infer evolutionary relations between species or strains (phylogenetic reconstruction)?
Learning Objectives
- Mask repetitive elements from a genome
- Annotate (predict protein-coding genes) the genomes of the samples to compare
- Find a set of common proteins across the samples (orthologs)
- Align orthologs across samples
Licence: Creative Commons Attribution 4.0 International
Keywords: Ecology, data handling, functional annotation, phylogeny
Target audience: Students
Resource type: e-learning
Version: 4
Status: Active
Prerequisites:
- An Introduction to Genome Assembly
- Introduction to Galaxy Analyses
Learning objectives:
- Mask repetitive elements from a genome
- Annotate (predict protein-coding genes) the genomes of the samples to compare
- Find a set of common proteins across the samples (orthologs)
- Align orthologs across samples
Date modified: 2024-06-27
Date published: 2022-10-21
Contributors: Brigida Gallone, Miguel Roncoroni
Scientific topics: Ecology
Activity log