Data exploration with Python 2023
This workshop is ideal for researchers and technical workers with a background in biology and a basic knowledge of Python, to work with large, complex datasets, mine them for biological insights, and create visualizations to display the results.
Date: 20 November - 1 December 2023
Timezone: London
Much of the popularity of Python stems from the availability of high quality libraries of existing code that we can use for our own projects. Libraries ("packages", in Python terminology) are even more useful when they are designed to work together.
For scientific programming, we are lucky to have a collection of mature packages which work together to form a stack:
numpy for numerical processing
pandas for reading, cleaning and processing tabular data files
matplotlib as a low-level charting library
seaborn as a high-level charting library for rapid dataset exploration through visualization
In this course, we will learn how to use these packages together to quickly explore large biological datasets, find meaningful patterns in the data, and present our results clearly.
We will focus on the high-level packages - pandas and seaborn - as this will allow us to do the most work with the smallest amount of code. By concentrating on just two packages for an entire course, we will be able to cover a large part of what these tools can do.
This event will be delivered virtually via Zoom & Slack, see more details on the further information tab.
Please see here for a detailed syllabus of the course.
Contact: [email protected]
Keywords: Python
Organizer: Earlham Institute
Host institutions: Earlham Institute
Activity log