Python Data Wrangling Boot Camp

Introduction to Data Wrangling, Cleaning and Manipulation with the Python Programming Language
10:00 am
5:00 pm
Virtual, Live-stream
Susan McGregor, PhD, Associate Research Scholar, Data Science Institute (DSI).
Department of Environmental Health Sciences
SHARP Training Program
Open to the Public

This two-day course will provide an introduction to the Python programming language and how it can be used to do essential data wrangling, manipulation and cleaning tasks with real-world examples from the biomedical field. Using scalable methods and popular libraries for data manipulation, basic statistical analysis and visualization, this course will provide all the necessary tools and background for getting started with Python for data work.

By the end of the workshop, participants will be able to:

- Load and explore data sets in Python
- Join, reconcile and otherwise clean up messy data sets
- Do basic statistical analyses
- Render exploratory visualizations
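
To give a rough sense of what the first three tasks above can look like in practice, here is a minimal sketch using only the Python standard library. It is not taken from the course materials: the sample data, the column names (`patient_id`, `age`, `systolic_bp`) and the helper functions are all hypothetical, and the workshop itself may well use other libraries for these steps.

```python
import csv
import io
import statistics

# Hypothetical sample data: two small CSV tables sharing a patient_id column.
PATIENTS_CSV = """patient_id,age
p1,34
p2,
p3,58
"""

VISITS_CSV = """patient_id,systolic_bp
p1,118
p2,131
p3,142
p4,125
"""


def load_rows(text):
    """Load CSV text into a list of dicts (one dict per row)."""
    return list(csv.DictReader(io.StringIO(text)))


def inner_join(left, right, key):
    """Join two lists of row dicts on a shared key, keeping only matches."""
    right_by_key = {row[key]: row for row in right}
    return [
        {**row, **right_by_key[row[key]]}
        for row in left
        if row[key] in right_by_key
    ]


def drop_missing(rows, column):
    """Remove rows where the given column is empty."""
    return [row for row in rows if row[column].strip()]


# Load and explore: read both tables and look at what came in.
patients = load_rows(PATIENTS_CSV)
visits = load_rows(VISITS_CSV)

# Join and clean: p4 has no patient record, and p2 has no recorded age.
joined = inner_join(patients, visits, "patient_id")
cleaned = drop_missing(joined, "age")

# Basic statistics on the cleaned rows.
ages = [int(row["age"]) for row in cleaned]
pressures = [int(row["systolic_bp"]) for row in cleaned]
mean_age = statistics.mean(ages)
mean_bp = statistics.mean(pressures)

print(f"{len(joined)} joined rows, {len(cleaned)} after cleaning")
print(f"mean age: {mean_age}, mean systolic BP: {mean_bp}")
```

The sketch keeps each step as a small, separate function so that loading, joining and cleaning can be inspected one at a time, which is also how these tasks are listed above.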

Investigators at all career stages are welcome to attend, and we particularly encourage trainees and early-stage investigators to participate. Please note that this is a live-streamed, virtual training, not a self-paced, pre-recorded online course.


No prior programming experience is required to participate in this workshop. However, participants must have (or create) an unrestricted Google account for working with sample notebooks (via Google Colab) and data sets. In addition, participants will be expected to complete a brief survey and watch up to 3 hours of pre-recorded introductory material before the start of the real-time workshop activities.


Training Director: Susan McGregor, PhD, Associate Research Scholar, Columbia University Data Science Institute (DSI).


- Scholarships are available:
- Subscribe for updates:
- Email our team:

Capacity is limited. Paid registration is required to attend.

