Python Data Wrangling Boot Camp: Introduction to Data Wrangling, Cleaning and Manipulation with the Python Programming Language

Python Data Wrangling Training NYC SHARP ProgramThe most recent Python Boot Camp was July 26-27, 2021.  Sign up below to hear about the next training!


The Python Boot Camp is a two-day intensive boot camp that combines seminars with hands-on sessions to introduce key Python concepts alongside practical exercises for wrangling and analyzing biomedical data.

Subscribe for updates on registration and scholarship dates, deadlines, and announcements.



Summer 2021 dates: Live-stream, online training July 26-27, 2021; 10am EDT - ~5pm EDT

Python is one of the world's most popular and versatile programming languages and can be used to do everything from creating data visualizations to developing sophisticated machine learning models. At the same time, Python was specifically designed to be easy to learn and use, making it an obvious choice for anyone looking for the next step in expanding their data-gathering and analytical skills.

This two-day course will provide an introduction to the Python programming language and how it can be used to do essential data wrangling, manipulation and cleaning tasks with real-world examples from the biomedical field. Using scalable methods and popular libraries for data manipulation, basic statistical analysis and visualization, this course will provide all the necessary tools and background for getting started with Python for data work. Using hosted notebooks, participants will leave the workshop with functioning code that they can apply to their own data sets.

By the end of the workshop, participants will be able to:

  • Load and explore data sets in Python

  • Join, reconcile and otherwise clean up messy data sets

  • Do basic statistical analyses

  • Render exploratory visualizations

Investigators at all career stages are welcome to attend, and we particularly encourage trainees and early-stage investigators to participate.


No prior programming experience is required to participate in this workshop. However, participants must have (or create) an unrestricted Google account for working with sample notebooks (via Google Colab) and data sets. Likewise, participants will be expected to complete a brief survey and watch up to 3 hours of pre-recorded introductory material before the start of the real-time workshop activities.


Training Director: Susan McGregor, PhD, Associate Research Scholar, Columbia University Data Science Institute (DSI). McGregor has been teaching Python for data analysis and wrangling to learners from diverse backgrounds for almost a decade. Her book, Practical Python Data Wrangling & Data Quality is due out from O'Reilly media in February, 2022 (early release currently available here:


Training scholarships are available for the Python Boot Camp.


COVID-19 Update: The Python Boot Camp will no longer take place in person due to the COVID-19 pandemic.  The Boot Camp will instead be a live-stream, remote training that takes place over live, online video.  Please note this training is not a self-paced, pre-recorded online training. 


"For a beginner the training was at a very good level to get started on my own. I feel I walked away with a lot of knowledge that can be applied to my own research and projects." - Kimberly R., Faculty member at Mercy College, 2021

"Susan clearly explained both high level concepts and specific syntax."Amanda N., Government staff member, City of Atlanta City Auditor's Office, 2021

"The python training was excellent! Even as an early beginner in coding the instructors were fantastic and made the information easy and accessible to understand." - Michael F., Postdoc, 2021

"Great foundation for Python and has inspired me to develop my skills in it! Directly applicable to my research. Susan was a wonderful instructor - knowledgeable, enthusiastic and helpful!" - Anonymous Postdoc, 2021



COVID-19 Update: With the training being offered virtually, we are passing along any and all costs saved to attendees.

  Early-Bird Rate (through 5/15/21) Regular Rate (5/16/21 - 7/19/21) Columbia Discount*
Student/Postdoc/Trainee      $875 $975  10%
Faculty/Academic Staff/Non-Profit Organizations/Government Agencies $975  $1,175 10%
Corporate/For-Profit Organizations $1,175 $1,375 NA

*Columbia Discount: This discount is valid for any active student, postdoc, staff, or faculty at Columbia University. To access Columbia discount, email for instructions and specify if you are paying by credit card, or internal transfer within Columbia.

Invoice Payment and Group Registrations: If you would prefer to pay by invoice/check or would like to pay for a group of registrants, please email with details.

Registration Fee: includes course material, which will be available to all students after the workshop.

Cancellations: Cancellation notices must be received via email at least 30 days prior to the training start date in order to receive a full refund, minus a $75 administrative fee. Cancellation notices received via email 14-29 days prior to the training will receive a 75% refund, minus a $75 administrative fee. Please email your cancellation notice to ColumbiaSHARP.Python@gmail.comDue to workshop capacity and preparation, we regret that we are unable to refund registration fees for cancellations <14 days prior to the training. 

If you are unable to attend the training, we encourage you to send a substitute within the same registration category. Please inform us of the substitute via email at least one week prior to the training to include them on attendee communications, updated registration forms, and materials. Should the substitute fall within a different registration category your credit card will be credited/charged respectively. Please email substitute inquiries to In the event Columbia must cancel the event, your registration fee will be fully refunded. 


Want updates on new Boot Camp details or registration deadlines? Subscribe here.

Questions? Email the Boot Camp team here.

The Python Boot Camp is hosted by the Columbia Mailman School of Public Health's SHARP Program.