This two-day course will provide an introduction to the Python programming language and how it can be used to do essential data wrangling, manipulation and cleaning tasks with real-world examples from the biomedical field. Using scalable methods and popular libraries for data manipulation, basic statistical analysis and visualization, this course will provide all the necessary tools and background for getting started with Python for data work.
By the end of the workshop, participants will be able to:
-Load and explore data sets in Python
-Join, reconcile and otherwise clean up messy data sets
-Do basic statistical analyses
-Render exploratory visualizations
Investigators at all career stages are welcome to attend, and we particularly encourage trainees and early-stage investigators to participate. Please note this training is a live-stream, virtual training. It is not a self-paced, pre-recorded online training.
PREREQUISITES AND REQUIREMENTS
No prior programming experience is required to participate in this workshop. However, participants must have (or create) an unrestricted Google account for working with sample notebooks (via Google Colab) and data sets. Likewise, participants will be expected to complete a brief survey and watch up to 3 hours of pre-recorded introductory material before the start of the real-time workshop activities.
Training Director: Susan McGregor, PhD, Associate Research Scholar, Columbia University Data Science Institute (DSI).
- Scholarships are available: https://www.publichealth.columbia.edu/research/precision-prevention/professional-development-scholarships
- Subscribe for updates: http://eepurl.com/hnpdGT
- Email our team: ColumbiaSHARP.Python@gmail.com
Capacity is limited. Paid Registration is required to attend.
Python Data Wrangling Boot camp
ColumbiaSHARP [dot] Python [at] gmail [dot] com