Available Until 5/18/2024

Master Data Cleaning and Wrangling with OpenRefine and Python - CE200

This course will be held at MLA '24 in Portland, OR.

Saturday, May 18, 1:00 p.m.–5:00 p.m.

Cost: $295 (nonmember: $380)
Attendance maximum: 25

Whether you are new to OpenRefine and Python or already know some Python, in just four hours you’ll gain skills in cleaning, analyzing, and displaying data, and in supporting researchers as they work with their data and write data management and sharing plans (DMSPs). These skills can expand your career options and increase your value in your workplace.

Paige Scudder, an experienced Python coder and the instructor of a very successful MLA virtual course on Python and data visualization, will be your guide to using two powerful, open-source, and user-friendly tools, OpenRefine and Python, to unlock the full potential of a dataset. You’ll learn to use OpenRefine to clean and transform messy data, then bring your clean data into Python to perform statistical analyses and create visualizations.

You’ll leave the course with a Google Colab notebook containing the code covered in the course, skills in using OpenRefine and Python to improve your data handling and analysis, a grasp of data handling terminology that lets you show researchers how to get the most out of their data, and a broader career horizon.

Important Note: You’ll need to bring a laptop computer with WiFi capability to participate in the course.

This course is an approved elective for Level II of the Data Services Specialization.

Learning Outcomes

By the end of this course, you will be able to:

  • Identify the components of a clean and tidy dataset
  • Use OpenRefine to create a clean and tidy dataset and import it into Python
  • Use Python variables, functions, and other coding fundamentals
  • Use Python to analyze data
  • Use Python to create data visualizations
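
To give a rough sense of what the Python side of these outcomes can look like, here is a minimal sketch. It is not taken from the course materials. It assumes a dataset has already been cleaned in OpenRefine and exported as CSV, and it uses only Python's standard library. The sample data, the column names, and the helper functions `load_rows` and `mean_visits_by_site` are all made up for illustration.

```python
import csv
import io
import statistics

# Hypothetical stand-in for a CSV file exported from OpenRefine after cleaning.
# In practice you would open the exported file instead of using a string.
CLEAN_CSV = """site,year,visits
North,2022,120
North,2023,150
South,2022,200
South,2023,180
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, converting numeric columns to int."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {"site": r["site"], "year": int(r["year"]), "visits": int(r["visits"])}
        for r in reader
    ]


def mean_visits_by_site(rows):
    """Group rows by site and return the mean number of visits for each site."""
    grouped = {}
    for row in rows:
        grouped.setdefault(row["site"], []).append(row["visits"])
    return {site: statistics.mean(values) for site, values in grouped.items()}


rows = load_rows(CLEAN_CSV)
summary = mean_visits_by_site(rows)
print(summary)
```

Because each row is one observation and each column is one variable, the grouping step needs no extra cleanup. That is the practical benefit of starting from a clean and tidy dataset.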

Audience

Medical librarians and other health information professionals who want to learn how to clean data with OpenRefine and how to use Python for analysis and data visualization. No previous Python coding experience is required, and if you have some, there will still be much to learn!

Instructor

Paige Scudder is a Research and Instruction Librarian for the Tufts Hirsh Health Sciences Library focusing on data management and educational technologies. Paige works with Software Carpentry and has developed an introductory Python series designed to meet beginners where they are and help them develop confidence in their coding skills over the course of a few lunch breaks.

MLA CE Credits: 4