Long Island Library Resources Council

Bridging the Long Island Library Community

Log in

MLA Webinar: Clean & Tidy Data: Getting Started with Spreadsheet Data

  • January 20, 2022
  • 2:00 PM - 3:30 PM
  • MLA Zoom
  • 24


  • Any staff member of a LILRC Institutional Member or a LILRC Retired Member
  • For staff and members of any other councils.

Registration is closed

Data analysis and manipulation usually begins with a spreadsheet. If you work with your own data or advise researchers on theirs, you’ll want to get started right. In this webinar, you’ll learn how to prepare your spreadsheet and format your data to serve later analyses. You’ll learn best practices for curating data, identifying and addressing common data problems, and preparing data for analysis and use. And you’ll learn when spreadsheets may not be the best place to start. Presenters will demonstrate best practices using medical data from a hypothetical pharmacokinetics study.

This webinar is a companion to Clean & Tidy Data: Making Data Usable. The courses stand alone and work together synergistically. Making Data Usable will show you how to identify the steps needed to clean and normalize data.

Special Note: This webinar is approved for the “under construction” Advanced Level of the Data Services Specialization. A Basic Level Data Services Specialization Certification is currently available.

Learning Outcomes

  • At the end of the webinar, participants will be able to:
  • Identify when spreadsheets are useful and when they are not
  • Assess when a task should not be done in spreadsheet software.
  • Identify the features of clean & tidy dataset
  • Identify common data problems


Medical librarians and other health information professionals who provide or plan to provide data services. Familiarity with spreadsheets is helpful.


Anne M. Brown is an Assistant Professor in Data Services, University Libraries at Virginia Tech and affiliate faculty member in the Department of Biochemistry and Academy of Integrated Science. She is the author or co-author of a number of publications and presentations on data-related and data literacy topics.

Daniel Chen is a graduate student in Genetics, Bioinformatics, and Computational Biology at Virginia Tech. His research is focused on data science education and pedagogy in the medical and biomedical sciences. He is the author of Pandas for Everyone: Python Data Analysis and a number of other data science learning materials.

Note: This registration is for the Livestream only and does not offer MLA contact hours. If you are a LILRC member health sciences/hospital librarian, please email Sally Stieglitz, at sstieglitz@lilrc.org, to arrange to view with a unique access code for MLA contact hours. MLA contact hours are not applicable to the MLA Consumer Health Information Specialization

This program is not being recorded.  

Code of Conduct

For questions, please email Eliscia Cirrone, ecirrone@lilrc.org.

Long Island Library Resources Council
627 N. Sunrise Service Road
Bellport NY, 11713
Phone: (631) 675-1570

Powered by Wild Apricot Membership Software