Chapter 6
Cleaning Data with Spreadsheets
Learning Objectives
- Understand common data cleaning techniques used to remove data
- Identify different use cases for cleaning a data set
- Identify corrupted data and handle it appropriately
- Recognize the four types of “dirty” data: Missing, Irregular, Unnecessary, and Inconsistent
Key Terminology
Data Cleaning Techniques
- Filtering
- Sorting
- Redundant data
- Trailing Whitespace
- functions
- REGEX
Types of Dirty Data
- Missing data
- Irregular data
- Unnecessary data
- Inconsistent data