Data Analysis for ML/AI
Data quality and cleaning
Duplicates, whitespace, casing, type coercion errors, validation rules.
5 lessons
Follow in order
Use the button below to sign in and unlock lessons.
Your path
Lessons in sequence
Work through these in order—each lesson builds on the previous one.
-
Lesson 1 of 5
Duplicates: detection and resolution
-
Lesson 2 of 5
Whitespace, casing, and string cleanup
-
Lesson 3 of 5
Validation rules at ingestion
-
Lesson 4 of 5
Standardizing units and formats
-
Lesson 5 of 5
Repeatable, version-controlled cleaning