Guides ยท Technology
Data Cleaning Checklist Basics
Prep datasets before analysis
This guide provides a stepwise data cleaning checklist: schema validation, missing data handling, outlier review, deduplication, and documenting transformations.
- data cleaning
- datasets
- analytics
- quality
- etl
Validate schema
Confirm column types, ranges, and required fields; fail fast on violations.
Handle missing data
Quantify missingness, decide on drop, impute, or flag; record rationale.
Review outliers and duplicates
Use simple profiling to spot outliers; deduplicate using keys and fuzzy checks if needed.
Document steps
Log every transformation, assumptions, and QA checks to keep analyses reproducible.
Keep Exploring
Guides
API Basics
APIs let software request data or actions from other systems through defined endpoints and responses.
Comparison
RAM vs Storage
RAM handles what your device is actively working on, while storage keeps apps, files, and system data available over time.
How it works
Global Positioning System
GPS uses timing signals from satellites to calculate a receiver's position on Earth.
What it is
Blockchain
A blockchain is a distributed ledger secured by cryptography and consensus nodes.