Step 2: Ingest/Clean/Validate
This is the most time consuming, frustrating, unstructured part of the analysis. It's also where the most can go wrong. Therefore, it is the most important and must be done right.
1. Ingest raw data into database
3. Clean / map
Never move to analysis before signing off on data quality.
22.214.171.124 Data Analysis - Summary statistics and confirmation