3) Exploratory data analysis
The third step, EDAExploratory Data Analysis, provides an opportunity to become familiar with your data including distribution and the state of missing values. Exploratory data analysis also drives the next stage of data scrubbing and your choice of algorithm.
In addition, EDAExploratory Data Analysis may come into play in other sections of your code as you check the size and structure of your dataset and integrate that feedback to direct model optimization.
4) Data scrubbing
The data scrubbing stage usually consumes the ...