Bagging

Learn how bagging, the first technique used by the random forest algorithm, produces valuable ensembles.

Randomizing observations

To build a valuable machine learning ensemble, the models within the ensemble should produce predictions with low correlation. In other words, the models should differ from one another. The random forest algorithm takes advantage of the high variance of the CART algorithm to manufacture this diversity across the ensemble models.

The random forest algorithm uses bagging (i.e., bootstrap aggregation) to randomize the observations used to train the individual CART decision trees. CART decision trees trained on randomized observations exhibit a diversity of predictions.
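To see this diversity in action, here is a minimal sketch (assuming scikit-learn and NumPy are available; the synthetic dataset and hyperparameters are illustrative assumptions, not part of the lesson) that fits two CART trees on different bootstrap samples of the same data and measures how often their predictions disagree:

```python
# A minimal sketch, not the lesson's implementation: the synthetic dataset
# and hyperparameters below are illustrative assumptions.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, random_state=42)
rng = np.random.default_rng(42)

predictions = []
for _ in range(2):
    # Bootstrap sample: draw row indices with replacement.
    idx = rng.choice(len(X), size=len(X), replace=True)
    tree = DecisionTreeClassifier(random_state=0)
    tree.fit(X[idx], y[idx])
    predictions.append(tree.predict(X))

# Trees trained on different bootstrap samples typically disagree on some rows.
disagreement = np.mean(predictions[0] != predictions[1])
print(f"Fraction of rows where the two trees disagree: {disagreement:.2%}")
```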

Bagging uses random sampling with replacement to create many diverse training datasets from a single starting training dataset. Each training dataset created by bagging has the same number of observations as the starting training dataset.
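A rough sketch of this sampling step follows, using only Python's standard library. The make_bags helper and the toy dataset are hypothetical names introduced here for illustration:

```python
# A minimal sketch of bagging's sampling step. The make_bags helper and
# the toy dataset are hypothetical, not part of any library.
import random

def make_bags(dataset, n_bags, seed=0):
    """Create n_bags training datasets by sampling with replacement,
    each with the same number of observations as the starting dataset."""
    rng = random.Random(seed)
    return [rng.choices(dataset, k=len(dataset)) for _ in range(n_bags)]

dataset = ["obs1", "obs2", "obs3", "obs4", "obs5"]
for bag in make_bags(dataset, n_bags=3):
    print(bag)  # each bag has 5 observations; duplicates and omissions occur
```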

Random sampling with replacement

Let’s understand how bagging works through an example. Imagine you have a jar in which you place four balls, each with a distinct color: blue, green, red, and orange. In this example, each ball represents an observation in the training dataset.

After shaking the jar, you retrieve a ball randomly (i.e., you can’t see into the jar). Let’s say the ball you retrieved is the blue ball. You write down the word “blue” on a piece of paper and place the ball back into the jar. You then repeat this process three more times.

Let’s say the paper reads blue, orange, orange, green. This represents a randomized training dataset created from the starting training dataset. Randomized training datasets created this way are known as bags.
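The jar walkthrough maps directly onto a few lines of Python. The snippet below is an illustrative simulation; the seed, and therefore the exact colors drawn, are arbitrary assumptions:

```python
# A minimal simulation of the jar example; the seed and resulting draws
# are arbitrary, so your "bag" may differ from the one in the text.
import random

jar = ["blue", "green", "red", "orange"]  # one ball per observation
rng = random.Random(7)

# Draw four times, returning each ball to the jar after recording its color.
bag = [rng.choice(jar) for _ in range(4)]
print(bag)  # e.g., ['blue', 'orange', 'orange', 'green']
```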
