Decision tree is a non-parametric supervised learning method used for classification and regression. Goal is to create a model that predicts the value of a target variable by learning simple decision rule inferred from data feature.

Motivation: Interpretability

We want to find best suit for data sets with “region” properties

Contrast decision tree and linear classifier

Screen Shot 2022-05-05 at 11.10.43 AM.png

                           left - linear classifier                                    right- decision tree

Screen Shot 2022-05-05 at 11.12.04 AM.png

Decision Tree Setup

Goal: Train/induce/learn a function h that maps instance to a label

In the model, 3 things are learned:

How to create decision tree

Procedure