My takeaway series follows a Q&A format to explain AI concepts at three levels:
- For anyone with general knowledge who wants a plain-language understanding.
- For anyone who wants to dive into the code implementation details of the concept.
- For anyone who wants to understand the mathematics behind the technique.
The decision tree is a supervised machine learning algorithm that can be used for both classification and regression tasks; the term also refers to the tree-based model built by the algorithm.
Put simply, a decision tree is a supervised learning model that maps input features to output labels.
Inputs:
- A set of input features (independent variables), where each feature can be categorical or numerical.

Outputs:
- A target variable (dependent variable), which can be categorical (for classification tasks) or continuous (for regression tasks).
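As a concrete illustration of these inputs and outputs, here is a tiny toy dataset in Python; the feature names, values, and labels are invented purely for this example:

```python
# A toy classification dataset: two numerical features, one categorical target.
# Feature and label names here are illustrative, not from any real dataset.
X = [
    [25, 40_000],   # [age (numerical), income (numerical)]
    [38, 85_000],
    [52, 120_000],
]
y = ["no", "yes", "yes"]  # target label: did the customer buy? (categorical)
```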
A decision tree model is structured as an if-else tree: each internal node represents a decision based on the value of a specific feature, each branch represents the outcome of that decision, and each leaf node represents a final prediction or output.
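To make the if-else analogy concrete, here is a minimal hand-written sketch of what a small classification tree computes; the features, thresholds, and labels are invented for illustration:

```python
def predict(age: float, income: float) -> str:
    """A hand-coded two-level decision tree (illustrative thresholds).

    Internal nodes test a feature against a threshold, each branch is
    one outcome of the test, and leaves return the final prediction.
    """
    if income <= 60_000:          # internal node: split on income
        if age <= 30:             # internal node: split on age
            return "no"           # leaf node: final prediction
        return "yes"              # leaf node
    return "yes"                  # leaf node
```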
The tree structure itself is a model parameter, as is the threshold value of the feature used for splitting at each internal node. Both are learned from the training data during training.
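As a sketch of this, a tree fitted with scikit-learn (one common implementation; the data below is synthetic) exposes the per-node split features and thresholds it learned:

```python
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# Synthetic data purely for demonstration.
X, y = make_classification(n_samples=200, n_features=4, random_state=0)

clf = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)

# The fitted tree exposes its learned parameters: which feature each
# node splits on and the threshold used at that node (-2 marks leaves).
print(clf.tree_.feature)    # feature index tested at each node
print(clf.tree_.threshold)  # threshold value at each node
```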
There are several algorithms for training a decision tree (see the sketch after this list), including:
- ID3 (Iterative Dichotomiser 3)
- C4.5
- CART (Classification and Regression Trees)
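Of these, scikit-learn's documentation describes its implementation as an optimized version of CART; below is a minimal end-to-end training sketch on a built-in toy dataset:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text

# Load a standard toy dataset and hold out a test split.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

# CART-style training: greedily pick the feature/threshold split that
# most reduces impurity (Gini by default), then recurse on each branch.
clf = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=42)
clf.fit(X_train, y_train)

print(f"test accuracy: {clf.score(X_test, y_test):.3f}")
print(export_text(clf))  # the learned if-else structure, rendered as text
```

Limiting `max_depth` here is a simple guard against overfitting; deeper trees fit the training data more closely but generalize worse.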