The Math Behind Decision Trees: An Exploration of Decision Trees

Decision trees are an effective supervised learning technique for predicting the class or value of a target variable. Unlike some other supervised learning methods, they're well suited to both classification and regression tasks. Use this course to learn how decision trees are used for classification, distinguishing between rule-based and ML-based approaches. As you progress through the course, investigate how entropy, Gini impurity, and information gain are used to choose splits. Practice implementing both rule-based and ML-based decision trees and using Python visualization libraries to construct graphical representations of them. Upon completion, you'll be able to create, use, and share rule-based and ML-based decision trees.


  • Discover the key concepts covered in this course
  • Define what's meant by classification, describing classification rules and rule-based classifier properties and limitations
  • Contrast rule-based and ML-based classifiers
  • Outline the structure of a decision tree, the process it uses to "decide," its advantages, and some core considerations when building one
  • Work through the creation of a decision tree and list some decision tree algorithms
  • Define what's meant by entropy and outline how it's used in relation to decision trees, referencing the ID3 algorithm and information gain
  • Summarize how information gain and entropy are used in tandem
  • Define Gini impurity and calculate it for a dataset
  • Split decision trees based on Gini impurity
  • Import modules and set up data
  • Decide splits for a rule-based decision tree
  • Define a rule-based decision tree
  • Illustrate the use of decision trees for continuous values
  • Visualize a decision tree
  • Create a rule-based decision tree
  • Train an ML-based decision tree
  • Use a trained ML-based decision tree to make decisions
  • Summarize the key concepts covered in this course
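
For a sense of the math involved, the impurity measures named in the objectives above can be sketched in a few lines of Python. This is a minimal illustration using the standard textbook definitions, not the course's own code; the function names and the toy labels are hypothetical and chosen only for this example.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy in bits: -sum(p * log2(p)) over class proportions p."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gini_impurity(labels):
    """Gini impurity: 1 - sum(p ** 2) over class proportions p."""
    total = len(labels)
    return 1.0 - sum((n / total) ** 2 for n in Counter(labels).values())


def information_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the child subsets."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted


# Hypothetical toy labels, made up for illustration only.
parent = ["yes", "yes", "yes", "no", "no", "no"]
left = ["yes", "yes", "yes", "no"]   # one side of a candidate split
right = ["no", "no"]                 # the other side

print(entropy(parent))
print(gini_impurity(parent))
print(information_gain(parent, [left, right]))
```

An evenly mixed two-class set like `parent` is as impure as a binary set can be: its entropy is 1 bit and its Gini impurity is 0.5. A split is more useful the more it reduces impurity, which is what the information gain value measures.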


