Introduction

A binary tree is a linked data structure where each node points to two child nodes (at most). The child nodes are called the left child and right child.

Binary tree is a hierarchical data structure. Here are the properties of a binary tree.

Each node can point to two children at most.
The top most element in the tree is called root.
Two Children are usually referred as Left Child and Right Child.
The nodes which don’t point to any children are called leaf nodes.
Non-leaf nodes are called internal nodes. Root is also an internal node if it’s not a leaf node. Let’s step through a few visualizations to internalize these properties.

Now let's look at a couple more properties of the nodes and tree that are very useful for the discussion on trees.

The Depth of the node is the number of nodes on the path from the root to the node. Root has depth 0.

The Height of the node is the number of nodes on the path from the node to the deepest leaf. All leaf nodes have height 0.

You can see that for depth, we begin counting from the root, down towards to the node, starting at 0. For height, we start counting from the node, up towards the root, again starting at 0. Some people start counting depth and height from 1 instead of 0 and it's just a matter of preference.

The Height of the tree is the maximum height of any node in the tree.

Let's step through a visualization to look at the height and depth properties.

In a binary tree, the node might contain a lot of data with a key. Our examples are a little simplistic as we are only working with a single integer which acts both as a key and value. However, in many practical situations, a node contains several fields of data with a key. In our examples below, we will continue to use a single value as data and key, but it's a good thing to remember that, in practice, a node might contain more data.

A subtree of the node is the tree formed by its left and right children. Each node in a binary tree has a left subtree and a right subtree.

Let's step through the following visualization to understand the subtrees.

The most common binary tree is called a Binary Search Tree. Why are we discussing a specialized form of Binary Tree without even getting to the code? Because the structure and layout of data in a Binary Tree would make more sense to you if we discuss insertion/deletion/lookup in a Binary Search Tree.

So what's a Binary Search Tree (BST hereafter)? The name already gives a decent hint. It's a binary tree that helps in searching the data in the tree. We'll shortly look at how a BST facilitates efficient lookups. To understand this, first let's understand the properties of a BST.

A binary tree is a BST if the key of the node is greater than all the nodes in its left subtree and is smaller than all the nodes in its right subtree. Let's look at a binary search tree.

Now that we know what a BST looks like, let's see how can it help with lookups. It's pretty simple actually. Suppose you are looking for key X. Here's what you do at each step starting at the root.

Compare current node's key with X. If it's equal, we've found the key. All done.
If X is less than node's key, we start looking at node's left subtree. It's because we know that right subtree cannot contain anything greater than X.
If X is greater than node's key, we start looking in the right subtree.
We repeat this process until we find the key or we reach the leaf node. If we reach the leaf node and haven't found the key as yet, we return not found.

Let's look at search in a BST in action.

Notice that tree has six elements but we only needed to do three comparisons before we found our desired node. If you look closely, you can see that at each step, we eliminate almost half of the tree.

One thing to remember here is that this is the best case for a BST. There are many cases where your BST might not give you an optimal performance, but they are beyond the scope of this introduction.

We have seen a lot of theory regarding Binary trees and Binary Search trees. Let's look at some code.

In a BST, we need to find the correct location for the Node to be inserted. It depends on the value of the key. Assume, we want to insert a node with key K. Starting at the root, here's a quick description of how we are going to locate the correct location.

Compare current node's key with K.
If K is less than the current node,

If left child of current node is Null, we insert K as the left child of current node and return.
If the left child is not Null, the left child becomes the new current node, and we repeat the process from step 1.

If K is greater than the current node,

If right child of current node is Null, we insert K as the right child of the current node and return.
If the right child is not Null, the right child becomes the new current node, and we repeat the process from step 1.

Using the steps described above, we insert K at the correct location so that BST is quickly searchable later on. Let's step through a quick visualization to see insertion in action.

Javascript (babel-node)

BST.prototype.insert = function(data) {
  var node = new Node(data);
  
  // If it's the first node
  if (this._root === null) {
    this._root = node;
    return;
  }
  
  var current = this._root;
  
  while (current) {
    if (data < current.data) {
      if (current.left === null) {
        current.left = node;
        return;
      }
      current = current.left;
    } else if (data > current.data) {
      if (current.right === null) {
        current.right = node;
        return;
      }
      current = current.right;
    } else {
      // Duplicates are not supported
      return;
    }
  }
};

Javascript (babel-node)

// Returns Keys in the Post Order traversal
BST.prototype.postOrder = function() {
  var output = [];
  
  function postOrderImpl(node) {
    if (node === null) {
      return;
    }
    
    // Visit left subtree
    postOrderImpl(node.left);
      
    // Visit the right subtree
    postOrderImpl(node.right);
    
    // Visit the node itself.
    output.push(node.data);
  }
  
  // Call the internal function
  // with Root as the starting point.
  postOrderImpl(this._root);
  
  return output;
}

Why do we need to understand In-Order predecessor and In-Order successor?

You might have noticed that we haven't discussed deletion in a BST (we only discussed insertion and search). The reason is that deletion is a little tricky in a BST and understanding In-Order predecessor and successor, helps in implementing deletion from a BST.

For a given node X,

Node X's predecessor is the node that comes just before X in InOrder traversal. Also remember that In-Order traversal visits the nodes in a sorted order. Hence, X's predecessor is the node with the largest key smaller than the key of X.

Similarly,

Node X's successor is the node that comes right after X in tree's InOrder traversal. As In-Order traversal visits the nodes in a sorted order, it means that X's successor is the node with the smallest key larger than the key of X.

Time for some examples.

Now that we've understood a lot of terminology about BST's and traversals, let's dive into deleting a Node in a BST. It's a little complicated as we need to ensure that the removal of the node doesn't break the invariant of the BST (left children smaller than the current key and right children larger than the current key).

When deleting a node in BST, there are three cases.

It's a leaf node and has no children.
The node has one child (either left or right).
The node has both left and right children.

First two cases are easier to handle than the third one. Let's look at the easy ones first.

When we have to delete a node with two children, we have two options. Let's say that the node to be deleted has key X.

Replace the current node's key with its predecessor and then trigger delete for predecessor in node's left subtree.
Replace the current node's key with its successor and then trigger delete for successor in node's right subtree.

Why would it work?

Remember predecessor is the highest value smaller than the current key. Now we know that this node has a left subtree. Hence, the predecessor to the current node is somewhere in the left subtree. In fact predecessor is the highest key in the node's left subtree. In addition, as it's the highest node in left subtree, it cannot have a right child. Otherwise that right child would have been the predecessor which ensures that we are now trying to delete a node with zero or one child. We already know that deleting a node with zero or one child is simpler and hence we've reduced the problem from a node with two children to deletion of node with zero or one child.

Similarly, successor is the smallest value larger than the current key. We already know that the node has a right subtree. Hence, the successor to the current node is somewhere in the right subtree. In fact, successor is the smallest key in node's right subtree. Hence, it's also a node with zero or one child.

Let's look at a quick visualization

We'll implement a remove function that would take a key and delete the key ensuring that BST is left in the correct state after removal of the node. However, before we implement remove function, we need to decide what to do when the node to be deleted has two children. We have the option of either going the route of replacing it with predecessor or with its successor.

Let's pick predecessor. We know that the predecessor in the left subtree of the node will be the highest value of this subtree. Hence, we'll implement a quick maximum function.

Javascript (babel-node)

BST.prototype.remove = function(key) {
  this.removeImpl(key, this._root);
}
BST.prototype.removeImpl = function removeImpl(key, node) {
  if (node != null) {
    if (key < node.data) {
      // Key might be in the left subtree.
      node.left = this.removeImpl(key, node.left);
    } else if (key > node.data) {
      node.right = this.removeImpl(key, node.right);
    } else {
      // Node found. 
      // Let's see if it has two children.
      if (node.left && node.right) {
        // Replace current node with 
        // predecessor data
        node.data = this.maximum(node.left);
        node.left = this.removeImpl(node.data, node.left);
      } else {
        // Only 1 child.
        // Let's return the child that's valid.
        node = node.left || node.right;
      }
    }
  }
  return node;
}

1.Data Structures

Assessment

Binary Trees & Binary Search Trees

Introduction

Height and Depth of the Tree

Key of the node and Subtree

Binary Search Tree

Search in a Binary Search Tree

Defining the Tree Node

Defining Binary Search Tree

Inserting into BST

Implementing Search in a BST

BST Insert and Search in action

Tree traversals

Pre-Order Traversal

In-Order Traversal

Post-Order Traversal

Traversals in action

In-Order Predecessor and In-Order Successor

Deleting a node in a BST

Deleting a Leaf Node

Deleting a node with 1 child

Deleting a node with two children

Implementing Node Removal in BST

Exercise

Summary