What are Bayesian networks?

“A Bayesian network signifies the causal probabilistic connection among a set of random variables, their conditional dependencies, and it provides a compact representation of a joint probability distribution.” (Murphy, 1998).

Implementation

Here’s the practical implementation of Bayesian networks.

First, let’s look at how to initialize a Bayesian network by quickly implementing the Monty Hall Problem. The Monty Hall problem arose from the game show Let’s Make a Deal, where a guest had to pick which one of three doors had a reward behind it. The twist was that after the guest chose, the host (originally Monty Hall) would then open one of the doors the guest did not pick and ask if the guest wanted to switch which door they had chosen.

To create the Bayesian network in pomegranate, we first design the distributions that live in each node in the graph. For a discrete Bayesian network, we use Discrete Distribution objects for the root nodes and Conditional Probability Table objects for the inner and leaf nodes. The columns in a Conditional Probability Table correspond to the order in which the parents (the second argument) are specified. The last column is the value the Conditional Probability Table takes itself. In the case below, the first column corresponds to the value guest takes, then the value prize takes, and then the matter that monty takes. B, C, and A refer to the probability that Monty reveals door A, given that the guest has chosen door B, and that the prize is actually behind door C, or P(Monty=A|Guest=B, Prize=C).

from pomegranate import *
# The guests initial door selection is completely random
guest = DiscreteDistribution({'A': 1./3, 'B': 1./3, 'C': 1./3})
# The door the prize is behind is also completely random
prize = DiscreteDistribution({'A': 1./3, 'B': 1./3, 'C': 1./3})
# Monty is dependent on both the guest and the prize.
monty = ConditionalProbabilityTable([
    ['A', 'A', 'A', 0.0],['A', 'A', 'B', 0.5],
    ['A', 'A', 'C', 0.5],['A', 'B', 'A', 0.0],
    ['A', 'B', 'B', 0.0],['A', 'B', 'C', 1.0],
    ['A', 'C', 'A', 0.0],['A', 'C', 'B', 1.0],
    ['A', 'C', 'C', 0.0],['B', 'A', 'A', 0.0],
    ['B', 'A', 'B', 0.0],['B', 'A', 'C', 1.0],
    ['B', 'B', 'A', 0.5],['B', 'B', 'B', 0.0],
    ['B', 'B', 'C', 0.5],['C', 'B', 'A', 1.0],
    ['C', 'B', 'B', 0.0],['C', 'B', 'C', 0.0],
    ['C', 'C', 'A', 0.5],['C', 'C', 'B', 0.5],
    ['C', 'C', 'C', 0.0],['B', 'C', 'A', 1.0],
    ['B', 'C', 'B', 0.0],['B', 'C', 'C', 0.0],
    ['C', 'A', 'A', 0.0],['C', 'A', 'B', 1.0],
    ['C', 'A', 'C', 0.0],],[guest, prize])
# State objects hold both the distribution, and a high level name.
s1 = Node(guest, name="guest")
s2 = Node(prize, name="prize")
s3 = Node(monty, name="monty")
# Create the Bayesian network object with a useful name
model = BayesianNetwork("Monty Hall Problem")
# Add the three states to the network 
model.add_states(s1, s2, s3)
# Add edges which represent conditional dependencies, where the second node is 
# conditionally dependent on the first node (Monty is dependent on both guest and prize)
model.add_edge(s1, s3)
model.add_edge(s2, s3)
# Finding the probalibities
model.bake()
# Probability for Valid Case
print("The Probability if User said door A, then Monty opened door B, but the car was behind door C : ",model.probability([['A', 'B', 'C']]))
# Probability for Invalid Case
print("The Probability if User said door A, then Monty opened door B, but the car was behind door B : ",model.probability([['A', 'B', 'B']]))

What are Bayesian networks?

Overview

Example

Implementation