Before we calculate Norm, let’s look at a simpler case. We’ll apply a variational method to approximate the distribution of a hidden variable analytically. Suppose we have two binary variables, A and B, that are not independent. A is the parent node and B is the child node, whose conditional probability we want to estimate.
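
To make the setup concrete, here is a minimal sketch of such a two-node network in Python. It is not the variational method itself, only the exact model that an approximation would be compared against. The probability values and the names `p_a`, `p_b_given_a`, `joint`, `marginal_b`, and `posterior_a` are assumptions chosen for illustration and do not come from the lesson.

```python
# Two binary variables with A as the parent of B (A -> B).
# All numbers below are hypothetical, picked only to illustrate the structure.

# Prior distribution of the parent: P(A)
p_a = {0: 0.7, 1: 0.3}

# Conditional distribution of the child: P(B | A).
# The two rows differ, so A and B are not independent.
p_b_given_a = {
    0: {0: 0.9, 1: 0.1},  # P(B | A=0)
    1: {0: 0.2, 1: 0.8},  # P(B | A=1)
}


def joint(a, b):
    """P(A=a, B=b) = P(A=a) * P(B=b | A=a), the factorization of A -> B."""
    return p_a[a] * p_b_given_a[a][b]


def marginal_b(b):
    """P(B=b), obtained by summing the joint over both values of A."""
    return sum(joint(a, b) for a in p_a)


def posterior_a(a, b):
    """P(A=a | B=b) by Bayes' rule, computed exactly by enumeration."""
    return joint(a, b) / marginal_b(b)
```

With only two binary variables, every quantity can be computed exactly by enumeration, which is what makes this case a convenient reference point for checking an analytical approximation.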

The following figure depicts this Bayesian network.
