What is Gibbs sampling?

Gibbs sampling belongs to the Markov Chain Monte Carlo (MCMC) methods. These methods are a class of algorithms that are needed for sampling from probability distributions.

Gibbs sampling

Gibbs sampling creates a Markov chain, which transitions forward according to some probabilistic rules and then converges. At convergence, Gibbs sampling approximates the posterior distribution. Posterior distributions are used for uncertain quantities in the Bayesian Analysis.

Algorithm

Let's say that we have a multivariate probability distribution:

The steps of the algorithm are as follows:

Choose a random value for both variables, $X$ and $Y$ .
Sample from the distribution of $X$ .
Sample a new value for $Y$ on the $X$ we just computed.
Repeat $1$ and $2$ for $n$ iterations.

We can also do alternative sampling between $X$ and $Y$ .

Pros and cons

Now let's look at some pros and cons of Gibbs sampling.

Pros

It is easy to evaluate the conditional distributions.
There is an exact sampling from conjugate conditionals.
There are lower dimensional conditions and it s easy to apply both rejection and importance samplings.

Cons

There is always a need to derive conditional probability distributions.
There is also a need to derive random samples from these distributions.
Gibbs sampling may get slow because of no diagonal steps and correlated parameters.

Example

Let's suppose that we have two variables, $X$ and $Y$ , with a joint distribution as follows:

So, we simulate Gibbs sampling if we do as follows:

Take a new value of $X$ from $Pr(X|Y=y)$ where $y$ is the current value of $Y$ .
Take a new value of $Y$ from $Pr(Y|X=x)$ where $x$ is the current value of $X$ .

Applications

Some applications of Gibbs sampling are as follows:

Modeling unobserved data using Data Augmentation
Statistical mechanics
Image processing (Lattice model)
Bayesian statistics
Bioinformatics (analyzing DNA strands)
Immunology
Segregation and survival analysis

X/Y	0	1
0	0.5	0.2
1	0.3	0.3