Rules of Probability

Explore the foundational rules of probability crucial for generative AI. Understand how probability values are assigned, the difference between dependent and independent data, and how conditional and joint probabilities inform model design. Discover how Bayes' theorem links these concepts and the distinction between discriminative and generative models, with examples including neural networks like VAEs and GANs.

At the simplest level, a model, be it for machine learning or a more classical method such as linear regression, is a mathematical description of how various kinds of data relate to one another.

In the task of modeling, we usually think about separating the variables of our dataset into two broad classes:

  1. Independent data: These are the inputs to a model, denoted by X. They could be categorical features (such as a 0 or 1 in six columns indicating which of six schools a student attends), continuous (such as the heights or test scores of the same students), or ordinal (the rank of a student in the class).

  2. Dependent data: These are the outputs of our models, denoted by Y. As with the independent variables, these can be continuous, categorical, or ordinal, and they can be an individual element or a multidimensional matrix (tensor) for each element of the dataset.

In some cases, Y is a label that can be used to condition a generative output, such as in a conditional GAN.

So, how can we describe the data in our model using statistics? In other words, how can we quantitatively describe what values we are likely to see, how frequently, and which values are more likely to appear together? One way is by asking the likelihood of observing a particular value in the data, or the probability of that value. For example, if we were to ask what the probability is of observing a roll of 4 on a six-sided die, the answer is that, on average, we would observe a 4 once every six rolls. We write this as follows:

P(X = 4) = 1/6

where P denotes the probability of an event.
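The die example can also be checked empirically: if we simulate many rolls, the observed frequency of a 4 converges toward 1/6. A minimal sketch in plain Python (the seed and roll count are arbitrary choices):

```python
import random

random.seed(0)

# Simulate many rolls of a fair six-sided die and count how often a 4 appears.
n_rolls = 100_000
rolls = [random.randint(1, 6) for _ in range(n_rolls)]
p_four = rolls.count(4) / n_rolls

print(f"P(roll = 4) ~ {p_four:.3f}")  # close to 1/6 ~ 0.167
```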

What defines the allowed probability values for a particular dataset? If we imagine the set of all possible values of a dataset, such as all values of a die, then a probability maps each value to a number between 0 and 1. The minimum is 0 because we can’t have a negative chance of seeing a result; the most unlikely result is that we would never see a particular value, or 0% probability, such as rolling a 7 on a six-sided die. Similarly, we can’t have greater than 100% probability of observing a result, represented by the value 1; an outcome with probability 1 is absolutely certain.

This set of probability values associated with a dataset may belong to discrete classes (such as the faces of a die) or an infinite set of potential values (such as variations in height or weight). In either case, however, these values have to follow certain rules, the probability axioms (University of York, 2019. https://www.york.ac.uk/):

  1. The probability of an observation (a die roll, a particular height, and so on) is a non-negative, finite number between 0 and 1.

  2. The probability of at least one of the observations in the space of all possible observations occurring is 1.

  3. The probability that any one of a set of distinct, mutually exclusive events occurs is the sum of the probabilities of the individual events.

While these rules might seem abstract, we’ll later see that they have direct relevance to the development of neural network models. For example, an application of rule 1 is to generate a probability between 0 and 1 for a particular outcome in a softmax function—a mathematical function that converts a vector of real numbers into a probability distribution—for predicting target classes. Rule 3 is used to normalize these outcomes into the range 0–1, under the guarantee that they are mutually distinct predictions of a deep neural network (in other words, a real-world image logically can’t be classified as both a dog and a cat, but rather a dog or a cat, with the probability of these two outcomes additive). Finally, rule 2 provides the theoretical guarantees that we can generate data at all using these models.
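A minimal sketch of how a softmax layer enforces these axioms, in plain Python with no deep learning framework (the logits are illustrative values):

```python
import math

def softmax(logits):
    # Subtract the max logit for numerical stability, then exponentiate
    # and normalize so the outputs form a probability distribution.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])

# Each output lies in [0, 1] (rule 1), and because the classes are
# mutually exclusive, the probabilities sum to exactly 1 (rules 2 and 3).
print(probs)
print(sum(probs))
```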

However, in the context of machine learning and modeling, we are not usually interested in just the probability of observing a piece of input data, X; we instead want to know the conditional probability of an outcome, Y, given the data, X. In other words, we want to know how likely a label is for a set of data based on that data. We write this as the probability of Y given X, or the probability of Y conditional on X:

P(Y|X)

Another question we could ask about Y and X is how likely they are to occur together, or their joint probability, which can be expressed using the preceding conditional probability expression as follows:

P(X, Y) = P(Y|X)P(X)

This formula expresses the probability of X and Y. In the case of X and Y being completely independent of one another, this is simply their product:

P(X, Y) = P(X)P(Y)
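These identities can be verified on a small discrete example. The sketch below defines a toy joint distribution over two binary variables (the probability values are made up for illustration), derives the marginals and conditionals, and checks the product rule, and it also shows that for these particular numbers X and Y are not independent:

```python
# A tiny joint distribution over X in {0, 1} and Y in {0, 1},
# stored as P(X = x, Y = y). Values are illustrative.
joint = {
    (0, 0): 0.30, (0, 1): 0.10,
    (1, 0): 0.20, (1, 1): 0.40,
}

# Marginals: P(X = x) and P(Y = y), by summing out the other variable.
p_x = {x: sum(p for (xi, _), p in joint.items() if xi == x) for x in (0, 1)}
p_y = {y: sum(p for (_, yi), p in joint.items() if yi == y) for y in (0, 1)}

# Conditional: P(Y = y | X = x) = P(X = x, Y = y) / P(X = x)
p_y_given_x = {(y, x): joint[(x, y)] / p_x[x] for x in (0, 1) for y in (0, 1)}

# The product rule recovers the joint: P(X, Y) = P(Y|X) P(X)
for x in (0, 1):
    for y in (0, 1):
        assert abs(p_y_given_x[(y, x)] * p_x[x] - joint[(x, y)]) < 1e-12

# Here X and Y are *not* independent, so P(X)P(Y) differs from P(X, Y):
print(p_x[1] * p_y[1], joint[(1, 1)])  # 0.3 vs 0.4
```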

These expressions become important in our discussion of complementary priors later on, and the ability of restricted Boltzmann machines to simulate independent data samples. They are also important as building blocks of Bayes’ theorem, which we'll discuss next.

Discriminative and generative modeling and Bayes’ theorem

Now let’s consider how these rules of conditional and joint probability relate to the kinds of predictive models that we build for various machine learning applications. In most cases—such as predicting whether an email is fraudulent or the dollar amount of the future lifetime value of a customer—we are interested in the conditional probability, P(Y|X = x), where Y is the set of outcomes we are trying to model, X represents the input features, and x is a particular value of the input features. This approach is known as discriminative modeling (Jebara, 2004. https://www.springer.com/gp/book/9781402076473). Discriminative modeling attempts to learn a direct mapping between the data, X, and the outcomes, Y.

Another way to understand discriminative modeling is in the context of Bayes' theorem (Bayes, T., 1763. "An essay towards solving a problem in the doctrine of chances." Philosophical Transactions of the Royal Society of London, 53, 370–418. https://doi.org/10.1098/rstl.1763.0053), which relates the conditional and joint probabilities of a dataset:

P(Y|X) = P(X|Y)P(Y) / P(X)

In Bayes’ formula, the expression P(X|Y)/P(X) is known as the likelihood, or the supporting evidence that the observation X gives to the likelihood of observing Y; P(Y) is the prior, or the plausibility of the outcome; and P(Y|X) is the posterior, or the probability of the outcome given all the independent data we have observed related to the outcome thus far. Conceptually, Bayes’ theorem states that the probability of an outcome is the product of its baseline probability and the probability of the input data conditional on this outcome.
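A short numerical sketch of Bayes' theorem, using a hypothetical two-class example (the prior and likelihood values are invented for illustration):

```python
# Two outcome classes with priors P(Y), and the likelihood P(x | Y = y)
# of an observed feature value x under each class. All values are made up.
prior = {"spam": 0.2, "ham": 0.8}
likelihood = {"spam": 0.7, "ham": 0.1}

# Evidence: P(x) = sum over y of P(x | y) P(y)
evidence = sum(likelihood[y] * prior[y] for y in prior)

# Posterior: P(y | x) = P(x | y) P(y) / P(x)
posterior = {y: likelihood[y] * prior[y] / evidence for y in prior}
print(posterior)
```

Note how the unlikely prior (spam is only 20% of messages) is overturned by strong evidence, exactly the product of baseline probability and conditional evidence described above.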

In the context of discriminative learning, we can see that a discriminative model directly computes the posterior; we could have a model of the likelihood or prior, but it is not required in this approach. Even though you may not have realized it, most of the models you have probably used in the machine learning toolkit are discriminative, such as the following:

  • Linear regression

  • Logistic regression

  • Random forests

  • Gradient-boosted decision trees (GBDT)

  • Support vector machines (SVM)

The first two (linear and logistic regression) model the outcome, Y, conditional on the data, X, using a normal or Gaussian (linear regression) or sigmoidal (logistic regression) probability function. In contrast, the last three have no formal probability model—they compute a function (an ensemble of trees for random forests or GBDT, or an inner product distribution for SVM) that maps X to Y, using a loss or error function to tune those estimates. Given this nonparametric nature, some authors have argued that these constitute a separate class of non-model discriminative algorithms (Jebara, Tony, 2004. Machine Learning: Discriminative and Generative. Kluwer Academic/Springer. https://www.springer.com/gp/book/9781402076473).
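To make the discriminative idea concrete, here is a sketch of an already-fitted logistic regression in plain Python. The weights and inputs are hypothetical; the point is that the model maps features x directly to the posterior P(Y = 1 | X = x) without ever modeling P(X) or P(X|Y):

```python
import math

def sigmoid(z):
    # Squashes any real number into (0, 1), satisfying the probability axioms.
    return 1.0 / (1.0 + math.exp(-z))

# Illustrative, pre-fitted parameters (not learned here).
w = [1.5, -0.8]
b = 0.2

def predict_proba(x):
    # Direct mapping from features to the posterior P(Y = 1 | X = x).
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return sigmoid(z)

print(predict_proba([2.0, 1.0]))  # a probability in (0, 1)
```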

In contrast, a generative model attempts to learn the joint distribution P(Y, X) of the labels and the input data. Recall the definition of joint probability:

P(X, Y) = P(X|Y)P(Y)

We can rewrite Bayes’ theorem as follows:

P(Y|X) = P(X, Y) / P(X)

Instead of learning a direct mapping of X to Y using P(Y|X), as in the discriminative case, our goal is to model the joint probabilities of X and Y using P(X, Y). While we can use the resulting joint distribution of X and Y to compute the posterior, P(Y|X), and learn a targeted model, we can also use this distribution to sample new instances of the data, either by jointly sampling new tuples (x, y), or by sampling new data inputs using a target label, Y, with the following expression:

P(X|Y = y)
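This two-step sampling scheme can be sketched with a toy generative model: draw a class from the prior P(Y), then draw a data point from a class-conditional distribution P(X|Y = y). The class-conditional Gaussians below are illustrative stand-ins, not learned parameters:

```python
import random

random.seed(42)

# A toy generative model of P(X, Y): a prior over labels plus a
# class-conditional Gaussian P(X | Y = y) for each label.
prior = {"cat": 0.5, "dog": 0.5}
cond = {"cat": (0.0, 1.0), "dog": (4.0, 1.0)}  # (mean, std) of P(X | Y)

def sample_joint():
    # Jointly sample a new tuple (x, y): first y ~ P(Y), then x ~ P(X | Y = y).
    y = random.choices(list(prior), weights=list(prior.values()))[0]
    mu, sigma = cond[y]
    return random.gauss(mu, sigma), y

def sample_x_given(y):
    # Conditioned generation: fix the label and sample x ~ P(X | Y = y).
    mu, sigma = cond[y]
    return random.gauss(mu, sigma)

pairs = [sample_joint() for _ in range(5)]
print(pairs)
print(sample_x_given("dog"))
```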

Types of generative models

Examples of generative models include the following:

Naive Bayes classifiers, though typically applied to discriminative classification tasks, utilize Bayes’ theorem to learn the joint distribution of X and Y under the assumption that the X variables are independent of one another given the class. Similarly, Gaussian mixture models describe the likelihood of a data point belonging to one of a group of normal distributions using the joint probability of the label and these distributions.
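A minimal sketch of how naive Bayes assembles the joint distribution from a prior and per-feature conditionals (the tiny dataset and labels are made up for illustration):

```python
from collections import Counter

# Toy dataset: each row is (x1, x2, y). Naive Bayes learns P(Y) and,
# assuming features are independent given the class, each P(x_i | Y);
# their product defines the joint P(X, Y).
data = [
    (1, 0, "spam"), (1, 1, "spam"), (1, 1, "spam"),
    (0, 0, "ham"), (0, 1, "ham"), (0, 0, "ham"), (1, 0, "ham"),
]

labels = [y for _, _, y in data]
prior = {y: c / len(data) for y, c in Counter(labels).items()}

def cond_prob(feature_idx, value, y):
    # Empirical estimate of P(x_feature = value | Y = y).
    rows = [r for r in data if r[2] == y]
    return sum(1 for r in rows if r[feature_idx] == value) / len(rows)

def joint(x1, x2, y):
    # P(X1 = x1, X2 = x2, Y = y) under the naive independence assumption.
    return prior[y] * cond_prob(0, x1, y) * cond_prob(1, x2, y)

print(joint(1, 1, "spam"), joint(1, 1, "ham"))
```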

Latent Dirichlet Allocation (LDA) represents a document as the joint probability of a word and a set of underlying keyword lists (topics) that are used in a document. Hidden Markov models express the joint probability of a state and the next state of the data, such as the weather on successive days of the week. The VAE and GAN models also utilize joint distributions to map between complex data types. This mapping allows us to generate data from random vectors or transform one kind of data into another.

As already mentioned, another view of generative models is that they allow us to generate samples of X if we know an outcome, Y. In the first four models in the previous list, this conditional probability is just a component of the model formula, with the posterior estimates still being the ultimate objective. However, in the last three examples, which are all deep neural network models, learning the conditional of X dependent upon a hidden, or latent, variable, Z, is actually the main objective, to generate new data samples. Using the rich structure allowed by multi-layered neural networks, these models can approximate the distribution of complex data types such as images, natural language, and sound. Also, instead of being a target value, Z is often a random number in these applications, serving merely as an input from which to generate a large space of hypothetical data points. To the extent that we have a label (such as whether a generated image should be of a dog or a dolphin, or the genre of a generated song), the model is P(X|Y = y, Z = z), where the label Y controls the generation of data that is otherwise unrestricted by the random nature of Z.