Search⌘ K

The MNIST Database

Explore the MNIST database as a foundational dataset for training generative models like Deep Belief Networks using TensorFlow. Understand its origins, image preprocessing steps, and how it serves as a benchmark for machine learning. Learn to load, preprocess, and prepare MNIST data to train models that generate realistic digit images.

In developing the DBN model, we’ll use a dataset that we have discussed before—the MNIST database, which contains digital images of hand-drawn digitsLeCun, Yann; Léon Bottou; Yoshua Bengio; Patrick Haffner (1998). Gradient-Based Learning Applied to Document Recognition. Proceedings of the IEEE. 86 (11): 2278–2324 from 00 to 99. This database is a combination of two sets of earlier images from the National Institute of Standards and Technology (NIST): Special Database 11 (digits written by US high school students) and Special Database 2LeCun, Yann; Corinna Cortes; Christopher J.C. Burges. MNIST handwritten digit database, Yann LeCun, Corinna Cortes, and Chris Burges (written by US Census Bureau employees), the sum of which is split into 60,000 training images and 10,000 test images.

The original images in the dataset were all black and white, while the modified dataset normalized them to fit into a 2020 x 2020 ...