
Training an MNIST digit recognizer

The aim here is to achieve the following with an MNIST digit recognizer:

  • 99.4% validation accuracy

  • Less than 20k Parameters

  • Less than 20 Epochs

  • Use Batch Normalization, Dropout, and a fully connected layer after Global Average Pooling

Model

A summary of the model is shown below.

Figure 1.a: Model summary
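A minimal PyTorch sketch of a network satisfying the constraints above (Batch Normalization, Dropout, and a fully connected layer after Global Average Pooling, under 20k parameters). The layer widths and dropout rate here are illustrative assumptions, not the exact architecture summarized in Figure 1.a:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SmallNet(nn.Module):
    """A small CNN for 28x28 MNIST digits, well under 20k parameters."""

    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 8, 3, padding=1)
        self.bn1 = nn.BatchNorm2d(8)
        self.conv2 = nn.Conv2d(8, 16, 3, padding=1)
        self.bn2 = nn.BatchNorm2d(16)
        self.conv3 = nn.Conv2d(16, 16, 3, padding=1)
        self.bn3 = nn.BatchNorm2d(16)
        self.conv4 = nn.Conv2d(16, 32, 3, padding=1)
        self.bn4 = nn.BatchNorm2d(32)
        self.drop = nn.Dropout(0.1)        # rate is an assumption
        self.fc = nn.Linear(32, 10)        # FC layer after GAP

    def forward(self, x):
        x = F.max_pool2d(F.relu(self.bn1(self.conv1(x))), 2)  # 28x28 -> 14x14
        x = F.relu(self.bn2(self.conv2(x)))
        x = F.max_pool2d(F.relu(self.bn3(self.conv3(x))), 2)  # 14x14 -> 7x7
        x = F.relu(self.bn4(self.conv4(x)))
        x = self.drop(x)
        x = F.adaptive_avg_pool2d(x, 1).flatten(1)  # Global Average Pooling
        return self.fc(x)

model = SmallNet()
n_params = sum(p.numel() for p in model.parameters())
```

This sketch comes in at roughly 8.7k parameters, leaving headroom under the 20k budget for wider layers if accuracy falls short.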

Data augmentation is important for regularizing the network and increasing the effective size of the training set. The augmentation strategy used here is a random affine transformation, which helps the model avoid overfitting on the training set and generalize well to the test set. In Euclidean geometry, an affine transformation is a geometric transformation that preserves lines and parallelism (but not necessarily distances and angles).

<iframe src="//commons.wikimedia.org/wiki/File:Affine_transformations.ogv?embedplayer=yes" width="480" height="480" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe>

Training Log

Figure 1.b: Training log

Result

An accuracy of 99.46% is achieved at the 19th epoch with a model of 12,674 parameters on the MNIST data.

Figure 1.c: Loss and accuracy plots