
Automatic Differentiation and CNNs

This project is heavily inspired by micrograd and tinygrad.

In this project I wrote an AD library and put it to the test by training a CNN on the MNIST dataset.

In the folder AutomaticDifferntiation you can find the core types for scalars, vectors, matrices and tensors as well as the logic for backpropagating basic functions such as addition, multiplication, subsetting, etc.
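To give a rough idea of what backpropagating such basic functions involves, here is a minimal scalar-only sketch in Julia (illustrative only, with made-up names; the actual types in AutomaticDifferntiation also cover vectors, matrices, and tensors):

```julia
# Minimal scalar reverse-mode AD sketch (not the real library's types).
mutable struct Node
    value::Float64
    grad::Float64
    backward::Function   # propagates this node's grad to its parents
end

Node(value) = Node(value, 0.0, () -> nothing)

function Base.:+(a::Node, b::Node)
    out = Node(a.value + b.value)
    out.backward = () -> begin
        a.grad += out.grad            # d(a+b)/da = 1
        b.grad += out.grad            # d(a+b)/db = 1
    end
    return out
end

function Base.:*(a::Node, b::Node)
    out = Node(a.value * b.value)
    out.backward = () -> begin
        a.grad += b.value * out.grad  # d(a*b)/da = b
        b.grad += a.value * out.grad  # d(a*b)/db = a
    end
    return out
end

# z = x*y + x: seed z.grad and call the closures in reverse order.
x, y = Node(2.0), Node(3.0)
t = x * y
z = t + x
z.grad = 1.0
z.backward(); t.backward()
@assert x.grad == 4.0 && y.grad == 2.0   # dz/dx = y + 1, dz/dy = x
```

Each operation records a closure that knows how to push gradients back to its inputs; calling these closures in reverse topological order is exactly backpropagation.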

In ADNN I use these types to implement a convolutional neural net, with dense, convolutional, and max-pooling layers, as well as the necessary activations and loss functions.
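As a sketch of how such a layer can be built on top of these types (hypothetical names, plain Float64 arrays instead of AD-tracked values for brevity, not the actual ADNN code), a dense layer is just a weight matrix, a bias vector, and an activation:

```julia
# Hypothetical dense layer; in the real library W and b would be AD-tracked.
struct DenseLayer
    W::Matrix{Float64}
    b::Vector{Float64}
    activation::Function
end

DenseLayer(in::Int, out::Int, activation=identity) =
    DenseLayer(randn(out, in) .* sqrt(2 / in), zeros(out), activation)

relu(x) = max.(x, 0)

# Forward pass: activation(W*x .+ b), applied column-wise to a batch.
(layer::DenseLayer)(x) = layer.activation(layer.W * x .+ layer.b)

layer = DenseLayer(32, 10, relu)
x = randn(32, 4)                 # a batch of 4 flattened inputs
@assert size(layer(x)) == (10, 4)
```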

In mnist/mnist.jl I tried three different models on the MNIST dataset. The last one, with the architecture

```julia
  # First convolution, operating upon a 28x28 image
  Conv((3, 3), 1=>16, relu),
  MaxPool((2,2)),

  # Second convolution, operating upon a 13x13 image
  Conv((3, 3), 16=>32, relu),
  MaxPool((2,2)),

  # Third convolution, operating upon a 5x5 image
  Conv((3, 3), 32=>32, relu),
  MaxPool((2,2)),

  # Reshape the 4d tensor into a 2d one using flatten; at this point it should be (1, 1, 32, N)
  flatten,
  Dense(32, 10)
```

reached 97.6% accuracy after 20 epochs, which takes around 26 minutes on my MacBook Pro (2017, i5). In comparison, an implementation of the same model in Flux.jl takes 18 minutes to train. But the goal of this project was not performance.
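
For reference, the Flux.jl model used in the comparison can be written roughly like this (a sketch assuming Flux's older implicit-parameter training API; MNIST data loading is omitted and replaced with a dummy batch):

```julia
using Flux
using Flux: onehotbatch, logitcrossentropy

# The same architecture as above, wrapped in a Chain.
model = Chain(
    Conv((3, 3), 1 => 16, relu),
    MaxPool((2, 2)),
    Conv((3, 3), 16 => 32, relu),
    MaxPool((2, 2)),
    Conv((3, 3), 32 => 32, relu),
    MaxPool((2, 2)),
    Flux.flatten,
    Dense(32, 10),
)

# x: a 28x28x1xN batch of images, y: one-hot encoded labels of size 10xN.
loss(x, y) = logitcrossentropy(model(x), y)

x = rand(Float32, 28, 28, 1, 8)      # dummy batch standing in for MNIST images
y = onehotbatch(rand(0:9, 8), 0:9)   # dummy labels

opt = ADAM()
data = [(x, y)]                      # stand-in for an iterator of minibatches
Flux.train!(loss, Flux.params(model), data, opt)   # one pass; repeat for 20 epochs
```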
