Agrad - Auto Grad

A substandard autogradient implementation using only Numpy. This is an extension of my ml library and I wish to use it to implement more complex networks. It is an amalgamation of Joel Grus' and Andrej Karpathy's implementation of autograd.

Done:

loss: mse, softmax cross-entropy
optimizer: basic (SGD), adam, RMSprop, momentum
activation/functions: tanh, leaky relu, sigmoid, relu, exp, basic tensor ops (add, subtract, matmul etc.)
architectures: Linear (MLP) see mnist.py, Transformers (LLaMA.py)

Todo:

building blocks: conv
architectures: imagenet, Mamba
other: KV cache for transformers, test backprop stability on transformer

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
agrad		agrad
.gitignore		.gitignore
LLaMA.py		LLaMA.py
README.md		README.md
mnist.py		mnist.py
setup.py		setup.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agrad - Auto Grad

Done:

Todo:

About

Releases

Packages

Languages

arnavg115/agrad

Folders and files

Latest commit

History

Repository files navigation

Agrad - Auto Grad

Done:

Todo:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages