Distributed SGD

This project showcases a distributed implementation of the stochastic gradient descent (SGD) algorithm over gRPC and Kubernetes, and compares it with the Hogwild approach (Recht et al., 2011). Two versions are provided. In the synchronous version, a master node coordinates the gradient computation and the SGD update steps across the worker nodes. In the asynchronous version, the worker nodes compute on their own and frequently exchange weight updates with each other; the master only manages the start and end of the run (e.g. splitting the work and collecting the results).
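
The contrast between the two schemes can be illustrated with a small, self-contained Scala sketch. This is a simplification, not the project's gRPC code: the object and method names, the hinge-loss objective, and the use of plain futures are assumptions made for illustration only.

import scala.concurrent.{Await, Future}
import scala.concurrent.duration._
import scala.concurrent.ExecutionContext.Implicits.global

// Conceptual sketch only: contrasts synchronous and asynchronous (Hogwild-style)
// coordination of SGD workers. Names and objective are illustrative assumptions.
object SgdSketch {
  type Vec = Array[Double]

  // Gradient computed by one worker on its shard of (features, label) examples.
  def localGradient(weights: Vec, shard: Seq[(Vec, Double)]): Vec = {
    val grad = new Array[Double](weights.length)
    for ((x, y) <- shard) {
      val margin = y * x.zip(weights).map { case (a, b) => a * b }.sum
      if (margin < 1.0)                      // hinge-loss sub-gradient
        for (i <- x.indices) grad(i) -= y * x(i)
    }
    grad
  }

  // Synchronous: the master waits for every worker's gradient before
  // applying a single coordinated update step.
  def syncRound(weights: Vec, shards: Seq[Seq[(Vec, Double)]], lr: Double): Vec = {
    val grads = Await.result(
      Future.sequence(shards.map(s => Future(localGradient(weights, s)))),
      1.minute)
    val summed = grads.reduce((a, b) => a.zip(b).map { case (x, y) => x + y })
    weights.zip(summed).map { case (w, g) => w - lr * g / shards.size }
  }

  // Asynchronous (Hogwild-style): each worker updates the shared weights on its
  // own schedule, without locks and without waiting for the other workers.
  def asyncRound(weights: Vec, shards: Seq[Seq[(Vec, Double)]], lr: Double): Vec = {
    val shared = weights.clone()
    val updates = shards.map { s =>
      Future {
        val g = localGradient(shared, s)     // may read slightly stale weights
        for (i <- g.indices) shared(i) -= lr * g(i)
      }
    }
    Await.result(Future.sequence(updates), 1.minute)
    shared
  }
}

In the real system the workers are separate processes communicating over gRPC rather than futures in one JVM, but the coordination pattern is the same: the synchronous round is a barrier per update, whereas the asynchronous round trades staleness of the weights for the absence of waiting.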

Getting started

cd data
./download.sh
cd ..
sbt
> run
> test
> scalafmt

SGD settings can be modified in src/main/resources/application.conf.
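
For orientation, application.conf typically follows the HOCON format used by Scala projects. The snippet below is a hypothetical sketch only; the key names and values are assumptions, not the project's actual configuration.

# Hypothetical example: the actual keys in application.conf may differ.
sgd {
  learning-rate = 0.03
  batch-size    = 128
  epochs        = 10
  workers       = 4
}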

Running on Kubernetes

./build.sh
./run.sh -async
./run.sh -sync

SGD settings can be modified in kube/config-async.conf and kube/config-sync.conf.

Dataset

The RCV1 dataset can be downloaded with the data/download.sh script (see Getting started).

References

  • Recht, B., Ré, C., Wright, S. J., & Niu, F. (2011). Hogwild!: A lock-free approach to parallelizing stochastic gradient descent. In Advances in Neural Information Processing Systems, pp. 693-701.
