GitHub - cfsmp3/nvidia_gpu_prometheus_exporter: NVIDIA GPU Prometheus Exporter

NVIDIA GPU Prometheus Exporter - community fork

This is a Prometheus Exporter for exporting NVIDIA GPU metrics. It uses the Go bindings for NVIDIA Management Library (NVML) which is a C-based API that can be used for monitoring NVIDIA GPU devices. Unlike some other similar exporters, it does not call the nvidia-smi binary.

Note: I'm calling this "community" because the starting point is a merge of pending requests on the original mindprince's repo, plus a lot of changes from other forks that for whatever reason weren't sent upstream at all.

And then, whatever else I add.

This fork will be used extensively in production and as such as it will maintained for the forseable future.

Building

make build
make push

Running

The exporter requires the following:

access to NVML library (libnvidia-ml.so.1).
access to the GPU devices.

To make sure that the exporter can access the NVML libraries, either add them to the search path for shared libraries. Or set LD_LIBRARY_PATH to point to their location.

By default the metrics are exposed on port 9445. This can be updated using the -web.listen-address flag.

Running inside a container

There's a docker image available on Docker Hub at cfsmp3/nvidia_gpu_prometheus_exporter

If you are running the exporter inside a container, you will need to do the following to give the container access to NVML library:

-e LD_LIBRARY_PATH=<path-where-nvml-is-present>
--volume <above-path>:<above-path>

And you will need to do one of the following to give it access to the GPU devices:

Run with --privileged
If you are on docker v17.04.0-ce or above, run with --device-cgroup-rule 'c 195:* mrw'

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go
nvidia_gpu_prometheus_exporter		nvidia_gpu_prometheus_exporter

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NVIDIA GPU Prometheus Exporter - community fork

Building

Running

Running inside a container

About

Releases

Packages

Languages

License

cfsmp3/nvidia_gpu_prometheus_exporter

Folders and files

Latest commit

History

Repository files navigation

NVIDIA GPU Prometheus Exporter - community fork

Building

Running

Running inside a container

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages