GPU-Monitoring via Slack or Mattermost

In a nut shell

Monitoring of a GPU system sending either Slack or Mattermost messages via webhooks

Requirements

NVidia® GPU since it is using nvidia-smi to monitor the GPU jobs.
Linux.
Python 2 (port to Python 3 should be straightforward).

Usage

Create an incoming webhook in Slack or Mattermost and save the web adress of the web hook.
Open the file gpumonitor.py and add the web address of the web hook to the mattermostIncomingWebhook variable.
Start the gpumonitor.py script

If you would like, you can also install the gpumonitor as an init.d service

Copy the gpumonitor.py script to /usr/local/bin/gpumonitor: cp gpumonitor.py /usr/local/bin/gpumonitor
Make sure that the file is executable: chmod +x /usr/local/bin/gpumonitor
Copy the file gpumonitor-init.d to /etc/init.d: cp gpumonitor-init.d /etc/init.d/gpumonitor
Start the init service via /etc/init.d/gpumonitor start
As usual you can monitor the service status via /etc/init.d/gpumonitor status

Options for configuration

mattermostIncomingWebhook: Web address of incoming Slack or Mattermost webhook
nvidiaLogoLink: Logo used for the bot in Mattermost

Screenshot

Screenshot taken from Mattermost group contains

Message after a new job was started.
Status message since a new job was created.
Message after job has finished.

Legal notice

This is a private project.
I am not in any way affiliated with Mattermost, NVidia or Slack.
NVIDIA, the NVIDIA logo, and all other trademarks mentioned in this document are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. Other company and product names may be trademarks of the respective companies with which they are associated.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gpu-monitoring-edited.png		gpu-monitoring-edited.png
gpumonitor		gpumonitor
gpumonitor-init.d		gpumonitor-init.d
gpumonitor.py		gpumonitor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPU-Monitoring via Slack or Mattermost

In a nut shell

Requirements

Usage

Options for configuration

Screenshot

Legal notice

About

Releases

Packages

Languages

License

mharrend/GPU-Monitoring-Slack-Mattermost

Folders and files

Latest commit

History

Repository files navigation

GPU-Monitoring via Slack or Mattermost

In a nut shell

Requirements

Usage

Options for configuration

Screenshot

Legal notice

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages