Sends a notification to Slack when the monitored GPUs stop running.
git clone https://github.com/kubokoHappy/gpu_slack_notification.git
cd gpu_slack_notification
docker build -t gpu_slack_notification:latest .
docker run -it --name gpu_slack_notification --gpus all gpu_slack_notification:latest
https://slack.com/apps/A0F7XDUAZ--incoming-webhook-?tab=more_info
- Add the "Webhook URL" obtained from the page above to /workspace/secret/secret.yml in the following form:
SLACK_WEBHOOK_URL: 'https://hooks.slack.com/services/XXXXXXXXX/QQQQQQQQQQQQQQQQQQQ'
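
For reference, a minimal sketch of how a message can be posted to an Incoming Webhook URL like the one above. This is not taken from gpu_operation_check.py; the function names `build_slack_request` and `send_slack_message` are hypothetical, and only the Python standard library is assumed. Slack Incoming Webhooks accept a JSON body with a `text` field.

```python
import json
import urllib.request


def build_slack_request(webhook_url: str, text: str) -> urllib.request.Request:
    """Build a POST request carrying a plain-text Slack message (hypothetical helper)."""
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_slack_message(webhook_url: str, text: str, timeout: float = 10.0) -> int:
    """Send the message and return the HTTP status code (hypothetical helper)."""
    request = build_slack_request(webhook_url, text)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status
```

Calling `send_slack_message(url, "job name: GPUs stopped")` with the value of SLACK_WEBHOOK_URL should post that text to the channel the webhook is bound to.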
- use_gpu_ids : Comma-separated indices of the GPUs to monitor (e.g. "1,2,3").
- sleep_time_sec : Interval, in seconds, between GPU checks.
- job_name : Job name shown in the Slack notification.
python gpu_operation_check.py --use_gpu_ids="1,2,3,4,5,6,7" --sleep_time_sec=600 --job_name='job name'
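
The check behind these arguments might look roughly like the sketch below. It is an illustration only, not the code in gpu_operation_check.py: it assumes "stopped" means a GPU's utilization is at or below a threshold, that utilization is read through `nvidia-smi --query-gpu`, and that every reported value is numeric. All function names and the `threshold_percent` parameter are hypothetical.

```python
import subprocess


def parse_gpu_ids(value: str) -> list[int]:
    """Turn a string such as "1,2,3" into [1, 2, 3]."""
    return [int(part) for part in value.split(",") if part.strip()]


def query_gpu_utilization() -> str:
    """Return nvidia-smi output as CSV lines of the form "index, utilization"."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=index,utilization.gpu",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def parse_utilization(csv_text: str) -> dict[int, int]:
    """Map each GPU index to its utilization percentage."""
    utilization = {}
    for line in csv_text.strip().splitlines():
        index, percent = (field.strip() for field in line.split(","))
        utilization[int(index)] = int(percent)
    return utilization


def idle_gpus(utilization: dict[int, int], gpu_ids: list[int],
              threshold_percent: int = 0) -> list[int]:
    """Return the watched GPU ids whose utilization is at or below the threshold.

    A GPU missing from the nvidia-smi output is treated as idle (an assumption).
    """
    return [
        gpu_id for gpu_id in gpu_ids
        if utilization.get(gpu_id, 0) <= threshold_percent
    ]
```

A monitoring loop under these assumptions would call `idle_gpus(parse_utilization(query_gpu_utilization()), parse_gpu_ids("1,2,3,4,5,6,7"))` every sleep_time_sec seconds and notify Slack once the watched GPUs are reported idle.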