Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nvdia-cuda-mps-server consistently hangs at the "creating worker thread" log #49

Open
yangcheng-dev opened this issue Jan 11, 2024 · 0 comments

Comments

@yangcheng-dev
Copy link

I am using nvidia-cuda-mps-server for GPU virtualization (GPU is V100), and the plugin comes from Nebuly-NVIDIA. The CUDA client is k8s.gcr.io/cuda-vector-add:v0.1. After the CUDA client starts as a container, the nvidia-cuda-mps-server process consistently hangs at the "creating worker thread" log, and the client does not print any logs. Where could the problem be? Is it possible that my GPU card does not support MPS, or is it an issue with the client?

Steps to reproduce the issue
1.install Nebuly-NVIDIA plugin:https://github.com/nebuly-ai/k8s-device-plugin
2.start a pod which image is “k8s.gcr.io/cuda-vector-add:v0.1”
3. client does not print any logs,and nvidia-cuda-mps-server hangs at the "creating worker thread" log
WechatIMG284
WechatIMG285
WechatIMG286
WechatIMG288

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant