How to make sure TDW runs with GPUs on a Linux server? #549
Comments
@LuZeking In your xorg file, the NVIDIA device is |
Thanks for your kind reply :)! But I am a bit confused here because I lack experience with this kind of setup. I have 2 GPUs, as shown above in the 2 xorg files; one was set as Identifier Moreover, setting export DISPLAY=:1.0 also didn't work for me: On the remote Linux HPC, did I set any configuration wrong? |
Or could the reason be that my CUDA and cuDNN versions do not meet the TDW requirements? I see that the example in install.md uses CUDA 9.0. |
CUDA is probably irrelevant. Try this:
|
Hello Alters, the /etc/X11/xorg.conf content is: Section "Monitor" Section "Monitor" Section "Device" Section "Device" Section "Screen" Section "Screen" Could you find something else that may cause this problem? |
Try these options:
or
or
|
Hi Esther, Thank you for your kindness and help:). Unfortunately, the commands you suggested still failed. Do you have any other suggestions? |
Can you send me the exact shell command you're using to launch TDW.x86_64? Maybe it's just not formatted correctly... |
Also, try these:
|
If that doesn't work, please send the list of devices and screens on the server. I'm not sure how exactly to do this but this post might help. I'm interested in seeing if it outputs the same device/screen indices that are in the xorg file. https://askubuntu.com/a/123096 |
Hi Esther, thank you for such detailed suggestions! I have tried them all, but unfortunately it still does not work. The simulation window launched (but still without GPUs) only when I set DISPLAY as follows:
Then I command And this means it is the same device/screen indices configured in the xorg file, right? |
Hi @LuZeking Sorry I've been slow to respond. I wonder why the display includes You might also have better luck with our Docker container. |
Hi Esther, thanks for your reply. My setup is Windows 11 on my laptop and Ubuntu 20.04/22.04 on the remote High-Performance Computer (HPC), which is managed by Slurm (I tried 2 different HPCs, one with Ubuntu 20.04 and one with 22.04). The controller is running on my laptop, and the build is running on the HPC. And yes, I am trying to do X11 forwarding, so that I can use the GPUs on the HPC but show the TDW window on my laptop through X11. BTW, in https://github.com/threedworld-mit/tdw/blob/master/Documentation/lessons/remote/x11_forwarding.md, I only found the macOS setup. Does this mean it won't work on Windows? Actually, it works for a simple controller that prints "hello world", but a complex one like ur5 that needs a GPU is too slow or just crashes. I will try the Docker container soon, but it needs root permission, so I need to ask the HPC admin for help. |
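Since the thread keeps coming back to what `DISPLAY` is set to, here is a minimal Python sketch that splits a `DISPLAY` value into host, display number, and screen number. It is only an illustration: the helper names `parse_display` and `looks_ssh_forwarded` are hypothetical and not part of TDW, and the forwarding check is a heuristic that assumes sshd's default `X11DisplayOffset` of 10. IPv6 hosts are not handled.

```python
import re


def parse_display(display):
    """Split an X11 DISPLAY value such as ':1.0' into (host, display, screen)."""
    match = re.fullmatch(r"([^:]*):(\d+)(?:\.(\d+))?", display)
    if match is None:
        raise ValueError(f"Unrecognized DISPLAY value: {display!r}")
    host, number, screen = match.groups()
    # The screen part is optional; X treats a missing screen as screen 0.
    screen_number = int(screen) if screen is not None else 0
    return host, int(number), screen_number


def looks_ssh_forwarded(display):
    """Heuristic only: a host part plus a display number of 10 or more
    is what SSH X11 forwarding typically produces (e.g. 'localhost:10.0')."""
    host, number, _ = parse_display(display)
    return host != "" and number >= 10


if __name__ == "__main__":
    for value in (":1.0", "localhost:10.0"):
        print(value, parse_display(value), looks_ssh_forwarded(value))
```

A value like `:1.0` points at a local X server (display 1, screen 0), which is what the xorg files on the HPC would serve. A value like `localhost:10.0` usually means the window is being sent back over SSH, in which case the HPC's GPUs are probably not the ones doing the rendering.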
Sorry for not responding to this for a while. I had not realized you're trying to forward the X11 port. We only have Mac instructions because we haven't tried doing it yet on Windows. One of our users provided the Mac instructions. If you manage to find a solution for Windows, we can add it to the documentation. In the meantime, can't you just run the TDW window on the HPC's own X? |
Hello, I connected a monitor directly to the HPC and ran the TDW window on the HPC's own X, but Player.log shows that the CPU is used, not the GPU. I did not use the nohup command to create a new virtual monitor, because using a virtual monitor would make my monitor go black. Here's my Player.log: Mono path[0] = '/data1/user/lpy/TDW/TDW_Data/Managed' |
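One way to check from Player.log whether the build fell back to CPU rendering is to look for the renderer name. The sketch below assumes the log contains a line starting with `Renderer:` (as Unity's OpenGL logs on Linux typically do) and that software rasterizers identify themselves as `llvmpipe`, `softpipe`, or `swrast`. The function names and the sample log lines are hypothetical and are not taken from the log posted above.

```python
SOFTWARE_RENDERERS = ("llvmpipe", "softpipe", "swrast")


def find_renderer(log_text):
    """Return the text after the first 'Renderer:' line, or None if absent."""
    for line in log_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("Renderer:"):
            return stripped[len("Renderer:"):].strip()
    return None


def uses_software_rendering(log_text):
    """True if the renderer looks like a CPU rasterizer, None if unknown."""
    renderer = find_renderer(log_text)
    if renderer is None:
        return None
    return any(name in renderer.lower() for name in SOFTWARE_RENDERERS)


if __name__ == "__main__":
    # Hypothetical log excerpts, for illustration only.
    cpu_log = "Renderer: llvmpipe (LLVM 12.0.0, 256 bits)\nVendor: Mesa"
    gpu_log = "Renderer: NVIDIA TITAN RTX/PCIe/SSE2\nVendor: NVIDIA Corporation"
    print(uses_software_rendering(cpu_log))
    print(uses_software_rendering(gpu_log))
```

To run it against a real log, read the file with `open(path).read()` and pass the text in. A `None` result only means the assumed `Renderer:` line was not found.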
Hi,
I followed the process in install.md (remote Linux part), and all the steps seem to work. But when I use the X servers to run a simulation, only the CPU is used even though I have 2 GPUs, which makes the simulation very slow.
e.g. nvidia-smi:
xorg config files:
`# xorg-1-tdw.conf
# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig: version 515.86.01
Section "Files"
EndSection
Section "InputDevice"
# generated from default
Identifier "Mouse0"
Driver "mouse"
Option "Protocol" "auto"
Option "Device" "/dev/psaux"
Option "Emulate3Buttons" "no"
Option "ZAxisMapping" "4 5"
EndSection
Section "InputDevice"
# generated from default
Identifier "Keyboard0"
Driver "kbd"
EndSection
Section "Monitor"
Identifier "Monitor0"
VendorName "Unknown"
ModelName "Unknown"
Option "DPMS"
EndSection
Section "Device"
Identifier "Device0"
Driver "nvidia"
VendorName "NVIDIA Corporation"
BoardName "NVIDIA TITAN RTX"
BusID "PCI:179:0:0"
EndSection
# xorg-2-tdw.conf
# nvidia-xconfig: X configuration file generated by nvidia-xconfig
# nvidia-xconfig: version 515.86.01
Section "Files"
EndSection
Section "InputDevice"
# generated from default
Identifier "Mouse0"
Driver "mouse"
Option "Protocol" "auto"
Option "Device" "/dev/psaux"
Option "Emulate3Buttons" "no"
Option "ZAxisMapping" "4 5"
EndSection
Section "InputDevice"
# generated from default
Identifier "Keyboard0"
Driver "kbd"
EndSection
Section "Monitor"
Identifier "Monitor0"
VendorName "Unknown"
ModelName "Unknown"
Option "DPMS"
EndSection
Section "Device"
Identifier "Device1"
Driver "nvidia"
VendorName "NVIDIA Corporation"
BoardName "NVIDIA TITAN RTX"
BusID "PCI:23:0:0"
EndSection
`
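To compare the two config files against what the driver reports, it can help to pull out each `Device` section's `Identifier` and `BusID` programmatically. The following is a minimal Python sketch, not part of TDW. The function names are hypothetical, and the hex bus IDs in the example are illustrative values chosen to match the decimal `BusID` entries above (179 is 0xB3, 23 is 0x17). Note that xorg `BusID` values are decimal, while `nvidia-smi` and `lspci` print bus IDs in hexadecimal.

```python
import re


def parse_device_sections(xorg_text):
    """Return one dict per Section "Device" with its Identifier and BusID."""
    devices = []
    current = None
    for raw_line in xorg_text.splitlines():
        line = raw_line.strip()
        if re.match(r'Section\s+"Device"', line):
            current = {}
        elif line == "EndSection":
            if current is not None:
                devices.append(current)
            current = None
        elif current is not None:
            match = re.match(r'(Identifier|BusID)\s+"([^"]*)"', line)
            if match:
                current[match.group(1)] = match.group(2)
    return devices


def hex_bus_id_to_xorg(bus_id):
    """Convert a hex bus ID such as '00000000:B3:00.0' to 'PCI:179:0:0'."""
    parts = bus_id.split(":")
    bus_hex = parts[-2]
    device_hex, function_hex = parts[-1].split(".")
    return f"PCI:{int(bus_hex, 16)}:{int(device_hex, 16)}:{int(function_hex, 16)}"


if __name__ == "__main__":
    sample = '''
    Section "Device"
        Identifier "Device0"
        Driver "nvidia"
        BusID "PCI:179:0:0"
    EndSection
    '''
    print(parse_device_sections(sample))
    print(hex_bus_id_to_xorg("00000000:B3:00.0"))
```

If the converted bus IDs do not match the `BusID` entries in the xorg files, the X servers may be pointing at the wrong PCI devices.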
Finally, I checked the Player.log:
I have been struggling with this and have no idea how to solve it. Could you help me?