I am using Tesla 2075 (cc 2.0) with CUDA 4.2. I am running my program on a non-display GPU(Tesla) and I am getting the error:
"the launch timed out and was terminated "
Is this error expected on a non-display GPU ?
And, how could I disable the watchdog timer ?
I assume you are running on Windows. If so, put the Tesla 2075 in TCC mode. This will allow compute access and Windows won’t manage it like a display device, which will get rid of the watchdog timer. If you’re having trouble locating nvidia-smi, just do a windows search for nvidia-smi.exe. (It should have been installed with the display driver.) Then, if the C2075 is the only CUDA GPU in the system, the command will be like this:
nvidia-smi -g 0 -dm 1You can also donvidia-smi --helpto get command line help for the tool. This will probably require a reboot of the system after you change this, to get the card into TCC mode.If on the other hand you are running linux and X-windows on this machine, the solution is a little different. One approach is simply to disable X, e.g. by setting runlevel to 3 and rebooting, but there are other ways to do this. However you’ll lose your X GUI on the other GPU (I assume you have another GPU since you said this is a non-display GPU). To preserve X and the GUI on the other GPU, it’s necessary to modify your xorg.conf file to force X onto your display GPU and get it off of your compute (Tesla) GPU. The methods to do this will vary, but if you have 2 NVIDIA GPUs, (one for display) then the X display should be forced onto a single GPU using the BusID parameter in the relevant “Display” section of the xorg.conf file. In addition, any other “Display” sections should be deleted. For example:
The PCI IDs of the GPUs may be determined from the
lspcicommand or from thenvidia-smi –acommand.You may also wish to refer to the X configuration options appendix of the NVIDIA driver README file.