how to solve a Cuda error 999?

Generic forum to discuss Octane Render, post ideas and suggest improvements.
Forum rules
Please add your OS and Hardware Configuration in your signature, it makes it easier for us to help you analyze problems. Example: Win 7 64 | Geforce GTX680 | i7 3770 | 16GB
Post Reply
User avatar
Hatsize7
Licensed Customer
Posts: 31
Joined: Wed Dec 14, 2016 6:01 pm
Location: Switzerland

I hope someone can help me I'm struggling over this error for some time now.

How can we solve a CUDA error 999: failed to create link error?
Sometimes the message is "failed to allocate device array.

I get this error message when I try to run Octanebench. (4.00c) And obviously this render slave doesn't work when I try to render.

Long story short:
The system is Windows 10, motherboard is ASrock H110 Pro BTC+ (13 PCI-E slots)
32Gb of Ram, SSD disk
5 GPUs (3 GTX 1080, 2 GTX 1070Ti)
Nvidia driver currently running is 419.17

The system was working fine for months. Now lately it stops during rendering, and it is a real pain, because the system restarts and the master is still waiting for the frame to finish (usually around 98% it hangs). I realized that even the Octanebench wouldn't run when this happens.

I updated Windows to 1903. I tried different Nvidia Studio drivers, cleaning them with DDU.
I tried the MSI_util_v2 suggested by others for the Watchdog BSOD problem (since it is a slave, I don't know if there is a BSOD, but it happend before when a monitor was plugged in). In the event viewer I see a critical error 41 (Kernel-Power), when this happens. I changed the virtual memory to 100Gb. I removed the audio driver. (all these were solutions for other similar CUDA errors)
I tried the cards one by one, Octanebench works with each when only one is installed. They work until 3 are plugged in. I started getting the CUDA error message when the 4th card was plugged in. Then I tried different PCI-E slots for the cards and I come to a solution when it all worked, and the Octanebench ran smoothly again. I rendered for a few hours, then the machine restarted again, and now I get the CUDA error 999 again while running Octanbench.

PSU is not a problem, it is a new 1500W and the cards never took more than 130W power each. No overlocking.

I'm thinking about trying Windows 7 to see if it works.
Can anybody help me and shed some light on this?
I have another slave with the same setup with 6 GPUs (1080s) running smoothly since the beginning of the year, somehow this machine is acting up in the past few days. What can I do?
Thank you in advance!
Win 10Pro / C4D R23.008 / Octane 20.1.5-R4 / Nvidia driver 456.38
Intel Core i7 9800X / 96gb / RTX3090
User avatar
paride4331
Octane Guru
Posts: 3862
Joined: Fri Sep 18, 2015 7:19 am

Hello Hatsize7,
try installing this Nvidia drivers:
https://www.nvidia.com/Download/driverR ... 3217/en-us
Regards
Paride
2 x Evga Titan X Hybrid / 3 x Evga RTX 2070 super Hybrid
User avatar
Hatsize7
Licensed Customer
Posts: 31
Joined: Wed Dec 14, 2016 6:01 pm
Location: Switzerland

Hello Paride,

I found another solution in the dark catacombs of the forum from Raphael and it seems like it worked.

"**MAKE SURE YOU ACTIVATE THE "OPTIMIZE FOR COMPUTE" IN THE DRIVERS SETTINGS IN THE NVIDIA CONTROL PANEL** (recent drivers only and win10 <= not sure)

also, you want to disable *all* the nvidia hdmi sound devices in the device editor as it causes the watchdog to trigger sometime on big renders, hence restarting the drivers & crashing the render."

It seems this helped. The machine is running 100% in the past 24 hours rendering an animation, with 5 cards, and it didn't run into any issues yet.

Thanks for your response anyway, that's going to be my next try if something goes wrong.
Win 10Pro / C4D R23.008 / Octane 20.1.5-R4 / Nvidia driver 456.38
Intel Core i7 9800X / 96gb / RTX3090
Post Reply

Return to “General Discussion”