Octane crash when running with mixed cards 2080ti & 1080ti
Posted: Sun Feb 03, 2019 8:09 pm
Ran into this odd problem. I have a system with 3x 2080ti's and 1x 1080ti on a cluster & 1x 2080ti & 1 x 1080ti internal direct to the motherboard.
When I load a scene that takes around 5gb of GPU ram Octane fails to render in both IPR & F10.
I get the following cuda errors:
00:01:47 (0107.89) | Reading preview image successfully
00:01:47 (0107.89) | [profile] Function "GetPreviewImage" over "" execution time: 0.004 seconds
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 2: out of memory
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 0: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 1: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 4: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) | IPR: reset image callback
00:01:48 (0108.67) | IPR: wait for the getPreviewImage function
00:01:48 (0108.78) | IPR: free the OpenGL buffers
00:01:48 (0108.78) | IPR: rendering done
00:01:48 (0108.82) * OCTANE API MSG: CUDA error 2 on device 2: out of memory
00:01:48 (0108.82) * OCTANE API MSG: -> failed to load module
00:01:48 (0108.82) * OCTANE API MSG: device 2: failed to load compiled OSL code:
00:01:48 (0108.82) * OCTANE API MSG: device 2: failed to compile module 0
00:01:48 (0108.82) * OCTANE API MSG: CUDA error 999 on device 4: unknown error
00:01:48 (0108.82) * OCTANE API MSG: -> failed to link to cubin
00:01:48 (0108.82) * OCTANE API MSG: device 4: failed to compile OSL code:
00:01:48 (0108.82) * OCTANE API MSG: device 4: failed to compile module 0
00:01:49 (0109.07) * OCTANE API MSG: CUDA error 2 on device 1: out of memory
00:01:49 (0109.07) * OCTANE API MSG: -> failed to allocate device memory
00:01:49 (0109.07) * OCTANE API MSG: CUDA error 2 on device 0: out of memory
00:01:49 (0109.07) * OCTANE API MSG: -> failed to allocate device memory
All the cards do have around 11gb free, so this shouldn't happen.
Here's where it gets odd, if I disable all the 2080ti's IPR will open just fine, no errors & the scene will also render its 240 frames without any problems. The same goes if I disable the 1080ti's. So it seems mixing cards is somehow causing a cuda crash.
I have tested the same scene with just 1080ti's & just the 2080ti's and the renders completed without issue. I've also exported an orbx file of the same scene & tested the standalone. With all cards running the standalone renders just fine, no crash. The problem seems to only be on the plug-in side.
My system specs are:
AMD Ryzen Threadripper 1950X 16 core CPU 3.4GHz
32 GB System ram.
Win 10 Pro 64 bit OS.
1x 1200wat PSU
1x 1080ti & 1 x 2080ti direct to the MB.
Asus X339 Zenith Extreme motherboard.
Nividia drivers tested 417.22 & 417.71
Octane 4.01 & Octane 2018 current build. Both same failure.
2x 1200wat PSU to power Amfeltec cluster & cards.
Amfeltec GPU clusters with 3x 2080ti & 1x 1080ti.
When I load a scene that takes around 5gb of GPU ram Octane fails to render in both IPR & F10.
I get the following cuda errors:
00:01:47 (0107.89) | Reading preview image successfully
00:01:47 (0107.89) | [profile] Function "GetPreviewImage" over "" execution time: 0.004 seconds
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 2: out of memory
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 0: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 1: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) * OCTANE API MSG: CUDA error 2 on device 4: out of memory
00:01:48 (0108.67) * OCTANE API MSG: -> failed to allocate device array
00:01:48 (0108.67) |
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) | +++ IPR RENDER FAILURE +++ processing the failure callback
00:01:48 (0108.67) | ++++++++++++++++++++++++++
00:01:48 (0108.67) |
00:01:48 (0108.67) | IPR: reset image callback
00:01:48 (0108.67) | IPR: wait for the getPreviewImage function
00:01:48 (0108.78) | IPR: free the OpenGL buffers
00:01:48 (0108.78) | IPR: rendering done
00:01:48 (0108.82) * OCTANE API MSG: CUDA error 2 on device 2: out of memory
00:01:48 (0108.82) * OCTANE API MSG: -> failed to load module
00:01:48 (0108.82) * OCTANE API MSG: device 2: failed to load compiled OSL code:
00:01:48 (0108.82) * OCTANE API MSG: device 2: failed to compile module 0
00:01:48 (0108.82) * OCTANE API MSG: CUDA error 999 on device 4: unknown error
00:01:48 (0108.82) * OCTANE API MSG: -> failed to link to cubin
00:01:48 (0108.82) * OCTANE API MSG: device 4: failed to compile OSL code:
00:01:48 (0108.82) * OCTANE API MSG: device 4: failed to compile module 0
00:01:49 (0109.07) * OCTANE API MSG: CUDA error 2 on device 1: out of memory
00:01:49 (0109.07) * OCTANE API MSG: -> failed to allocate device memory
00:01:49 (0109.07) * OCTANE API MSG: CUDA error 2 on device 0: out of memory
00:01:49 (0109.07) * OCTANE API MSG: -> failed to allocate device memory
All the cards do have around 11gb free, so this shouldn't happen.
Here's where it gets odd, if I disable all the 2080ti's IPR will open just fine, no errors & the scene will also render its 240 frames without any problems. The same goes if I disable the 1080ti's. So it seems mixing cards is somehow causing a cuda crash.
I have tested the same scene with just 1080ti's & just the 2080ti's and the renders completed without issue. I've also exported an orbx file of the same scene & tested the standalone. With all cards running the standalone renders just fine, no crash. The problem seems to only be on the plug-in side.
My system specs are:
AMD Ryzen Threadripper 1950X 16 core CPU 3.4GHz
32 GB System ram.
Win 10 Pro 64 bit OS.
1x 1200wat PSU
1x 1080ti & 1 x 2080ti direct to the MB.
Asus X339 Zenith Extreme motherboard.
Nividia drivers tested 417.22 & 417.71
Octane 4.01 & Octane 2018 current build. Both same failure.
2x 1200wat PSU to power Amfeltec cluster & cards.
Amfeltec GPU clusters with 3x 2080ti & 1x 1080ti.