Best Practices For Building A Multiple GPU System

smicha
Licensed Customer
Posts: 3151
Joined: Wed Sep 21, 2011 4:13 pm
Location: Warsaw, Poland

Itou,

I've heard that there is some software (?) that fixes the VRAM issue for Titans, on Win10 only. So maybe this is related to the new driver. Today I'll test an 11-GPU 1080 Ti machine with Win10 and driver 391 and will let you know if everything works fine.
3090, Titan, Quadro, Xeon Scalable Supermicro, 768GB RAM; Sketchup Pro, Classical Architecture.
Custom alloy powder coated laser cut cases, Autodesk metal-sheet 3D modelling.
build-log http://render.otoy.com/forum/viewtopic.php?f=9&t=42540
itou31
Licensed Customer
Posts: 377
Joined: Tue Jan 22, 2013 8:43 am

Thanks Smicha,
Do you know the name of this software, or where you saw it?
I think your 11 1080 Ti GPUs will be fine with the new 391 drivers, as they are all the same generation.
i7-3930K, 64 GB RAM, Win8.1 Pro, main 3 Titans + 780 Ti
Xeon 2696V3, 64 GB RAM, Win8.1/Win10/Win7, 2x 1080 Ti + 3x 980 Ti + 2x Titan Black
Notiusweb
Licensed Customer
Posts: 1285
Joined: Mon Nov 10, 2014 4:51 am

Oh my goodness...I missed these discussions!

Hope everyone is well, I have been messing with PBR and Unreal Engine. Okay, back to this...

itou31, I once had an issue like this after installing and then reverting Nvidia drivers. Are any of the cards in your rig overclocked, for example one 980Ti that is not OC next to another one that is? Sometimes it is the profile in the registry that gets screwy, not necessarily the driver.
I have a hunch that if you erased the registry entries for all cards except the main mobo GPU card and then installed each card back in one by one, it would re-establish a working profile. To find the entries:

1) Open regedit.

2) Navigate to the following key:
HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\Class\{4D36E968-E325-11CE-BFC1-08002BE10318}

3) You will see keys for each video card starting with "0000" and then "0001", etc.

Sometimes when doing it all at once, the registry is like, "Sorry 980Ti-B, this slot is for 980Ti-A, not 980Ti-B", meanwhile "A" is sitting in "B's" slot, but working. And like a Catch-22, you can't install it.

I guess if worst comes to worst you could try this; it is not at all guaranteed. But logically, it would allow more systematic, step-by-step troubleshooting.
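
For anyone who would rather look at those keys before touching anything, here is a minimal read-only sketch in Python. It assumes Windows and the standard `winreg` module; the function names are only illustrative, and reading the `DriverDesc` value to label each entry is an assumption about what is useful to show. It lists the entries and deletes nothing.

```python
import re
import sys

# Display adapter class key quoted in the steps above (relative to HKEY_LOCAL_MACHINE).
DISPLAY_CLASS_KEY = (
    r"SYSTEM\CurrentControlSet\Control\Class"
    r"\{4D36E968-E325-11CE-BFC1-08002BE10318}"
)


def adapter_subkeys(names):
    """Keep only the four-digit instance keys ("0000", "0001", ...), sorted."""
    return sorted(n for n in names if re.fullmatch(r"\d{4}", n))


def list_display_adapters():
    """Return (subkey, description) pairs. Read-only, Windows only."""
    import winreg  # imported here so the helper above also works on other systems

    results = []
    with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, DISPLAY_CLASS_KEY) as cls:
        subkey_count = winreg.QueryInfoKey(cls)[0]
        names = [winreg.EnumKey(cls, i) for i in range(subkey_count)]
        for name in adapter_subkeys(names):
            try:
                with winreg.OpenKey(cls, name) as sub:
                    desc, _ = winreg.QueryValueEx(sub, "DriverDesc")
            except OSError:
                # Value missing or key not readable without elevation.
                desc = "(no readable DriverDesc)"
            results.append((name, desc))
    return results


if __name__ == "__main__" and sys.platform == "win32":
    for name, desc in list_display_adapters():
        print(name, desc)
```

Exporting the key from regedit before deleting any entry keeps a way back if the cleanup makes things worse.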
Sorry you are experiencing this....
I'll say it for you....
F&%$ !
Win 10 Pro 64, Xeon E5-2687W v2 (8x 3.40GHz), G.Skill 64 GB DDR3-2400, ASRock X79 Extreme 11
Mobo: 1 Titan RTX, 1 Titan Xp
External: 6 Titan X Pascal, 2 GTX Titan X
Plugs: Enterprise
itou31

Hi Notiusweb,
welcome back to this thread!
No, I don't want to add them one by one: I'm on water cooling with 7 GPUs in one block.
I could try the registry.
But when I do a fresh install (so a new registry), I have the same issue with the new drivers (I tested every WHQL release from 387.92 to 391.35, with a DDU run each time).
I'm ready to stay with only 6 GPUs for Octane 4 and wait for the 1080 Ti to get cheaper.

Notiusweb

What if you only unplug the power cables from the PSU, but not from the cards themselves, and then run the rig?
You wouldn't have to unplug anything PCIe-wise or water-tubing-wise. The unpowered cards would just sit there inactive, and you could install them one by one:

1) Power off, plug card 1's cables into the PSU, power on and install card 1,
2) then power off, plug card 2's cables into the PSU, power on and install card 2, with card 1 running,
3) then power off, plug card 3's cables into the PSU, power on and install card 3, with cards 1 and 2 running,
Etc...

That's how I did it in troubleshooting.
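
One way to confirm which cards the driver actually sees after each power-on is to ask `nvidia-smi`. This is only a sketch: it assumes `nvidia-smi` is on the PATH and uses its `--query-gpu` and `--format=csv,noheader` options; the helper names and the `expected_count` parameter are made up for illustration.

```python
import subprocess


def parse_gpu_list(csv_text):
    """Parse 'index, name, pci.bus_id' CSV rows into a list of dicts."""
    gpus = []
    for line in csv_text.strip().splitlines():
        parts = [p.strip() for p in line.split(",")]
        if len(parts) != 3:
            continue  # skip blank or unexpected lines
        index, name, bus_id = parts
        gpus.append({"index": int(index), "name": name, "bus_id": bus_id})
    return gpus


def visible_gpus():
    """Ask nvidia-smi which GPUs the driver currently sees."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=index,name,pci.bus_id",
         "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    )
    return parse_gpu_list(result.stdout)


def report(expected_count):
    """Print the visible cards and compare against how many should be powered."""
    gpus = visible_gpus()
    for gpu in gpus:
        print(f"{gpu['index']}: {gpu['name']} ({gpu['bus_id']})")
    print(f"{len(gpus)} of {expected_count} expected cards visible")
```

After step 3, for example, `report(3)` should list three cards if each install went through.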

Oh....On the Octane 4 side, yeah, it will be interesting to see what power the more developed uber-material Denoising Brigade machine can do!

itou31

Oh, thanks for the advice.
I didn't know a card could be disabled that way. I will try that.

smicha

Guys,

Has anyone managed to pass a PCI device (GPU) through from Linux (Ubuntu, Mint, ...) into a Windows virtual machine?

itou31

Hi Notiusweb,

I could not disable the GPUs by disconnecting their power. They still show up in Windows, and I have the same issue with the 391.35 drivers.
I think the culprit is the 780 Ti sitting among the 3 Titans. So I'm staying with the 385.69 drivers, which work with all GPUs....

itou31

Hi,
I finally fixed the problem, so now I can use 3.08 RC4 and test 4.00 XB1 with all GPUs! :D
Sorry, it was entirely due to my overclocking mod on the Titan (on the 4-GPU rig); on the 7-GPU rig, the 980 Ti that I bought also has a modded vBIOS.
I reverted to the original vBIOS on the Titan and the 980 Ti, and now the update to the 387.92 drivers works fine (enough to have CUDA 9.1).
So every driver I tested after 385.69 fails with a power-modded vBIOS.
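
If you keep a modded vBIOS and want a quick guard before updating, the comparison against 385.69 can be sketched like this. The function and constant names are hypothetical, and the cutoff is only what was observed on these two rigs, not a documented NVIDIA limit.

```python
def parse_driver_version(text):
    """Turn a version string such as '387.92' into a comparable tuple (387, 92)."""
    major, minor = text.strip().split(".")
    return int(major), int(minor)


# Last driver reported in this thread to work with the power-modded vBIOS.
LAST_WORKING_WITH_MOD = parse_driver_version("385.69")


def newer_than_last_working(version):
    """True if this driver is later than the last one that worked with the mod."""
    return parse_driver_version(version) > LAST_WORKING_WITH_MOD
```

Comparing integer tuples instead of floats keeps the major and minor parts separate, so the result does not depend on how many digits the minor part has.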

smicha

Thanks for sharing.