Best Practices For Building A Multiple GPU System

Tutor
Licensed Customer
Posts: 531
Joined: Tue Nov 20, 2012 2:57 pm
Location: Suburb of Birmingham, AL - Home of the Birmingham Civil Rights Institute

I wish all of you a happy and prosperous 2016 (and afterwards).
Notiusweb wrote:Look at this! Love seeing this! I don't know how long this was there...

From Octane Bench:

12x Tesla K80 1 result

Maximum 876.44 Average 876.44
Minimum 876.44 Median 876.44


This card has:

Memory size (GDDR5) : 24GB (12GB per GPU)
CUDA cores: 4992 (2496 per GPU)
Memory bandwidth: 480 GB/sec (240 GB/sec per GPU)
2.91 Tflops double precision performance with NVIDIA GPU Boost


http://www.amazon.com/Nvidia-Accelerato ... B00Q7O7PQA

Maybe they used this - the CA16000 - it can power 16 GPUs at PCIe x16

"Four removable canisters house up to four full-height, full-length, PCIe x16 double-wide GPUs each. The system is powered by three 3000-watt redundant power supplies and includes an IPMI-based system monitor."

http://www.onestopsystems.com/3u-comput ... tesla-gpus

Once upon a time, not too long ago, I was transported to a galaxy far, far away, and this discussion, whimsically started by me, ensued: viewtopic.php?f=40&t=43597&start=200#p242677 . Then, after investigating the prices/performance, I ended up back on Earth here: viewtopic.php?f=40&t=43597&start=210#p242697 , which is where I stand today.

glimpse wrote:Notius, 12 K80s means 6 GPU cards, and the result is pathetic considering the price you have to pay for those =) You cannot justify PRO cards for OctaneRender... at all. That result could be reached for a fraction of the price. Even with 12GB-equipped Titan Xs, 6 of them watercooled would cost no more than 10k, while the system you pointed to, with the GPUs you mentioned, would be at least three to four times more expensive =)

Tutor, there's no way to reach 1000 in OB using 7x GK110 units (if you care about stability), but it's possible using GM200. On the same topic, EK Waterblocks is making a 7-port bridge for GPUs equipped with single-slot waterblocks =) This will allow an easy way to plug in seven cards without the need for EOLed fittings in between.

Here is my latest build in progress: an eGPU box giving 400 in Octane Bench (OCed using Afterburner, in Windows) with 4x 780s & two 240 rads in an mATX case (with GM200-based GPUs like the 980 Ti or Titan X, the result would easily be 600).

I'll write an article about this route, its ins & outs, but from all I have learned so far, this seems to be one of the most logical options I see right now.
image.jpg
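
To put the two benchmark figures quoted above side by side, here is a minimal sketch that divides each Octane Bench total by its GPU count. It assumes, following glimpse's reading, that "12x Tesla K80" means 12 GPUs on 6 cards, and it treats the "400" for the overclocked 4x 780 box as a round figure rather than a logged result. The names `results` and `score_per_gpu` are made up for illustration.

```python
# Octane Bench totals quoted in this thread: (total score, number of GPUs).
# Assumption: "12x Tesla K80" counts GPUs (6 dual-GPU cards), per glimpse's post.
results = {
    "12x Tesla K80 GPUs (6 cards)": (876.44, 12),
    "4x GTX 780, overclocked": (400.0, 4),  # "400" is a rounded figure
}


def score_per_gpu(total_score, gpu_count):
    """Average Octane Bench score contributed by each GPU."""
    return total_score / gpu_count


for name, (score, gpus) in results.items():
    print(f"{name}: {score_per_gpu(score, gpus):.2f} per GPU")
```

On these numbers each K80 GPU contributes roughly 73 points against roughly 100 for each overclocked 780, which is the gap glimpse is pointing at before price even enters the picture.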

There May Come A Point In The Ouch Realm When Self-Reliance Kicks In

I agree with all of your above-quoted points. The build that you have underway looks great. Please keep us apprised of its progress, as it looks like an excellent build for you and others specialising in artificial photography (visualisation) for Architecture, Design and Development. We, as Octane users, have a great diversity of needs, for Octane helps support a very broad family with a myriad of skills and needs.

As to that One-Stop Systems box, referenced by Notiusweb, which holds up to 16 GPUs: the last time that I looked at it with any interest, it cost $24,750 with 4 GTX Titan Xs (the mandatory minimum) and went up to ~ $39,800*/ for 16 Titan Xs. It was, then, one of the lowest-priced options that I could find for a ready-made massive GPU housing system. However, it's headless, i.e., one still needs a computer to connect it to (Ouch!), and One-Stop Systems told me that they, at least then, wouldn't sell it to me with fewer than four pricey GPUs - Double Ouch! Anyone who noted my posts following that could glean that I decided to continue going another route - self-builds. I know they will not be beautiful like your custom build is or like a One-Stop Systems box would be.

I'm sure that you chose a build that suits your business' needs. I run a small business providing corporate training programs (it includes producing 3d animations and AVs) and my costs have to be controlled tightly. If you have some ideas about how I could meet my need for lots of GPUs in as few boxes as possible and at the least possible cost, and still polish them so they would be beautiful like your box, I'm always open to hearing constructive, specific suggestions. I started this journey because I had reached 24 systems, and the workspace, power, cooling, management and software licensing costs were eating at my profits like wolves on a lamb.

Running my many GPUs in as few boxes as possible is one way I see to cut and contain those costs. My clients never see my physical rendering network and care only about the final production - so the way the systems look is immaterial from a profit standpoint. So for now I'll just build my own systems and live with their bare utilitarianism and whatever valuable input others like you care to provide that's consistent with my goals.


*/ Evidently, One-Stop Systems was, at least then, valuing an air-cooled base Titan X at a sales price of over $1,250.00 (USD) [ ($39,800 - $24,750 (that includes 4 Titan Xs)) / 12 = $1,254.17 ]. But maybe that $200+ margin was just to have those other GPUs personally pre-inserted. Mini-Ouch! - a third to the fourteenth time - for the insertion of each of the other twelve GPU cards.
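
The footnote's arithmetic can be checked with a short sketch. The two prices are the ones quoted in the post (the upper one is approximate); the variable names are only illustrative, and nothing here assumes a street price for a Titan X.

```python
# Prices quoted in the post, in USD. The 16-GPU figure is approximate ("~").
base_price = 24_750   # chassis with the 4 mandatory Titan Xs
full_price = 39_800   # same chassis loaded with 16 Titan Xs

extra_gpus = 16 - 4   # the twelve cards added beyond the mandatory four

# Subtract first, then divide: the price difference is spread over the
# twelve additional cards.
price_per_extra_gpu = (full_price - base_price) / extra_gpus

print(f"${price_per_extra_gpu:,.2f} per additional Titan X")
```

The parentheses matter: without them the division by 12 would apply only to the $24,750.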
Because I have 180+ GPU processors in 16 tweaked/multiOS systems - Character limit prevents detailed stats.
Notiusweb
Licensed Customer
Posts: 1285
Joined: Mon Nov 10, 2014 4:51 am

There's an old Amish saying:

"Use as many GPUs and PSUs as you G*d D@mn can..."

However, in the case of the Tesla K80, it's only part of what happened.
Tesla K80.jpg
The scores, to me, show a pattern of benchmarking for specific use levels. I don't think users typically bench their GPUs this way, which leads me to believe this may be some sort of test run, or data logging, for some use other than rendering. Perhaps the manufacturer of a unit holding the GPUs, doing quality control for some system to be delivered??

In any event they have the capacity to hold 12 GPUs, and to run an OS with 12 GPUs. And with what Tutor said...
Evidently, One-Stop Systems was, at least then, valuing an air-cooled base Titan X at a sales price of over $1,250.00 (USD) [ ($39,800 - $24,750 (that includes 4 Titan Xs)) / 12 = $1,254.17 ]. But maybe that $200+ margin was just to have those other GPUs personally pre-inserted. Mini-Ouch! - a third to the fourteenth time - for the insertion of each of the other twelve GPU cards.
...it is only a matter of time before a Titan X package is realized, and with that, it could be up to 16 GPUs.

Thus finally, may I dare say:

A 12x score for Titan X is coming....
Win 10 Pro 64, Xeon E5-2687W v2 (8x 3.40GHz), G.Skill 64 GB DDR3-2400, ASRock X79 Extreme 11
Mobo: 1 Titan RTX, 1 Titan Xp
External: 6 Titan X Pascal, 2 GTX Titan X
Plugs: Enterprise
Tutor
Licensed Customer
Posts: 531
Joined: Tue Nov 20, 2012 2:57 pm
Location: Suburb of Birmingham, AL - Home of the Birmingham Civil Rights Institute

Notiusweb wrote:There's an old Amish saying:

"Use as many GPUs and PSUs as you G*d D@mn can..."

However, in the case of the Tesla K80, it's only part of what happened.
Tesla K80.jpg
The scores, to me, show a pattern of benchmarking for specific use levels. I don't think users typically bench their GPUs this way, which leads me to believe this may be some sort of test run, or data logging, for some use other than rendering. Perhaps the manufacturer of a unit holding the GPUs, doing quality control for some system to be delivered??

In any event they have the capacity to hold 12 GPUs, and to run an OS with 12 GPUs. And with what Tutor said...
Evidently, One-Stop Systems was, at least then, valuing an air-cooled base Titan X at a sales price of over $1,250.00 (USD) [ ($39,800 - $24,750 (that includes 4 Titan Xs)) / 12 = $1,254.17 ]. But maybe that $200+ margin was just to have those other GPUs personally pre-inserted. Mini-Ouch! - a third to the fourteenth time - for the insertion of each of the other twelve GPU cards.
...it is only a matter of time before a Titan X package is realized, and with that, it could be up to 16 GPUs.

Thus finally, may I dare say:

A 12x score for Titan X is coming....
One-Stop Systems’ latest phenom is the GPUltima, which holds 128 GPUs - http://www.onestopsystems.com/gpultima . “The GPUltima is a single 19" rack comprised of 8 OSS High Density Compute Accelerators (HDCA) each with 16 NVIDIA Dual GPUs (128 total*/), 16 dual-socket servers, an Infiniband Switch and an Ethernet Switch.” But, given the price for a single 16 GPU chassis [ http://www.onestopsystems.com/4u-comput ... tx-titan-x ], the GPUltima’s price, about which I haven’t had the nerve or interest to ask, would likely trigger, at least in me, terminal “Ouch.”

One-Stop Systems’ focus appears to be on the Tesla market, which we 3D artists aren’t usually interested in because Teslas do not perform as well with current 3D software as the GTXs do [ see, e.g., https://render.otoy.com/octanebench/res ... ingleGPU=0 ]. I agree that the company could be testing its Tesla systems, since those are the products One-Stop sells that appear to hold the greatest moneymaking potential for the company. The company could subsequently do the same for its maximally overpriced Titan systems, and thus a 12x score for Titan X could come sooner thanks to One-Stop Systems.

However, I’m not focused on double-precision floating-point tasks in my 3D/AV work (although I'm in the process of learning CUDA coding in all of its facets). So if One-Stop were testing one of its systems with Teslas, then in addition to questioning the wisdom of its pricing for a Titan X system, I’d also question its wisdom in using Octane to bench Tesla performance. Double-precision users would likely pass up a Titan X for a Titan-Z or even the original GTX Titan (both of which have vastly superior double-precision floating-point prowess over any Maxwell), or just wait for the Pascals and Voltas, which are said to be the ultimate floating-point monsters. There are much better tests for higher-precision floating-point prowess. Most knowledgeable Tesla owners probably don't care about the measurements Octane Bench yields, although Octane (and other 3D app) users (or potential users) might be led to make better purchasing decisions by being armed with the knowledge that for their use GTXs are better purchases.

*/ I admit that I could use the GPUltima fully loaded, if it were priced within the realm of reasonableness, to house my 108 CUDA GPU rendering processors (excluding my 7 GT 640 4Gs) and 12 hulking OpenCL-only processors. [Thus, I currently have 127 compute processors counting the 7 GT 640 4Gs, and I anticipate having even more when we get the Pascals beginning this year and, in 2018, begin getting the Voltas.] Very likely, my distant Amish cousins have long been shouting to me in my sleep - "Use as many GPUs and PSUs as you G*d D@mn can...".
Notiusweb
Licensed Customer
Posts: 1285
Joined: Mon Nov 10, 2014 4:51 am

LOL
the GPUltima’s price, about which I haven’t had the nerve or interest to ask, would likely trigger, at least in me, terminal “Ouch."
And you wrapped it up nicely by boomeranging the quote at the end :lol:
smicha
Licensed Customer
Posts: 3151
Joined: Wed Sep 21, 2011 4:13 pm
Location: Warsaw, Poland

https://www.youtube.com/watch?v=LXOaCkb ... ploademail

7 water-cooled GPUs in the case I keep saying is the best. If I had a choice, I'd exchange my SMH10 for an S8.
3090, Titan, Quadro, Xeon Scalable Supermicro, 768GB RAM; Sketchup Pro, Classical Architecture.
Custom alloy powder coated laser cut cases, Autodesk metal-sheet 3D modelling.
build-log http://render.otoy.com/forum/viewtopic.php?f=9&t=42540
Tutor
Licensed Customer
Posts: 531
Joined: Tue Nov 20, 2012 2:57 pm
Location: Suburb of Birmingham, AL - Home of the Birmingham Civil Rights Institute

smicha wrote:https://www.youtube.com/watch?v=LXOaCkbt4lI&feature=em-uploademail

7 water-cooled GPUs in the case I keep saying is the best. If I had a choice, I'd exchange my SMH10 for an S8.
Mindblowing rig!
Notiusweb
Licensed Customer
Posts: 1285
Joined: Mon Nov 10, 2014 4:51 am

Re: Best Practices For Building A Multiple GPU System
Postby smicha » Sun Jan 03, 2016 9:07 am
https://www.youtube.com/watch?v=LXOaCkb ... ploademail

7 water-cooled GPUs in the case I keep saying is the best. If I had a choice, I'd exchange my SMH10 for an S8.
Watercooled TITAN+3x780_6GB@1300mHz/7000mHz, [email protected], P8P67WS, 32GB, 256GB, 2x2TB WD EARX, 1350W
CASE-LABS SMH10, EK, Airplex_480, SR1_560, 20xNB_PL2, 2xD5, Aquaero 6XT
build-log http://render.otoy.com/forum/viewtopic.php?f=9&t=42540
He could add a second 1600 W PSU, running it through an extension cord to another breaker, and have a rendering beast as well with 7 Titan Xs.

Smicha, S8 to add more GPUs?
smicha
Licensed Customer
Posts: 3151
Joined: Wed Sep 21, 2011 4:13 pm
Location: Warsaw, Poland

S8 'cause it's half of the size of STH10 and can handle 3x360 rads easily. Plus a pedestal is available.
Notiusweb
Licensed Customer
Posts: 1285
Joined: Mon Nov 10, 2014 4:51 am

Smicha, might you add a rad outside the case, in the pedestal? Why let the case stop you ;) .
Or does the case itself mark the boundary, as it were, for adding components?
smicha
Licensed Customer
Posts: 3151
Joined: Wed Sep 21, 2011 4:13 pm
Location: Warsaw, Poland

2x 360 in the pedestal :) and pedestal over a pedestal is possible :)