coilbook wrote:One more thing
We use netstor box with 4 GPUs and we've been having where 1-2 GPUs would stop rendering and the other 2 continue. Never had this problem in version 2
can it be because of this:
"Since the beginning of Octane the integration kernels had one CUDA thread calculate one complete sample. We changed this for various reasons, the main one being the fact that the integration kernels got really huge and impossible to optimize. Also OSL and OpenCL are pretty much impossible to implement this way. To solve the problem, we split the big task of calculating a sample into smaller steps which are then processed one by one by the CUDA threads. I.e. there are a lot more kernel calls are happening than in the past."
parallel samples value is too high maybe some cards get stuck calculating longer and fail.
Hey,
One potential issue you could be having is with some stability issues with the latest CUDA drivers. Try lowering the CUDA drivers on your master and slave to 358.50, and let me know how that goes.
Before you do that, here are some logging files that will help to get more valuable information. Download the attached files and copy octane_log_flags file into the standalone install directory on the master and slave machine. Also copy the octane_daemon_log_flags file into the install directory of the slave machine as well. This will output information into a file named "octane_log". So far, it is not easy to reliably duplicate your issue based on the info you provided. The more information we can use, the better. Also, let me know if reverting your CUDA drivers to 358.50 resolves your issue.
Also