Thanks Mikey. Understood, but I am running only two projects: Milky and Ein. I tried to balance all five hosts running both (CPU on all for Milky and GPU on all for Ein) and I ended up killing my production for both Milky and Ein. I am going back to Ein only on four hosts and Milky on one host. That should keep my head above water for Ein. I think in my case I just need to save my pennies and get a host with a bunch of CPUs/cores dedicated to Milky...
Just a tip, since I'm not exactly sure what CPU cores/threads you are using... on each of your 5 computers.
Try this before you start spending money on something you won't need for a while.
For Milkyway CPU tasks you may want to run them at less than your full CPU thread count, maybe half... and still run Einstein GPU tasks on your GPUs. The reason will become apparent in a moment.
To get the most production out of your GPUs, run them at one task per GPU at a time to get a baseline of the time required to complete and validate tasks on each GPU. I would suggest getting 10 or more tasks completed and validated, then averaging them for a baseline.
Once you have a baseline, change to two tasks per GPU, check how long they take, and divide the time by 2 (again, 10 or more tasks). If that is still less than your baseline, change up to three tasks, check the times, and divide by three to compare with the baseline (same, 10 or more).
Continue with this strategy until running multiple tasks per GPU gives you a longer per-task time than your baseline. When that happens, revert to one fewer task per GPU to recover the best per-task time on all GPUs.
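The comparison above can be sketched in a few lines. The run times below are made-up examples, not real Einstein@Home numbers:

```python
# Compare average per-task time at each tasks-per-GPU setting
# against the 1x baseline (times are hypothetical, in seconds).

def per_task_time(run_times, tasks_per_gpu):
    """Average elapsed time per task, normalized by how many ran at once."""
    return sum(run_times) / len(run_times) / tasks_per_gpu

baseline = per_task_time([1800, 1820, 1790], 1)   # 1 task per GPU
two_up   = per_task_time([3100, 3150, 3080], 2)   # 2 tasks per GPU
three_up = per_task_time([5600, 5700, 5650], 3)   # 3 tasks per GPU

for n, t in ((1, baseline), (2, two_up), (3, three_up)):
    print(f"{n}x: {t:.0f} s per task")

# Keep the highest multiplicity whose per-task time is still
# below the baseline; with these numbers 2x wins and 3x loses.
```

With these invented numbers, 2x beats the baseline (1555 s vs ~1803 s per task) while 3x is slower (~1883 s), so you would settle on two tasks per GPU.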
After doing the above, you can increase Milkyway tasks on each computer's CPU, one thread at a time, until your GPU task times start to increase. At that point, reduce your CPU thread count by one to get back to the GPU time per task you had before.
This could take a day or two to accomplish, but it should give you the best times for both GPU and CPU on each of your computers.
Of course this assumes you are not using any of the computers as a daily driver. If you are, I would suggest reducing the CPU usage on that particular computer by two threads.
If you are not sure how to change the number of tasks per CPU and GPU, or how to check the times for tasks, just ask.
George, thanks for your detailed thoughts and suggestions. A number of months ago I did attempt to change the number of tasks for CPUs and GPUs, and the only thing that seemed to happen was messages that I was running multiple instances of BOINC. What I did could be similar to a self-inflicted gunshot wound... LOL. So, some instruction for that would be most appreciated. You already spent a lot of time on your last message, and I thank you for that. JB
Changing the number of WUs per GPU should not cause multiple clients to run. You just need to change the GPU utilization factor on the Project Preferences page: 1.0 for one WU per GPU, 0.5 for two WUs per GPU, 0.33 for three WUs per GPU, etc.
https://einsteinathome.org/account/prefs/project
At the bottom of that page is the configuration for each GPU application that Einstein offers.
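For reference, the factor is just the reciprocal of how many WUs you want per GPU; a tiny sketch (the helper name is made up):

```python
# GPU utilization factor = 1 / (desired WUs per GPU),
# rounded to two places as entered on the preferences page.
def gpu_utilization_factor(wus_per_gpu):
    return round(1.0 / wus_per_gpu, 2)

print(gpu_utilization_factor(1))  # 1.0
print(gpu_utilization_factor(2))  # 0.5
print(gpu_utilization_factor(3))  # 0.33
```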
Mikey, more specific to your comment: yes, I can leave the MX550 host on Ein only, so that is a good suggestion. I have had a problem balancing usage of my hosts with Milky going CPU-only and Ein doing GPU. Perhaps part of my problem is that Ein GPU tasks also use at least some CPU time? My overall performance took a big hit from Milky/CPU and Ein/GPU conflicts.
Note that running BRP7 usually takes considerably less CPU (0.2 CPU in my example) than ASGWS-O3.
Also, I run my ASGWS-O3 with only 0.9 CPU, in order to keep other CPU tasks running (on WCG in my example).
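Those fractional CPU reservations add up across concurrent GPU tasks; a quick sketch with assumed numbers (the 0.9 figure mirrors the ASGWS-O3 setting above, the host size is invented):

```python
# How many CPU threads remain for CPU-only projects once each
# running GPU task reserves a fraction of a CPU (assumed values).
def free_cpu_threads(total_threads, gpu_tasks, cpu_per_gpu_task):
    return total_threads - gpu_tasks * cpu_per_gpu_task

# 8-thread host, two GPU tasks each reserving 0.9 of a CPU:
print(free_cpu_threads(8, 2, 0.9))  # ~6.2 threads left for Milkyway/WCG
```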
Every GPU task uses some CPU resources.
I know from previous experience that running Intel iGPUs slows CPU processing speeds when both run at the same time.
I suspect this is a limitation caused by the iGPU running on CPU memory and other bandwidth limits.
AMD iGPUs may have the same architectural limitations.
Tom M
A Proud member of the O.F.A. (Old Farts Association). Be well, do good work, and keep in touch.® (Garrison Keillor) I want some more patience. RIGHT NOW!
I have been poking around trying to find an explanation for how the ACTIVE THREAD PERCENTAGE parameter affects things.
CUDA_MPS_ACTIVE_THREAD_PERCENTAGE
On Volta GPUs, this environment variable sets the portion of the available threads that can be used by the client contexts. The limit can be configured at different levels.
So what does this actually mean?
In simple terms, it allocates a percentage of the GPU's available threads or SMs. Say you have 5000 threads (cores) on the GPU and you set the active thread percentage to 50%: each process will only use up to 2500 threads.
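As a sketch (the thread count is assumed; the environment variable name is from the MPS documentation quoted above):

```python
import os

# Limit each MPS client context to a fraction of the GPU's threads.
def client_thread_limit(total_gpu_threads, active_thread_percentage):
    return int(total_gpu_threads * active_thread_percentage / 100)

# Set before launching the client process (the value is a string).
os.environ["CUDA_MPS_ACTIVE_THREAD_PERCENTAGE"] = "50"

print(client_thread_limit(5000, 50))  # 2500, matching the example above
```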
So 70% would provide (potentially) higher/faster processing than 40%? And 100% would basically put you back to straight round robin time slicing?
A solid case can be made that a Titan V is more "efficient" (uses less electricity for a given level of production) than an EVGA RTX 3080 Ti. But what happens to the RTX 3080 Ti's production if you were to power-limit it to, say, 200 watts?
My Titan Vs are running at ~150 watts or less. If I can get equivalent production at 200 watts on my RTX 3080 Ti running the same number of threads....
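Production-per-watt is the figure of merit here; a sketch with entirely made-up production numbers, just to show the comparison:

```python
# Efficiency = work produced per watt drawn. The task counts
# below are invented for illustration, not measured results.
def tasks_per_watt(tasks_per_day, watts):
    return tasks_per_day / watts

titan_v    = tasks_per_watt(60, 150)  # hypothetical Titan V at 150 W
rtx_3080ti = tasks_per_watt(80, 200)  # hypothetical 3080 Ti capped at 200 W

# The power-limited card breaks even only if its tasks/day hold up at 200 W.
print(f"Titan V: {titan_v:.2f}, 3080 Ti: {rtx_3080ti:.2f} tasks/day/W")
```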
Experiment has started.
You have to test to find out.
If you were only running one task at a time, then yes, 70% would probably be faster. But running 4-5 tasks at once, I think 40% is faster.