New Improved Gravational Wave App - Discussion

mikey
mikey
Joined: 22 Jan 05
Posts: 12681
Credit: 1839085411
RAC: 3872

pututu wrote: Fixed the core

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

pututu
pututu
Joined: 6 Apr 17
Posts: 63
Credit: 653417392
RAC: 8

mikey wrote:pututu

mikey wrote:

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

 

Mine is this.

It has 4096 bit memory bus, thus higher memory bandwidth (that helps a lot to speed up read/write access between the gpu chip and vram). Not certain if the higher FP64 matters much for this OA3S task but perhaps is does (just an observation) since I read here that the 4090 even with higher memory bandwidth can only do around 600 secs per task, similar to P100 which is already 3 generations behind. Maybe someone else here can shed a light if FP64 compute is used in O3AS.

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3945
Credit: 46765032642
RAC: 64067719

No. A P100 is the

No. A P100 is the Nvidia Tesla P100 data center card. Nvidia names all their generational data center flagships like this, P100/V100/A100/etc. A P100 is like a 1080Ti with better FP64 and HBM memory. 
 

your P104-100 is a mining GPU, which is basically just a 1070 with GDDR5X memory. 

_________________________________________________________________________

mikey
mikey
Joined: 22 Jan 05
Posts: 12681
Credit: 1839085411
RAC: 3872

pututu wrote: mikey

pututu wrote:

mikey wrote:

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

 

Mine is this.

It has 4096 bit memory bus, thus higher memory bandwidth (that helps a lot to speed up read/write access between the gpu chip and vram). Not certain if the higher FP64 matters much for this OA3S task but perhaps is does (just an observation) since I read here that the 4090 even with higher memory bandwidth can only do around 600 secs per task, similar to P100 which is already 3 generations behind. Maybe someone else here can shed a light if FP64 compute is used in O3AS. 

Ahhh that's a very nice card!!

mikey
mikey
Joined: 22 Jan 05
Posts: 12681
Credit: 1839085411
RAC: 3872

Ian&Steve C. wrote: No. A

Ian&Steve C. wrote:

No. A P100 is the Nvidia Tesla P100 data center card. Nvidia names all their generational data center flagships like this, P100/V100/A100/etc. A P100 is like a 1080Ti with better FP64 and HBM memory. 
 

your P104-100 is a mining GPU, which is basically just a 1070 with GDDR5X memory.  

Ahhh I hate when they do that stuff with the names, it's almost like they want to confuse people.

pututu
pututu
Joined: 6 Apr 17
Posts: 63
Credit: 653417392
RAC: 8

mikey wrote:pututu

mikey wrote:

pututu wrote:

mikey wrote:

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

 

Mine is this.

It has 4096 bit memory bus, thus higher memory bandwidth (that helps a lot to speed up read/write access between the gpu chip and vram). Not certain if the higher FP64 matters much for this OA3S task but perhaps is does (just an observation) since I read here that the 4090 even with higher memory bandwidth can only do around 600 secs per task, similar to P100 which is already 3 generations behind. Maybe someone else here can shed a light if FP64 compute is used in O3AS. 

Ahhh that's a very nice card!!

 

The P100 has dropped a lot in price over the past year or so. You can get one in ebay for USD160ish. This card is only good for project requiring high memory bandwidth and FP64, so has limited use in BOINC. In the past, I used it in Milkyway and maybe will use it in gpugrid  (quantum chemistry) when tasks are available. This version has no active cooling. Certainly good for O3AS but the high credit will eventually revert within two months or so.

DF1DX
DF1DX
Joined: 14 Aug 10
Posts: 105
Credit: 3852196854
RAC: 4881545

After a short test yesterday

After a short test yesterday with about 270 GW-WUs, I can't report a single crash on my host. Linux, Nvidia 4090, driver 535.129.03. No problem here.

Two WUs at the same time results in more than double the runtime, a delayed start helps, but requires constant manual control.

i am also waiting for the Nvidia version.

Minor nitpick: I still have two orphaned wus (741768987 and 741769147) from the first test last july in my tasklist.

mikey
mikey
Joined: 22 Jan 05
Posts: 12681
Credit: 1839085411
RAC: 3872

pututu wrote: mikey

pututu wrote:

mikey wrote:

pututu wrote:

mikey wrote:

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

 

Mine is this.

It has 4096 bit memory bus, thus higher memory bandwidth (that helps a lot to speed up read/write access between the gpu chip and vram). Not certain if the higher FP64 matters much for this OA3S task but perhaps is does (just an observation) since I read here that the 4090 even with higher memory bandwidth can only do around 600 secs per task, similar to P100 which is already 3 generations behind. Maybe someone else here can shed a light if FP64 compute is used in O3AS. 

Ahhh that's a very nice card!!

 

The P100 has dropped a lot in price over the past year or so. You can get one in ebay for USD160ish. This card is only good for project requiring high memory bandwidth and FP64, so has limited use in BOINC. In the past, I used it in Milkyway and maybe will use it in gpugrid  (quantum chemistry) when tasks are available. This version has no active cooling. Certainly good for O3AS but the high credit will eventually revert within two months or so.

Thanks I will look but the passive cooling is not a good thing for me.

Tom M
Tom M
Joined: 2 Feb 06
Posts: 6439
Credit: 9568713795
RAC: 8576232

mikey wrote:Do you mean a

Post deleted.

A Proud member of the O.F.A.  (Old Farts Association).  Be well, do good work, and keep in touch.® (Garrison Keillor)  I want some more patience. RIGHT NOW!

Tom M
Tom M
Joined: 2 Feb 06
Posts: 6439
Credit: 9568713795
RAC: 8576232

mikey wrote:e]pututu

mikey wrote:

pututu wrote:

mikey wrote:

pututu wrote:

mikey wrote:

pututu wrote:

Fixed the core clock to 4.2GHz paired with Tesla P100 for this host. No other cpu project is running.

With 3 tasks per gpu, average run time is about 630 secs when staggering the 3 tasks. Run to run variation is small (look for those between 1880 - 1890 secs range). About 1.37M PPD. GPU board power fluctuates from 80W to 130W. 

Do you mean a gpu like this: NVIDIA NVIDIA P104-100 (8117MB) because that one, mine, is running them in 1700 seconds for each task running just 1 at a time. My core clock is nowhere near where yours is!!

 

Mine is this.

It has 4096 bit memory bus, thus higher memory bandwidth (that helps a lot to speed up read/write access between the gpu chip and vram). Not certain if the higher FP64 matters much for this OA3S task but perhaps is does (just an observation) since I read here that the 4090 even with higher memory bandwidth can only do around 600 secs per task, similar to P100 which is already 3 generations behind. Maybe someone else here can shed a light if FP64 compute is used in O3AS. 

Ahhh that's a very nice card!!

 

The P100 has dropped a lot in price over the past year or so. You can get one in ebay for USD160ish. This card is only good for project requiring high memory bandwidth and FP64, so has limited use in BOINC. In the past, I used it in Milkyway and maybe will use it in gpugrid  (quantum chemistry) when tasks are available. This version has no active cooling. Certainly good for O3AS but the high credit will eventually revert within two months or so.

Thanks I will look but the passive cooling is not a good thing for me.

I ran across some after-market GPU cooling for passively cooled cards.  Maybe one fits the P100?

A Proud member of the O.F.A.  (Old Farts Association).  Be well, do good work, and keep in touch.® (Garrison Keillor)  I want some more patience. RIGHT NOW!

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.