All-Sky Gravitational Wave Search on O3 data (O3ASHF1)

maeax

Joined: 6 Aug 12

Posts: 23

Credit: 1536970788

RAC: 1667629

Bernd Machenschalk

11 Dec 2024 3:22:22 UTC

Message 230837 in response to message 230821

(moderation:

)

Bernd Machenschalk wrote:

Indeed the new tasks should earn you 20k. Fixed, also for previously generated workunits.

br3achd

Joined: 14 Jun 24

Posts: 8

Credit: 143261562

RAC: 742921

maeax wrote:Bernd

11 Dec 2024 3:43:03 UTC

Message 230838 in response to message 230837

(moderation:

)

maeax wrote:

Bernd Machenschalk wrote:

Indeed the new tasks should earn you 20k. Fixed, also for previously generated workunits.

This could easily mean previously generated but not yet completed, submitted, and/or validated workunits, especially since if you look at your task list, they still have the 1-4k credit assigned to them and not 20k.

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4116

Credit: 49145237424

RAC: 32184959

yes I'm starting to see the

11 Dec 2024 3:38:47 UTC

Message 230839

(moderation:

)

yes I'm starting to see the 20k tasks now. many thanks :)

_________________________________________________________________________

Bernd Machenschalk

Moderator

Administrator

Joined: 15 Oct 04

Posts: 4346

Credit: 252831396

RAC: 40936

In short, all tasks granted

11 Dec 2024 7:40:55 UTC

Message 230844

(moderation:

)

In short, all tasks granted credit from now on will get you 20k, regardless of how old the workunits are. Credit of tasks that were already granted credit will not be changed, that would be too much effort. But tasks pending or newly replicated from older workunits will get the new credit.

GWGeorge007

Joined: 8 Jan 18

Posts: 3172

Credit: 5124146723

RAC: 3326921

Bernd Machenschalk wrote: In

11 Dec 2024 20:37:00 UTC

Message 230871 in response to message 230844

(moderation:

)

Bernd Machenschalk wrote:

In short, all tasks granted credit from now on will get you 20k, regardless of how old the workunits are. Credit of tasks that were already granted credit will not be changed, that would be too much effort. But tasks pending or newly replicated from older workunits will get the new credit.

Thank you for the clarification!

George

Proud member of the Old Farts Association

Beyond

Joined: 28 Feb 05

Posts: 124

Credit: 2660550974

RAC: 4636648

Thanks Bernd and team, the

12 Dec 2024 2:50:33 UTC

Message 230881 in response to message 230844

(moderation:

)

Thanks Bernd and team, the new O3AS WUs seem to be running nicely.

David Crick

Joined: 8 Aug 24

Posts: 10

Credit: 2720053

RAC: 175

A combination of me upgrading

17 Dec 2024 1:03:03 UTC

Message 231023

(moderation:

)

A combination of me upgrading my Fedora installation, the latest 565.77 release of the Nvidia Linux drivers being available for it [https://rpmfusion.org/Howto/NVIDIA], plus the new low frequency O3ASHF1 WUs having started all came together to make me test E@H under these new circumstances.

Confirming what others have previously reported and to add a further data point, the newer WUs take approx. 2.34x longer on my 2080 Super. The doubling of credit from 10,000 to 20,000 therefore almost balances that out.

KLiK

Joined: 1 Apr 14

Posts: 80

Credit: 496969152

RAC: 931645

David Crick wrote: A

23 Dec 2024 1:12:15 UTC

Message 231273 in response to message 231023

(moderation:

)

David Crick wrote:

A combination of me upgrading my Fedora installation, the latest 565.77 release of the Nvidia Linux drivers being available for it [https://rpmfusion.org/Howto/NVIDIA], plus the new low frequency O3ASHF1 WUs having started all came together to make me test E@H under these new circumstances.

Confirming what others have previously reported and to add a further data point, the newer WUs take approx. 2.34x longer on my 2080 Super. The doubling of credit from 10,000 to 20,000 therefore almost balances that out.

This is exactly what I also see on my cards...tasks quite a bit longer, some 3x as much!

Also, Tthrottle has downgraded my GPU contribution...as GPU is heated more with new tasks, from ~2/3 to ~1/4 (or ~25%)...so watch out for the cooling also!

non-profit org. Play4Life in Zagreb, Croatia, EU

Mad_Max

Joined: 2 Jan 10

Posts: 165

Credit: 2251304332

RAC: 597498

The problem with the work of

29 Dec 2024 0:32:51 UTC

Message 231439

(moderation:

)

Look like the problem with the work of the old (OpenCL 1.2, without OpenCL 2.0 support) GPUs in the O3AS project have not been solved?

Some users (including me) wrote about it a year and two years ago about all previous GW apps - the E@H GW applications launches, it seems to work correctly, but it is EXTREMELY slow. And by extremely, I mean about hundreds or even few thousand times slower than can be expected from a comparison of technical characteristics of hardware.

For example, AMD RX570 (GCN 4.0 micro-architecture, supports OpenCL 2.0) performs one task in about 1.5 hours, provided sufficient CPU support to avoid GPU starvation (in the statistics of my computers, the execution time is usually about 2 times longer, but this is because they process 2 GW tasks in parallel or sometimes even 1 GW task + 2 BRP7/MeerKAT tasks).

Whereas, for example, on AMD HD 7850 / HD 7870 / HD 7950 / R7 265 / R9 270 / R9 280 etc (Pitcairn and Tahiti GPU chips, GCN 1.0 micro-architecture, supports only OpenCL 1.2), progress is extremely slow - less than 1% per full day of calculation, i.e. few thousands of times slower!
The application did not freeze/hung - the logs show the progress of calculations, saving checkpoints, hardware monitoring shows low (20-40% plus reduced operating frequencies), but non-zero GPU usage + maximum load of 1 CPU core/thread.

Example of log(stderr.txt) after >=1 day of calc:

https://pastebin.com/6x7dm8Cg

And it keeps adding few dots every hours to the log and write checkpoints on regular intervals. Whereas the full task, as I understand it = 9000 such dots.

So everything looks like normal, correct Work. Except that it's moving abnormally slowly.

Although at the hardware level, in terms of theoretical FP performance, these GPUs differ by less than 2 times:
AMD RX570 = 2048 Shader Processors running @ 1.3 GHz = 5.3 TFLOPS of Theoretical Peak FP32 performance
HD 7870 = 1280 SP@ 1.1 GHz = 2.8 TFLOPS theoretical peak FP32 performance

5.3/2.8 ~= 1.89 times
The hardware architecture is also very close - these are different iterations/sub-versions of the same AMD GCN architecture. And VRAM almost the same (GDDR5 on 256bit bus, only RAM clock speeds differs by a few dozen %)
The only significant difference that I am aware of is that the newer GCN versions supports OpenCL 2.0, whereas its first iteration supported only OpenCL 1.2.

The OS and driver versions are also the exactly the same used to compare the RX 570 and HD 7870.

I also tried using this and other similar outdated but still well-functioning(and in fact, even faster than some of the latest generation iGPUs.) GPUs to crunch for the BRP7/MeerKAT project. It works without visible problems and is quite fast for an old GPUs. BUT gives a huge number of validation errors after sending results back to server. Moreover, it is also inconsistent and unstable on this part - in one or several days it can be only 1-3% of tasks with validation errors, and in another it suddenly exceeds 50%, and then again several % for few days, then jumps to 30%, and so on. Despite the fact that nothing changed on my side at all - even the computer did not restart in between.

P.S.
All same GPUs worked without any issues and with near 100% validation rates on few other GPU DC projects when there was work for them: FGRPB1G here for E@H and OPNG1 for WCG (Open Pandemics - molecular dynamics for Medical Research). But all of them have no more tasks to process for at least half a year or more.

Ian&Steve C.

Joined: 19 Jan 20

Posts: 4116

Credit: 49145237424

RAC: 32184959

historically those old GCN

29 Dec 2024 3:22:52 UTC

Message 231441 in response to message 231439

(moderation:

)

historically those old GCN 1.0 GPUs have not been able to process the work from the GW application. probably some required feature used in the app that's not supported by the GPU.

the app is actually hung. it's not just processing slow. it's not processing at all. BOINC will count at minimal rate when it gets no feedback about progress. in increments of 0.001%. but that doesnt mean it's doing anything, just default BOINC behavior.

each line showing ".c" seems to indicate as much. if it was doing anything, you would see lines with longer outputs like "........c" ".......................c" etc. You can see this in your RX570 results

stick to the BRP7 app for your old GPUs since they seem to work there.

_________________________________________________________________________

All-Sky Gravitational Wave Search on O3 data (O3ASHF1)

Forums › Technical News

Comment viewing options

Forums › Technical News