stuck in loops

Henry Tsai
Henry Tsai
Joined: 28 Sep 10
Posts: 3
Credit: 919709
RAC: 0
Topic 195376

Hi,
While it is interesting to participate the Einstein@home project, I came across a couple tasks that were stuck in loops. I could see them (CUDA) using up hours and hours of GPU calculation, but progress remained at the same point. Hope you guys can resolve or prevent this type of problem soon. Thanks.
Henry Tsai, Ph.D.

mikey
mikey
Joined: 22 Jan 05
Posts: 12857
Credit: 1884348140
RAC: 281210

stuck in loops

Quote:
Hi,
While it is interesting to participate the Einstein@home project, I came across a couple tasks that were stuck in loops. I could see them (CUDA) using up hours and hours of GPU calculation, but progress remained at the same point. Hope you guys can resolve or prevent this type of problem soon. Thanks.
Henry Tsai, Ph.D.

If you see one of these units in progress, just exit Boinc and then restart it, it should reset the numbers to actual time used and pick up crunching from where it stopped. Apparently there are no clear cut answers as to why this happens but one idea is that it could be related to the Boinc benchmarks being run every week. The only problem with that is that I just had one crash, on another project, and the benchmarks were not run. Apparently Boinc does not stop the gpu when the benchmarks are being run, causing issues.

Richard Haselgrove
Richard Haselgrove
Joined: 10 Dec 05
Posts: 2143
Credit: 3001491932
RAC: 698360

RE: Apparently Boinc does

Message 99988 in response to message 99987

Quote:
Apparently Boinc does not stop the gpu when the benchmarks are being run, causing issues.


Where did you read that? I haven't seen it reported anywhere, and it doesn't match my observations: I always see NVidia tasks being suspended when benchmarks run, although I can't check that the same is true for ATI (no suitable card here).

One recent change, which David actually checked in at my request, is that GPU tasks are no longer unloaded from the graphics card's memory during benchmarking (it's only the CPU which is benchmarked, after all). In theory, that should mean that the GPU task can restart more quickly afterwards - but maybe it causes problems with the hybrid Einstein app. If that's the case, it should be confirmed and reported on boinc_alpha.

Julie Kazzimir
Julie Kazzimir
Joined: 15 Sep 10
Posts: 1
Credit: 9695
RAC: 0

my computer is also stuck in

my computer is also stuck in a loop, the project is not advancing. turning my computer off and on does not help

mikey
mikey
Joined: 22 Jan 05
Posts: 12857
Credit: 1884348140
RAC: 281210

RE: RE: Apparently Boinc

Message 99990 in response to message 99988

Quote:
Quote:
Apparently Boinc does not stop the gpu when the benchmarks are being run, causing issues.

Where did you read that? I haven't seen it reported anywhere, and it doesn't match my observations: I always see NVidia tasks being suspended when benchmarks run, although I can't check that the same is true for ATI (no suitable card here).

Read this http://boinc.thesonntags.com/collatz/forum_thread.php?id=603

Slicker is the Project Admin over there, scroll down a few messages to number Message 9881 - Posted 4 Oct 2010 13:37:13 UTC.

Quote:
One recent change, which David actually checked in at my request, is that GPU tasks are no longer unloaded from the graphics card's memory during benchmarking (it's only the CPU which is benchmarked, after all). In theory, that should mean that the GPU task can restart more quickly afterwards - but maybe it causes problems with the hybrid Einstein app. If that's the case, it should be confirmed and reported on boinc_alpha.

Is that in version 2.05? Also the Boinc Alpha mailing list is not always a nice place for us 'boinc users' especially if we seem to be whining! Being ignored on the list seems to be a fairly common occurrence. Of course impressions are what is important to us non programmers, not always deeds, so if one gets the impression they are being ignored, even if they are not, it tends to stick. The mailing list has never been what is called 'user friendly'. And NO I am not blaming anyone, it is what it is.

mikey
mikey
Joined: 22 Jan 05
Posts: 12857
Credit: 1884348140
RAC: 281210

RE: my computer is also

Message 99991 in response to message 99989

Quote:
my computer is also stuck in a loop, the project is not advancing. turning my computer off and on does not help

Just exit Boinc and then after a minute or so restart it and it should be okay. If not just abort the task and move on, the task will be sent to someone else. Not all tasks can be crunched, some are just bad.

Richard Haselgrove
Richard Haselgrove
Joined: 10 Dec 05
Posts: 2143
Credit: 3001491932
RAC: 698360

RE: RE: RE: Apparently

Message 99992 in response to message 99990

Quote:
Quote:
Quote:
Apparently Boinc does not stop the gpu when the benchmarks are being run, causing issues.

Where did you read that? I haven't seen it reported anywhere, and it doesn't match my observations: I always see NVidia tasks being suspended when benchmarks run, although I can't check that the same is true for ATI (no suitable card here).

Read this http://boinc.thesonntags.com/collatz/forum_thread.php?id=603

Slicker is the Project Admin over there, scroll down a few messages to number Message 9881 - Posted 4 Oct 2010 13:37:13 UTC.

Quote:
One recent change, which David actually checked in at my request, is that GPU tasks are no longer unloaded from the graphics card's memory during benchmarking (it's only the CPU which is benchmarked, after all). In theory, that should mean that the GPU task can restart more quickly afterwards - but maybe it causes problems with the hybrid Einstein app. If that's the case, it should be confirmed and reported on boinc_alpha.

Is that in version 2.05?


I was thinking of BOINC changes. This was, to be specific, changeset [trac]changeset:20765[/trac] - actually, at the beginning of March, so not that recent. That's been in every BOINC client since v6.10.43, so I think we'd have heard of problems by now if they were widespread.

v2.05 is a Collatz-specific application version - it sounds as if they have a little local problem there, different from any generic BOINC problem. Doesn't stop Einstein having a local version of the same problem, of course - and sometimes these "two projects at once" reports do reveal underlying BOINC problems.

Quote:
Also the Boinc Alpha mailing list is not always a nice place for us 'boinc users' especially if we seem to be whining! Being ignored on the list seems to be a fairly common occurrence. Of course impressions are what is important to us non programmers, not always deeds, so if one gets the impression they are being ignored, even if they are not, it tends to stick. The mailing list has never been what is called 'user friendly'. And NO I am not blaming anyone, it is what it is.


I hear what you say, and agree with the sentiments. But for the time being, boinc_alpha is what we've got, and if we want to improve BOINC, we can only work with it.

mikey
mikey
Joined: 22 Jan 05
Posts: 12857
Credit: 1884348140
RAC: 281210

RE: RE: RE: RE: Appar

Message 99993 in response to message 99992

Quote:
Quote:
Quote:
Quote:
Apparently Boinc does not stop the gpu when the benchmarks are being run, causing issues.

Where did you read that? I haven't seen it reported anywhere, and it doesn't match my observations: I always see NVidia tasks being suspended when benchmarks run, although I can't check that the same is true for ATI (no suitable card here).

Read this http://boinc.thesonntags.com/collatz/forum_thread.php?id=603

Slicker is the Project Admin over there, scroll down a few messages to number Message 9881 - Posted 4 Oct 2010 13:37:13 UTC.

Quote:
One recent change, which David actually checked in at my request, is that GPU tasks are no longer unloaded from the graphics card's memory during benchmarking (it's only the CPU which is benchmarked, after all). In theory, that should mean that the GPU task can restart more quickly afterwards - but maybe it causes problems with the hybrid Einstein app. If that's the case, it should be confirmed and reported on boinc_alpha.

Is that in version 2.05?

I was thinking of BOINC changes. This was, to be specific, changeset [trac]changeset:20765[/trac] - actually, at the beginning of March, so not that recent. That's been in every BOINC client since v6.10.43, so I think we'd have heard of problems by now if they were widespread.

v2.05 is a Collatz-specific application version - it sounds as if they have a little local problem there, different from any generic BOINC problem. Doesn't stop Einstein having a local version of the same problem, of course - and sometimes these "two projects at once" reports do reveal underlying BOINC problems.

I did not know 2.05 was a Collatz thing, sorry about that. But the gpu problems are occurring across several projects, not just Collatz. I have also personally experienced it on DNETC too. The current theory is that when the benchmarks are started by Boinc it is causing a freezing of the gpu and the workunit is not restarting itself afterwards. Although my personal experiences do not seem to reflect this, I attached to DNETC with an Nvidia card and the same evening it froze! The next day when I again checked the machine, I have a bunch, I noticed that the unit was over 20 hours in and that is VERY long for those units, so I stopped and restarted Boinc and the unit went back to only 15 minutes, roughly, of crunching time and then errored out after a few more minutes.

Quote:
Quote:
Also the Boinc Alpha mailing list is not always a nice place for us 'boinc users' especially if we seem to be whining! Being ignored on the list seems to be a fairly common occurrence. Of course impressions are what is important to us non programmers, not always deeds, so if one gets the impression they are being ignored, even if they are not, it tends to stick. The mailing list has never been what is called 'user friendly'. And NO I am not blaming anyone, it is what it is.

I hear what you say, and agree with the sentiments. But for the time being, boinc_alpha is what we've got, and if we want to improve BOINC, we can only work with it.

I know Dr A and crew are what they are, I am not slamming them anymore, I am just relating my own personal experiences. I encourage others to make their own, who knows they could be much different. And to be honest Rom and others can be OUTSTANDING!!

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.