All outcomes saying "client error"

Shaun Lippy
Shaun Lippy
Joined: 24 Mar 08
Posts: 3
Credit: 3314
RAC: 0
Topic 193603

Everything was working for a couple of days, with the outcomes all saying "success". Then, a couple of days ago, I started getting client errors on every single computation. Here is an example:

http://einsteinathome.org/task/94515676

They all have the same error:

*** glibc detected *** ../../projects/einstein.phys.uwm.edu/einstein_S5R3_4.38_i686-pc-linux-gnu_1: munmap_chunk(): invalid pointer: 0x08c11088 ***

Any ideas before I leave the project? Thanks.

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4349
Credit: 253227491
RAC: 39405

All outcomes saying "client error"

Just for curiosity: What distro is this?

Please try to put a file named "CPU_TYPE_0" (without quotation marks) into your BOINC directory and see if the same thing happens with einstein_S5R3_4.38_i686-pc-linux-gnu_0.

BM

BM

Shaun Lippy
Shaun Lippy
Joined: 24 Mar 08
Posts: 3
Credit: 3314
RAC: 0

RE: Just for curiosity:

Message 80525 in response to message 80524

Quote:

Just for curiosity: What distro is this?

Please try to put a file named "CPU_TYPE_0" (without quotation marks) into your BOINC directory and see if the same thing happens with einstein_S5R3_4.38_i686-pc-linux-gnu_0.

BM

The distro is Red Hat Advanced Server 5. I will try your suggestion when I get a chance.

Shaun Lippy
Shaun Lippy
Joined: 24 Mar 08
Posts: 3
Credit: 3314
RAC: 0

RE: RE: Just for

Message 80526 in response to message 80525

Quote:
Quote:

Just for curiosity: What distro is this?

Please try to put a file named "CPU_TYPE_0" (without quotation marks) into your BOINC directory and see if the same thing happens with einstein_S5R3_4.38_i686-pc-linux-gnu_0.

BM

The distro is Red Hat Advanced Server 5. I will try your suggestion when I get a chance.

No dice. Thanks for trying, though.

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5887
Credit: 119348472720
RAC: 25802364

RE: No dice. Thanks for

Message 80527 in response to message 80526

Quote:
No dice. Thanks for trying, though.

Just out of curiosity, I had a look through your tasks list for that machine - actually the page showing the last successful results and the first of the errors. It was interesting to note that the first two tasks (dual core machine) showing errors also showed that crunching had been stopped approximately half way through in each case and then restarted after what appears to be quite a time lapse. For instance, the first error result was started at 04:43 UTC on March 28 and was restarted at 20:00 UTC at skypoint position 704/1202, ie. just over half way through. At that point about 4 hours of actual crunch time would have been accumulated so crunching had been stopped for about 11 hours or so. The other task seemingly shows a similar position.

I'm guessing that the machine may have been down for maintenance/upgrade of some sort and that perhaps new or updated software may have been installed? Is it possible that something happened at this time that is now having an unintended affect on BOINC? It doesn't seem to be anything to do with the science app as the crunching seems to be completing quite successfully in most cases.

Cheers,
Gary.

rroonnaalldd
rroonnaalldd
Joined: 12 Dec 05
Posts: 116
Credit: 537221
RAC: 0

RE: RE: No dice. Thanks

Message 80528 in response to message 80527

Quote:
Quote:
No dice. Thanks for trying, though.

Just out of curiosity, I had a look through your tasks list for that machine - actually the page showing the last successful results and the first of the errors. It was interesting to note that the first two tasks (dual core machine) showing errors also showed that crunching had been stopped approximately half way through in each case and then restarted after what appears to be quite a time lapse. For instance, the first error result was started at 04:43 UTC on March 28 and was restarted at 20:00 UTC at skypoint position 704/1202, ie. just over half way through. At that point about 4 hours of actual crunch time would have been accumulated so crunching had been stopped for about 11 hours or so. The other task seemingly shows a similar position.

I'm guessing that the machine may have been down for maintenance/upgrade of some sort and that perhaps new or updated software may have been installed? Is it possible that something happened at this time that is now having an unintended affect on BOINC? It doesn't seem to be anything to do with the science app as the crunching seems to be completing quite successfully in most cases.

Therefor i had made the decision to not change anything anymore on my cruching pc. I had make experiences with team-networking for load-balancing and fault-redundance or had make the OS-updates automatic in the background while boinc was also running and ended up with a screwed host and unexpected error 0x000005 on read request...
Murphy's Law is true and the often used words: "Never change a running system!" are gone into my blood.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.