Unrecoverable error

Ensor
Joined: 9 Feb 05
Posts: 49
Credit: 1450362
RAC: 0
Topic 188233

Hi, I've just encountered a problem with the "Einstein" screen saver.

My machine had been sat around idle (aside from running BOINC/Einstein) for about an hour when the screensaver froze. Processing of the current WU had reached around 50%.

I found the following in the BOINC log:

Einstein@Home - 2005-03-03 20:21:32 - Unrecoverable error for result H1_1494.4__1494.9_0.1_T00_Test02_4 (The system cannot find the path specified. (0x3) - exit code 3 (0x3))
Einstein@Home - 2005-03-03 20:21:32 - Computation for result H1_1494.4__1494.9_0.1_T00_Test02 finished

Any ideas what may have caused this problem? It's the first time I've encountered it (aside from the time I accidentally terminated the "einstein" process in Task Manager....errm, oops).

Oh, the WU in question is #406723. Thanks.

TTFN - Pete.


Ensor
Joined: 9 Feb 05
Posts: 49
Credit: 1450362
RAC: 0

Just to add to the above, the actual result ID for the WU which caused the problem is #1554663.

TTFN - Pete.


Gilles Dorion
Joined: 3 Mar 05
Posts: 8
Credit: 653367
RAC: 0

From the message pane:

Einstein@Home - 2005-03-04 20:10:29 - Pausing result H1_0694.4__0694.9_0.1_T03_Test02_3 (removed from memory)

SETI@home - 2005-03-04 20:10:29 - Restarting result 06ja05aa.22917.29426.373588.61_1 using setiathome version 4.09

Einstein@Home - 2005-03-04 20:10:40 - Unrecoverable error for result H1_0694.4__0694.9_0.1_T03_Test02_3 ( - exit code -1073741819 (0xc0000005))

Einstein@Home - 2005-03-04 20:10:40 - Deferring communication with project for 1 minutes and 0 seconds

Einstein@Home - 2005-03-04 20:10:40 - Computation for result H1_0694.4__0694.9_0.1_T03_Test02 finished

Using einstein v 4.79

Any ideas?

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4332
Credit: 251635715
RAC: 36178

0xc0000005 is a general memory access violation. Could be about anything. Is there something in stderr.txt saying at which address it occurred?
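
For anyone wondering how the two numbers in the log line relate: the negative exit code and the hex value are the same 32-bit pattern, one read as signed and one as unsigned. A minimal Python sketch (the function name `to_unsigned_hex` is made up for illustration, it is not part of BOINC):

```python
def to_unsigned_hex(exit_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as unsigned and format it as hex."""
    # Masking with 0xFFFFFFFF keeps the low 32 bits, which turns a negative
    # two's-complement value into its unsigned equivalent.
    return f"0x{exit_code & 0xFFFFFFFF:08x}"


print(to_unsigned_hex(-1073741819))  # 0xc0000005
print(to_unsigned_hex(3))            # 0x00000003
```

So "exit code -1073741819" and "0xc0000005" in the message pane are one and the same value.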

BM

Gilles Dorion
Joined: 3 Mar 05
Posts: 8
Credit: 653367
RAC: 0

Message 6922 in response to message 6921

> 0xc0000005 is a general memory access violation. Could be about anything. Is
> there something in stderr.txt saying at which address it occurred?
>
> BM
>
>
Hi,
This is what the Event Viewer reported. So far it has only occurred once. I will keep an eye on it. The message below gives the instruction at fault and the memory address.

Application popup: einstein_4.79_windows_intelx86.exe - Application Error: The instruction at "0x69685e95" referenced memory at "0x00040004". The memory could not be "written".

Thanks

Ensor
Joined: 9 Feb 05
Posts: 49
Credit: 1450362
RAC: 0

Message 6923 in response to message 6919

And it happens again.

From BOINC logs:

Einstein@Home - 2005-03-05 08:35:35 - Starting result H1_1494.4__1494.7_0.1_T02_Test02_2 using einstein version 4.79
Einstein@Home - 2005-03-05 14:53:06 - Unrecoverable error for result H1_1494.4__1494.7_0.1_T02_Test02_2 (The system cannot find the path specified. (0x3) - exit code 3 (0x3))
Einstein@Home - 2005-03-05 14:53:06 - Computation for result H1_1494.4__1494.7_0.1_T02_Test02 finished

Same error as on Thursday. The result in question this time is: #1636139.

Unfortunately I wasn't near my machine when the error occurred so I don't know how much of the WU my machine had processed when this happened. :-(

Any suggestions for tracking down this problem appreciated.

TTFN - Pete.


Ensor
Joined: 9 Feb 05
Posts: 49
Credit: 1450362
RAC: 0

Message 6924 in response to message 6923


Bump.

> And it happens again.

And yet again, same error as a few weeks ago. From BOINC logs:

Einstein@Home - 2005-03-19 21:40:50 - Unrecoverable error for result H1_1494.4__1494.6_0.1_T08_Test02_1 (The system cannot find the path specified. (0x3) - exit code 3 (0x3))
Einstein@Home - 2005-03-19 21:40:50 - Computation for result H1_1494.4__1494.6_0.1_T08_Test02 finished
Einstein@Home - 2005-03-19 21:40:51 - Starting result H1_1494.4__1494.7_0.1_T08_Test02_1 using einstein version 4.79

The result in question this time is: #2092207.

Can ANYONE please give me a clue as to why this error might be occurring? Is it a configuration problem at this end which I can fix, or a BOINC/Einstein bug I just have to live with for now?

Thanks.

TTFN - Pete.


Ensor
Joined: 9 Feb 05
Posts: 49
Credit: 1450362
RAC: 0

Message 6925 in response to message 6924


And again....result in question this time is: #2217511.

If the pattern from last time is repeated, I should suffer a screensaver lockup in the next couple of days.

TTFN - Pete.


Alex Walker
Joined: 21 Mar 05
Posts: 1
Credit: 40977
RAC: 0

I've got the same damn problem. Just downloaded BOINC yesterday (upgrade from SETI Classic...). So far I have received 6 WU's, the first 4 of which had download errors. Now 2 today have had computational errors, producing logs similar to the following:

22/03/2005 11:31:00 AM|Einstein@Home|Unrecoverable error for result H1_0733.4__0733.9_0.1_T17_Test02_2 ( - exit code -1073741819 (0xc0000005))
22/03/2005 11:31:00 AM|Einstein@Home|Deferring communication with project for 59 seconds
22/03/2005 11:31:00 AM|Einstein@Home|Computation for result H1_0733.4__0733.9_0.1_T17_Test02 finished

What the hell am I supposed to do about it??

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4332
Credit: 251635715
RAC: 36178

Message 6927 in response to message 6926

> I've got the same damn problem.

Not precisely. Your error code is different; it describes a general access violation. Looking at your results, I see that it happens at address 0x7C918FEA, which is not in the E@H code. Usually (but not always) the graphics driver is mapped there, so it looks like you have a problem with it. The app probably crashes when trying to start the screensaver. Try (re-)installing the latest driver for your card.
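
To illustrate what "not in the E@H code" means: every module loaded into a process occupies an address range, and a fault address can be matched against those ranges. A rough Python sketch of the idea follows. The module names, base addresses and sizes are invented for the example and are not taken from any real machine; the actual module list would have to come from a debugger or a process viewer.

```python
from typing import NamedTuple, Optional, Sequence


class Module(NamedTuple):
    name: str
    base: int  # load address of the module
    size: int  # size of the mapped image in bytes


def find_module(address: int, modules: Sequence[Module]) -> Optional[str]:
    """Return the name of the module whose range contains address, if any."""
    for m in modules:
        if m.base <= address < m.base + m.size:
            return m.name
    return None


# Hypothetical module map, for illustration only.
modules = [
    Module("hypothetical_app.exe", 0x00400000, 0x00200000),
    Module("hypothetical_library.dll", 0x7C900000, 0x000B0000),
]

print(find_module(0x7C918FEA, modules))  # hypothetical_library.dll
```

With this made-up map the fault address lands outside the application's own range, which is the kind of reasoning applied above.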

BM

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4332
Credit: 251635715
RAC: 36178

Ensor,

your problem (return code 3) indicates a problem either in our code or in the intermediate data that gets written. It might be a problem on your machine or a problem related to the data file you got. Resetting the project should help (for you) if it is related to the data, but not if there is a problem on your system (access rights, maybe a memory problem). I'm keeping an eye on the code side.
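
On the access-rights possibility: Windows error 3 is "The system cannot find the path specified", so one quick sanity check is whether the directory the application writes to exists and accepts new files. A minimal Python sketch, assuming you know which directory to test (the location of the BOINC project directory depends on the installation and is not given in this thread); `dir_is_writable` is a made-up helper name:

```python
import os
import tempfile


def dir_is_writable(path: str) -> bool:
    """Return True if path is an existing directory where a file can be created."""
    if not os.path.isdir(path):
        return False
    try:
        # Create and immediately discard a scratch file inside the directory.
        with tempfile.TemporaryFile(dir=path):
            return True
    except OSError:
        return False


# Example call against the system temp directory; substitute the directory
# you actually want to test.
print(dir_is_writable(tempfile.gettempdir()))
```

A False result would point towards the "problem on your system" side rather than the data file.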

BM
