Unrecoverable error for result - exit code 99 (0x63)

Thomas Madigan
Joined: 26 May 05
Posts: 6
Credit: 4985448
RAC: 0
Topic 192843

This error has been occurring for over two months. I've looked on this board and on the Internet, and there doesn't seem to be a clear resolution. There was some discussion of it on the CPDN board, again without a clear resolution. SETI@home works fine, but every WU for Einstein@home fails with this error. If this persists, I'll be forced to detach from the project, as I don't relish wasting power and CPU cycles on something that will ultimately fail. The following is the recent log showing the failure of the latest E@H WU. Following some suggestions on the CPDN board, I've upgraded to BOINC 5.9.12 and have detached from CPDN, which had been running.

6/12/2007 7:05:01 PM||Starting BOINC client version 5.9.12 for windows_intelx86
6/12/2007 7:05:01 PM||log flags: task, file_xfer, sched_ops
6/12/2007 7:05:01 PM||Libraries: libcurl/7.16.1 OpenSSL/0.9.8e zlib/1.2.3
6/12/2007 7:05:01 PM||Data directory: D:\Program Files\BOINC
6/12/2007 7:05:01 PM||Processor: 1 GenuineIntel Intel(R) Pentium(R) III CPU family 1400MHz [x86 Family 6 Model 11 Stepping 1]
6/12/2007 7:05:01 PM||Processor features: fpu tsc sse mmx
6/12/2007 7:05:01 PM||Memory: 1023.47 MB physical, 2.40 GB virtual
6/12/2007 7:05:01 PM||Disk: 19.53 GB total, 9.24 GB free
6/12/2007 7:05:01 PM|Einstein@Home|URL: http://einstein.phys.uwm.edu/; Computer ID: 417189; location: home; project prefs: default
6/12/2007 7:05:01 PM|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 1445505; location: home; project prefs: default
6/12/2007 7:05:01 PM||General prefs: from Einstein@Home (last modified 2005-05-26 02:29:49)
6/12/2007 7:05:01 PM||Host location: home
6/12/2007 7:05:01 PM||General prefs: no separate prefs for home; using your defaults
6/12/2007 7:05:01 PM||Preferences limit memory usage when active to 511.73MB
6/12/2007 7:05:01 PM||Preferences limit memory usage when idle to 921.12MB
6/12/2007 7:05:01 PM||Preferences limit disk usage to 9.15GB
6/12/2007 7:05:01 PM|SETI@home|Restarting task 16fe05aa.28127.19362.111068.3.157_1 using setiathome_enhanced version 515
6/12/2007 8:05:04 PM|Einstein@Home|Restarting task h1_0451.60_S5R2__212_S5R2c_1 using einstein_S5R2 version 417
6/12/2007 8:05:21 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/12/2007 8:05:21 PM|Einstein@Home|Reason: Unrecoverable error for result h1_0451.60_S5R2__212_S5R2c_1 ( - exit code 99 (0x63))
6/12/2007 8:05:21 PM|Einstein@Home|Computation for task h1_0451.60_S5R2__212_S5R2c_1 finished
6/12/2007 8:05:21 PM|Einstein@Home|Output file h1_0451.60_S5R2__212_S5R2c_1_0 for task h1_0451.60_S5R2__212_S5R2c_1 absent
6/12/2007 8:05:21 PM|SETI@home|Restarting task 16fe05aa.28127.19362.111068.3.157_1 using setiathome_enhanced version 515
6/12/2007 10:29:23 PM|Einstein@Home|Sending scheduler request: To report completed tasks
6/12/2007 10:29:23 PM|Einstein@Home|Reporting 1 tasks
6/12/2007 10:29:28 PM|Einstein@Home|Scheduler RPC succeeded [server version 509]
6/12/2007 10:29:28 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/12/2007 10:29:28 PM|Einstein@Home|Reason: requested by project
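For anyone comparing logs across several machines, the failing tasks can be pulled out of a pasted log mechanically. The following is a minimal sketch, assuming the `timestamp|project|message` layout seen in the log above; the function name `find_failures` and the returned dictionary keys are hypothetical and not part of BOINC.

```python
import re

# Matches the error text as it appears in the log above, e.g.
# "Unrecoverable error for result <task> ( - exit code 99 (0x63))"
ERROR_RE = re.compile(
    r"Unrecoverable error for result (\S+) "
    r"\( - exit code (\d+) \((0x[0-9a-fA-F]+)\)\)"
)


def find_failures(log_text):
    """Return one dict per 'Unrecoverable error' line found in the log text."""
    failures = []
    for line in log_text.splitlines():
        # Assumed layout: timestamp|project|message (project may be empty).
        parts = line.split("|", 2)
        if len(parts) != 3:
            continue
        timestamp, project, message = parts
        match = ERROR_RE.search(message)
        if match is None:
            continue
        task, code_dec, code_hex = match.groups()
        failures.append(
            {
                "timestamp": timestamp,
                "project": project,
                "task": task,
                "exit_code": int(code_dec),
                # The log prints the code twice; record whether both agree.
                "codes_agree": int(code_dec) == int(code_hex, 16),
            }
        )
    return failures
```

Run against the log above, this should report a single Einstein@Home failure with exit code 99, which is the same value as the 0x63 shown in parentheses.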

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 0

Please try to reset Einstein. The exit code 99 is a problem with the datafile you are crunching (the big one that is sliced into tasks for you to run). Resetting the project should get you a new datafile.

Thomas Madigan
Joined: 26 May 05
Posts: 6
Credit: 4985448
RAC: 0

Message 68137 in response to message 68136

Thank you! I'll report back if I have additional problems.

Quote:
Please try to reset Einstein. The exit code 99 is a problem with the datafile you are crunching (the big one that is sliced into tasks for you to run). Resetting the project should get you a new datafile.


Thomas Madigan
Joined: 26 May 05
Posts: 6
Credit: 4985448
RAC: 0

Message 68138 in response to message 68136

Resetting the project had no effect. The same problem occurs. The following is the text of the log that includes the abend. I did note that the problem first occurred very soon after I started participating in CPDN. I have since disconnected from that project.

The following are the relevant messages surrounding the error:

6/21/2007 4:53:43 PM|SETI@home|Restarting task 10ap99ab.14928.16513.36084.3.217_0 using setiathome_enhanced version 515
6/21/2007 6:22:56 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 6:23:20 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/21/2007 6:23:20 PM|Einstein@Home|Reason: Unrecoverable error for result h1_0447.15_S5R2__204_S5R2c_3 ( - exit code 99 (0x63))
6/21/2007 6:23:20 PM|Einstein@Home|Computation for task h1_0447.15_S5R2__204_S5R2c_3 finished
6/21/2007 6:23:20 PM|Einstein@Home|Output file h1_0447.15_S5R2__204_S5R2c_3_0 for task h1_0447.15_S5R2__204_S5R2c_3 absent

The remaining messages are included for continuity.

6/21/2007 8:44:58 AM||Starting BOINC client version 5.8.16 for windows_intelx86
6/21/2007 8:44:58 AM||log flags: task, file_xfer, sched_ops
6/21/2007 8:44:58 AM||Libraries: libcurl/7.16.0 OpenSSL/0.9.8a zlib/1.2.3
6/21/2007 8:44:58 AM||Data directory: D:\Program Files\BOINC
6/21/2007 8:44:59 AM||Processor: 1 GenuineIntel Intel(R) Pentium(R) III CPU family 1400MHz [x86 Family 6 Model 11 Stepping 1] [fpu tsc sse mmx]
6/21/2007 8:44:59 AM||Memory: 1023.47 MB physical, 2.41 GB virtual
6/21/2007 8:44:59 AM||Disk: 19.53 GB total, 10.15 GB free
6/21/2007 8:44:59 AM|Einstein@Home|URL: http://einstein.phys.uwm.edu/; Computer ID: 965546; location: home; project prefs: default
6/21/2007 8:44:59 AM||General prefs: from Einstein@Home (last modified 2005-05-26 02:29:49)
6/21/2007 8:44:59 AM||Host location: home
6/21/2007 8:44:59 AM||General prefs: no separate prefs for home; using your defaults
6/21/2007 9:45:04 AM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 11:45:40 AM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 1:53:09 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 3:53:32 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 6:22:56 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__204_S5R2c_3 using einstein_S5R2 version 417
6/21/2007 6:23:20 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/21/2007 6:23:20 PM|Einstein@Home|Reason: Unrecoverable error for result h1_0447.15_S5R2__204_S5R2c_3 ( - exit code 99 (0x63))
6/21/2007 6:23:20 PM|Einstein@Home|Computation for task h1_0447.15_S5R2__204_S5R2c_3 finished
6/21/2007 6:23:20 PM|Einstein@Home|Output file h1_0447.15_S5R2__204_S5R2c_3_0 for task h1_0447.15_S5R2__204_S5R2c_3 absent
6/21/2007 6:23:20 PM|SETI@home|Restarting task 10ap99ab.14928.16513.36084.3.217_0 using setiathome_enhanced version 515
6/21/2007 6:24:21 PM|Einstein@Home|Sending scheduler request: To fetch work
6/21/2007 6:24:21 PM|Einstein@Home|Requesting 3382 seconds of new work, and reporting 1 completed tasks
6/21/2007 6:24:26 PM|Einstein@Home|Scheduler RPC succeeded [server version 509]
6/21/2007 6:24:26 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/21/2007 6:24:26 PM|Einstein@Home|Reason: requested by project
6/21/2007 7:23:29 PM|Einstein@Home|Starting h1_0447.15_S5R2__126_S5R2c_1
6/21/2007 7:23:29 PM|Einstein@Home|Starting task h1_0447.15_S5R2__126_S5R2c_1 using einstein_S5R2 version 417
6/21/2007 9:24:47 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__126_S5R2c_1 using einstein_S5R2 version 417
6/21/2007 10:42:09 PM|Einstein@Home|Restarting task h1_0447.15_S5R2__126_S5R2c_1 using einstein_S5R2 version 417
6/21/2007 10:51:35 PM|Einstein@Home|Resetting project
6/21/2007 10:51:36 PM|Einstein@Home|Sending scheduler request: To fetch work
6/21/2007 10:51:36 PM|Einstein@Home|Requesting 4320 seconds of new work
6/21/2007 10:51:41 PM|Einstein@Home|Scheduler RPC succeeded [server version 509]
6/21/2007 10:51:41 PM|Einstein@Home|Message from server: Resent lost result h1_0447.15_S5R2__126_S5R2c_1
6/21/2007 10:51:41 PM|Einstein@Home|Deferring communication for 1 min 0 sec
6/21/2007 10:51:41 PM|Einstein@Home|Reason: requested by project
6/21/2007 10:51:43 PM|Einstein@Home|[file_xfer] Started download of file einstein_S5R2_4.17_windows_intelx86.exe
6/21/2007 10:51:43 PM|Einstein@Home|[file_xfer] Started download of file einstein_S5R2_4.17_windows_intelx86.pdb
6/21/2007 10:51:49 PM|Einstein@Home|[file_xfer] Finished download of file einstein_S5R2_4.17_windows_intelx86.exe
6/21/2007 10:51:49 PM|Einstein@Home|[file_xfer] Throughput 751366 bytes/sec
6/21/2007 10:51:50 PM|Einstein@Home|[file_xfer] Finished download of file einstein_S5R2_4.17_windows_intelx86.pdb
6/21/2007 10:51:50 PM|Einstein@Home|[file_xfer] Throughput 740433 bytes/sec
6/21/2007 10:51:51 PM|Einstein@Home|Starting h1_0447.15_S5R2__126_S5R2c_1
6/21/2007 10:51:51 PM|Einstein@Home|Starting task h1_0447.15_S5R2__126_S5R2c_1 using einstein_S5R2 version 417

Quote:
Please try to reset Einstein. The exit code 99 is a problem with the datafile you are crunching (the big one that is sliced into tasks for you to run). Resetting the project should get you a new datafile.


Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 0

6/21/2007 10:51:41 PM|Einstein@Home|Message from server: Resent lost result h1_0447.15_S5R2__126_S5R2c_1

You got the same datafile resent. So it's still corrupt and will still break down.
You could try a detach/reattach, or keep resetting until you get a new datafile.
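To check whether a reset actually produced different work, the log can be scanned for tasks that were already running before "Resetting project" and then show up again in a "Resent lost result" message. This is only a rough sketch, assuming the `timestamp|project|message` layout and the message wording seen in the logs in this thread; the function name `resent_after_reset` is made up for illustration.

```python
import re

# Message wording taken from the log lines quoted in this thread.
STARTING_RE = re.compile(r"(?:Starting|Restarting) task (\S+)")
RESENT_RE = re.compile(r"Resent lost result (\S+)")


def resent_after_reset(log_text, project="Einstein@Home"):
    """List tasks seen before a project reset that the server resent afterwards."""
    seen_before_reset = set()
    reset_seen = False
    resent = []
    for line in log_text.splitlines():
        parts = line.split("|", 2)
        # Skip lines that do not fit the assumed layout or belong to other projects.
        if len(parts) != 3 or parts[1] != project:
            continue
        message = parts[2]
        if "Resetting project" in message:
            reset_seen = True
            continue
        if not reset_seen:
            match = STARTING_RE.search(message)
            if match:
                seen_before_reset.add(match.group(1))
        else:
            match = RESENT_RE.search(message)
            if match and match.group(1) in seen_before_reset:
                resent.append(match.group(1))
    return resent
```

An empty list would suggest the reset brought in work the client had not seen before; a non-empty list means at least one earlier task came back unchanged.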

Thomas Madigan
Joined: 26 May 05
Posts: 6
Credit: 4985448
RAC: 0

Message 68140 in response to message 68139

I've done this repeatedly. I receive what appears to be a new datafile with the same result on each occasion. I have over 12 PCs running E@H without any problems. I only have a problem on this PC. Why would this PC be the *only* recipient of a corrupt data file and *WHY* are corrupt data files even available for download???

It's clear that there is more to this than you suggest.

A couple of suggestions: 1) Provide meaningful error messages, each with an actionable solution. Don't provide a cryptic error message that only means something to the programmer who wrote the code. 2) Provide users with a real email address where bug reports can be sent with a reasonable expectation of a reply from a real technician or programmer.

Without a practicable, easily implemented solution to this forthcoming in the very near future, I'm afraid I'll be detaching my PC from this project, regrettable as that is. I simply can't afford either to process corrupt data files and/or work units, or to have the PC run for hours on end only to end with a failed result.

Quote:

6/21/2007 10:51:41 PM|Einstein@Home|Message from server: Resent lost result h1_0447.15_S5R2__126_S5R2c_1

You got the same datafile resent. So it's still corrupt and will still break down.
You could try a detach/reattach, or keep resetting until you get a new datafile.

