I hope it soon gets followed by another App that has some bug fixed I recently found in the BOINC library.
BM
OK, I just want to make sure I understand what to do. I should delete app_info.xml from the einstein.phys.uwm.edu folder, and then I should restart BOINC manager. Right?
OK, I just want to make sure I understand what to do. I should delete app_info.xml from the einstein.phys.uwm.edu folder, and then I should restart BOINC manager. Right?
This is right in principle. However with the current project setting the app will not be accepted without a signature, i.e. the client will terminate the tasks in progress with a client error and download the "official" 4.15 App again (not knowing that it is identical).
The convenient way without wasting to much CPU time is to do this when a new task has just started.
The clean way would be to set the project to "no new work", finish all the tasks in the cache, stop the client, remove the file, start the client again and "allow more work" for the project.
OK, I just want to make sure I understand what to do. I should delete app_info.xml from the einstein.phys.uwm.edu folder, and then I should restart BOINC manager. Right?
This is right in principle. However with the current project setting the app will not be accepted without a signature, i.e. the client will terminate the tasks in progress with a client error and download the "official" 4.15 App again (not knowing that it is identical).
The convenient way without wasting too much CPU time is to do this when a new task has just started.
The clean way would be to set the project to "no new work", finish all the tasks in the cache, stop the client, remove the file, start the client again and "allow more work" for the project.
13-Nov-07 1:06:37|Einstein@Home|Restarting task h1_0520.20_S5R2__70_S5R3a_1 using einstein_S5R3 version 415
13-Nov-07 1:07:12|Einstein@Home|[sched_op_debug] Deferring communication for 1 min 0 sec
13-Nov-07 1:07:12|Einstein@Home|[sched_op_debug] Reason: Unrecoverable error for result h1_0520.20_S5R2__70_S5R3a_1 (The environment is incorrect. (0xa) - exit code 10 (0xa))
13-Nov-07 1:07:12|Einstein@Home|Computation for task h1_0520.20_S5R2__70_S5R3a_1 finished
13-Nov-07 1:07:12|Einstein@Home|Output file h1_0520.20_S5R2__70_S5R3a_1_0 for task h1_0520.20_S5R2__70_S5R3a_1 absent
Interesting detail in the stderr.txt:
2007-11-13 01:07:06.6875 [CRITICAL]: Checksum error: -262144
2007-11-13 01:07:06.6875 [CRITICAL]: Could not resume from checkpoint (-2)
2007-11-13 01:07:06.6875 [CRITICAL]: ERROR: MAIN() returned with error '10'
zip warning: name not matched: h1_0520.20_S5R2__70_S5R3a_1_0
zip error: Nothing to do! (../../projects/einstein.phys.uwm.edu/h1_0520.20_S5R2__70_S5R3a_1_0)
2007-11-13 01:07:06.7500 [normal]: WARNING: Can't zip output file 'h1_0520.20_S5R2__70_S5R3a_1_0'
I only just noticed that the earlier one also crashed with that error.
Will try a reset, see what that does. Perhaps that my datafile is corrupt?
I'll also check the disk BOINC runs from.
2007-11-13 01:07:06.6875 [CRITICAL]: Checksum error: -262144
2007-11-13 01:07:06.6875 [CRITICAL]: Could not resume from checkpoint (-2)
2007-11-13 01:07:06.6875 [CRITICAL]: ERROR: MAIN() returned with error '10'
zip warning: name not matched: h1_0520.20_S5R2__70_S5R3a_1_0
zip error: Nothing to do! (../../projects/einstein.phys.uwm.edu/h1_0520.20_S5R2__70_S5R3a_1_0)
2007-11-13 01:07:06.7500 [normal]: WARNING: Can't zip output file 'h1_0520.20_S5R2__70_S5R3a_1_0'
I only just noticed that the earlier one also crashed with that error.
Will try a reset, see what that does. Perhaps that my datafile is corrupt?
I'll also check the disk BOINC runs from.
It looks like a disk error. It's not a corrupt data file, it's actually the checkpoint file that's broken. The zip messages are misleading, they have nothing to do with the actual error.
I just checked the disk, nothing wrong there. But something was wrong with the driver for my Promise card, so I uninstalled the old one and went back to a prior version. The computer seems nippier, fetching data quicker.
In the mean time I detached though, will run DIRMS on the drive and reattach tomorrow.
Not much news. I have been busy hunting Bugs that lead to more serious client errors, and for some reason I don't fully understand yet the SSE code that works fine in the MacOS Intel App give wrong results when compiled with a diferent compiler (e.g. gcc on Linux).
I'm still working on this, but the next two weeks will be very busy for basically the whole project team (for resons / deadlines unrelated to this project), I don't know how much progress will be made during these.
RE: Strange how those
)
This board doesn't even come close to matching the thread drift I've seen elsewhere.
You can type Latin on any
)
You can type Latin on any keyboard. Like English, it has no diacritical marks.
Tullio
RE: I published the 4.15
)
OK, I just want to make sure I understand what to do. I should delete app_info.xml from the einstein.phys.uwm.edu folder, and then I should restart BOINC manager. Right?
Thanks.
RE: OK, I just want to make
)
This is right in principle. However with the current project setting the app will not be accepted without a signature, i.e. the client will terminate the tasks in progress with a client error and download the "official" 4.15 App again (not knowing that it is identical).
The convenient way without wasting to much CPU time is to do this when a new task has just started.
The clean way would be to set the project to "no new work", finish all the tasks in the cache, stop the client, remove the file, start the client again and "allow more work" for the project.
BM
BM
RE: RE: OK, I just want
)
Thanks, Bernd. Will do.
Hmm, my task just crashed on
)
Hmm, my task just crashed on a restart:
13-Nov-07 1:06:37|Einstein@Home|Restarting task h1_0520.20_S5R2__70_S5R3a_1 using einstein_S5R3 version 415
13-Nov-07 1:07:12|Einstein@Home|[sched_op_debug] Deferring communication for 1 min 0 sec
13-Nov-07 1:07:12|Einstein@Home|[sched_op_debug] Reason: Unrecoverable error for result h1_0520.20_S5R2__70_S5R3a_1 (The environment is incorrect. (0xa) - exit code 10 (0xa))
13-Nov-07 1:07:12|Einstein@Home|Computation for task h1_0520.20_S5R2__70_S5R3a_1 finished
13-Nov-07 1:07:12|Einstein@Home|Output file h1_0520.20_S5R2__70_S5R3a_1_0 for task h1_0520.20_S5R2__70_S5R3a_1 absent
Interesting detail in the stderr.txt:
zip error: Nothing to do! (../../projects/einstein.phys.uwm.edu/h1_0520.20_S5R2__70_S5R3a_1_0)
2007-11-13 01:07:06.7500 [normal]: WARNING: Can't zip output file 'h1_0520.20_S5R2__70_S5R3a_1_0'
I only just noticed that the earlier one also crashed with that error.
Will try a reset, see what that does. Perhaps that my datafile is corrupt?
I'll also check the disk BOINC runs from.
RE: Interesting detail in
)
It looks like a disk error. It's not a corrupt data file, it's actually the checkpoint file that's broken. The zip messages are misleading, they have nothing to do with the actual error.
BM
BM
RE: It looks like a disk
)
I just checked the disk, nothing wrong there. But something was wrong with the driver for my Promise card, so I uninstalled the old one and went back to a prior version. The computer seems nippier, fetching data quicker.
In the mean time I detached though, will run DIRMS on the drive and reattach tomorrow.
Hm, anything new about
)
Hm, anything new about optimized Apps that were announced some time ago? Like using SSE or something?
S5R3 seems to be very long, and faster Apps would help a lot....
Not much news. I have been
)
Not much news. I have been busy hunting Bugs that lead to more serious client errors, and for some reason I don't fully understand yet the SSE code that works fine in the MacOS Intel App give wrong results when compiled with a diferent compiler (e.g. gcc on Linux).
I'm still working on this, but the next two weeks will be very busy for basically the whole project team (for resons / deadlines unrelated to this project), I don't know how much progress will be made during these.
BM
BM