I recently started running "albert" and see the following message in the result stderr. The WU seems to complete successfully, and I was just wondering if anyone else has seen this, what it means, and whether it is a bug/problem.
Thanks
Rand
Copyright © 2024 Einstein@Home. All rights reserved.
MacOS Error -43 occurred in Mac_Lib.c line 64
Checked my first results from "Albert" today and found following text:
5.2.5
2005-12-30 05:36:00.1144 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
2005-12-30 05:36:00.1686 [normal]: Started search at lalDebugLevel = 0
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 05:36:01.4232 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2005-12-30 05:36:01.4237 [normal]: No usable checkpoint found, starting from beginning.
Detected CPU type 1
2005-12-30 05:43:04.4782 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.
2005-12-30 10:14:12.7517 [normal]: Search finished successfully.
I checked the Apple Dev site and believe it's a "file not found" error, and indeed the checkpoint file is not found. I don't think it's serious yet.
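For anyone who wants to check their own stderr output, here is a minimal sketch that pulls the error code out of such a line and looks it up. The table is a small, partial selection of classic Mac OS File Manager result codes (-43 is fnfErr, "file not found"); the helper name `describe_error_line` and the regular expression are my own assumptions for illustration and are not part of the albert application.

```python
import re

# Partial table of classic Mac OS File Manager result codes (not exhaustive).
MAC_OS_ERRORS = {
    -35: ("nsvErr", "no such volume"),
    -36: ("ioErr", "I/O error"),
    -37: ("bdNamErr", "bad file name"),
    -39: ("eofErr", "end of file"),
    -43: ("fnfErr", "file not found"),
    -120: ("dirNFErr", "directory not found"),
}

# "occur+ed" accepts both spellings seen in this thread: "occurred" and "occured".
ERROR_LINE = re.compile(r"MacOS Error (-?\d+) occur+ed in (\S+) line (\d+)")


def describe_error_line(line):
    """Hypothetical helper: describe a 'MacOS Error' stderr line, or return None."""
    match = ERROR_LINE.search(line)
    if match is None:
        return None
    code = int(match.group(1))
    name, meaning = MAC_OS_ERRORS.get(code, ("unknown", "not in this table"))
    return f"{code} ({name}: {meaning}) at {match.group(2)}:{match.group(3)}"


if __name__ == "__main__":
    print(describe_error_line("MacOS Error -43 occured in Mac_Lib.c line 65"))
```

Note that this only tells you what the code means in general. It does not tell you which file the application failed to find.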
Developers?
John,
Windows "Albert" also shows the "No usable checkpoint found, starting from beginning." message. I think it's just normal operation, occurring only the first time Albert tries to write a checkpoint for a new WU.
Thanks for the info. I think the maximum number of work units per day may need to be increased because of the shortened time to process an albert work unit. At the moment, I'm not processing any because I've hit the daily limit :-(.
Rand
These errors are also reported in my results.
I checked the last 6 completed results and see that
* 2 results had the errors 1 time
* 2 results had the errors 2 times
* 1 result had the errors 4 times! (This error output included below.)
All the results completed eventually and are reported valid.
It sounds potentially inefficient to restart the calculations,
either from the beginning or from a checkpoint, so many times.
Thanks,
Pat McClure
Here's the reported stderr output from the result that had the errors
(and had to restart) 4 times.
[pre]
5.2.8
2005-12-30 04:29:27.0468 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 04:29:27.0924 [normal]: Started search at lalDebugLevel = 0
2005-12-30 04:29:28.6781 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2005-12-30 04:29:28.6786 [normal]: No usable checkpoint found, starting from beginning.
Detected CPU type 1
2005-12-30 04:35:27.6554 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.
2005-12-30 05:36:41.2876 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
2005-12-30 05:36:41.4454 [normal]: Started search at lalDebugLevel = 0
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 05:36:43.6664 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2005-12-30 05:36:43.6780 [normal]: Trying to read Fstat-file into toplist ...
2005-12-30 05:36:51.4962 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2005-12-30 05:36:51.4965 [normal]: Resuming computation at (23887/115500604/2320471).
Detected CPU type 1
2005-12-30 06:42:31.4971 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
2005-12-30 06:42:31.6112 [normal]: Started search at lalDebugLevel = 0
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 06:42:33.9748 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2005-12-30 06:42:33.9828 [normal]: Trying to read Fstat-file into toplist ...
2005-12-30 06:42:43.7260 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2005-12-30 06:42:43.7277 [normal]: Resuming computation at (46886/138398348/2779873).
Detected CPU type 1
2005-12-30 09:05:33.5502 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
2005-12-30 09:05:33.6642 [normal]: Started search at lalDebugLevel = 0
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 09:05:36.3465 [normal]: Found checkpoint-file 'Fstat.out.ckp'
2005-12-30 09:05:36.3685 [normal]: Trying to read Fstat-file into toplist ...
2005-12-30 09:05:48.3122 [normal]: Checksum Ok. Successfully read_toplist_from_fp()
2005-12-30 09:05:48.3125 [normal]: Resuming computation at (78850/156010777/3132175).
Detected CPU type 1
2005-12-30 09:16:12.4674 [normal]: Search finished successfully.
[/pre]
Other than the first instance, where it couldn't find the checkpoint file and started from the beginning, every later start found the checkpoint file and resumed the computation. In those cases the error must refer to some other file it couldn't find. It doesn't look like it restarted from the beginning each time.
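To check this on a longer stderr output, a rough sketch like the following splits the text at each "Start of BOINC application" line and records, per start, how many MacOS Error lines appeared and whether the search started from the beginning or resumed from a checkpoint. The function name `summarize_runs` is hypothetical, and the matching relies only on the message strings visible in the output posted above; other application versions may word them differently.

```python
def summarize_runs(stderr_text):
    """Hypothetical helper: classify each application start in a stderr dump."""
    runs = []
    for line in stderr_text.splitlines():
        if "Start of BOINC application" in line:
            runs.append({"errors": 0, "mode": "unknown"})
        if not runs:
            continue  # skip anything before the first start, e.g. the version line
        current = runs[-1]
        if "MacOS Error" in line:
            current["errors"] += 1
        elif "No usable checkpoint found" in line:
            current["mode"] = "from beginning"
        elif "Resuming computation at" in line:
            current["mode"] = "resumed"
    return runs


# Abbreviated excerpt of the output posted above (first two starts only).
SAMPLE = """5.2.8
2005-12-30 04:29:27.0468 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 04:29:28.6786 [normal]: No usable checkpoint found, starting from beginning.
2005-12-30 05:36:41.2876 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-30 05:36:51.4965 [normal]: Resuming computation at (23887/115500604/2320471).
"""

if __name__ == "__main__":
    for number, run in enumerate(summarize_runs(SAMPLE), start=1):
        print(number, run["mode"], run["errors"])
```

In the excerpt, the error lines show up both on the start from the beginning and on the resumed start, which fits the reading that they are not tied to the checkpoint file alone.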
Rand
The MacOS Error reported in the stderr.log isn't something to worry about, and it doesn't affect the computation at all. It's just that some machines (depending on OS version and installed system updates) seem to react with unexpected errors when the application tries to set resources (like the application icon). I'm trying to fix this, but it doesn't have a high priority right now.
No worries.
BM
There is something that worries me.
http://einsteinathome.org/host/499147/tasks
All but two WUs are invalid. All have the same stderr output:
5.2.13
2005-12-29 17:37:42.0786 [normal]: Start of BOINC application 'albert_4.39_powerpc-apple-darwin'.
MacOS Error -43 occured in Mac_Lib.c line 65
MacOS Error -43 occured in Mac_Lib.c line 65
2005-12-29 17:37:42.1118 [normal]: Started search at lalDebugLevel = 0
2005-12-29 17:37:42.6780 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2005-12-29 17:37:42.6784 [normal]: No usable checkpoint found, starting from beginning.
Detected CPU type 1
2005-12-29 17:41:10.3493 [normal]: Fstat file reached MaxFileSizeKB ==> compactifying ... done.
2005-12-29 19:25:15.9191 [normal]: Search finished successfully.
Why?
thanks
ch