Well, if your videocard has those errors repeatedly, and it continues immediately after a reboot, there's a good chance it's a problem with the videocard (broken memory, broken capacitors, too much heat, etc.). I don't see any other option but to either replace that videocard, or test with another.
Don't have another one that would run GPU units. Don't see a heat issue, runs at 55C with or without doing the work.
I've been getting these errors for a while now [intermittently] on CPU tasks:
Mon Sep 12 20:29:14 2011 | Einstein@Home | Task h1_0315.05_S6GC1__167_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:14 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:15 2011 | Einstein@Home | Task h1_0315.05_S6GC1__72_S6BucketA_1 exited with zero status but no 'finished' file
Mon Sep 12 20:29:15 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:16 2011 | Einstein@Home | Task h1_0300.65_S6GC1__339_S6BucketA_2 exited with zero status but no 'finished' file
Mon Sep 12 20:29:16 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:17 2011 | Einstein@Home | Task h1_0315.05_S6GC1__35_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:17 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:18 2011 | Einstein@Home | Task h1_0315.05_S6GC1__45_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:18 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:19 2011 | Einstein@Home | Task h1_0315.05_S6GC1__71_S6BucketA_1 exited with zero status but no 'finished' file
Mon Sep 12 20:29:19 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:20 2011 | Einstein@Home | Task h1_0315.05_S6GC1__166_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:20 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:21 2011 | Einstein@Home | Task h1_0315.05_S6GC1__168_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:21 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
I've Started just A: Aborting as soon as I notice or B: letting them run.
The ones above went through at least twice before I aborted them [past deadline].
The first time I got these error messages I; set get no new tasks, let all tasks finish, reset the project, and rebooted. This was 6 weeks ago at least, been getting them off and on since.
Are there any hints in the stderr.txt file? For running tasks, that file is in the corresponding slots directory, for reported tasks in the online tasks details. You could also post a link to the task details page, since it's a little bit cumbersome to search your ten machines for such a file.
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
I've bee away from that computer for the last several days.
I have no GPU's.
I have multiple Stderr.txt (one for each active 'slot') files. They all have entries similar to this:
2011-09-25 21:29:11.5086 (83374) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
21:29:11 (83374): Can't acquire lockfile (-154) - waiting 35s
21:29:46 (83374): Can't acquire lockfile (-154) - exiting
2011-09-25 22:08:10.2882 (83741) [normal]: This program is published under the GNU General Public License, version 2
2011-09-25 22:08:10.2890 (83741) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-25 22:08:10.2890 (83741) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-25 22:08:10.2890 (83741) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
22:08:10 (83741): Can't acquire lockfile (-154) - waiting 35s
22:08:45 (83741): Can't acquire lockfile (-154) - exiting
2011-09-25 23:29:21.1113 (19744) [normal]: This program is published under the GNU General Public License, version 2
2011-09-25 23:29:21.1240 (19744) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-25 23:29:21.1240 (19744) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-25 23:29:21.1240 (19744) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
23:29:21 (19744): Can't acquire lockfile (-154) - waiting 35s
23:29:56 (19744): Can't acquire lockfile (-154) - exiting
2011-09-26 00:30:57.6972 (58081) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 00:30:57.6994 (58081) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 00:30:57.6994 (58081) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 00:30:57.6995 (58081) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
00:30:57 (58081): Can't acquire lockfile (-154) - waiting 35s
00:31:32 (58081): Can't acquire lockfile (-154) - exiting
2011-09-26 03:52:51.2470 (59921) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 03:52:51.2473 (59921) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 03:52:51.2473 (59921) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 03:52:51.2473 (59921) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
03:52:51 (59921): Can't acquire lockfile (-154) - waiting 35s
03:53:26 (59921): Can't acquire lockfile (-154) - exiting
2011-09-26 04:24:55.4955 (60250) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 04:24:55.4959 (60250) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 04:24:55.4959 (60250) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 04:24:55.4960 (60250) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
04:24:55 (60250): Can't acquire lockfile (-154) - waiting 35s
04:25:30 (60250): Can't acquire lockfile (-154) - exiting
2011-09-26 05:31:58.5673 (60899) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 05:31:58.5678 (60899) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 05:31:58.5678 (60899) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 05:31:58.5678 (60899) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
05:31:58 (60899): Can't acquire lockfile (-154) - waiting 35s
05:32:33 (60899): Can't acquire lockfile (-154) - exiting
2011-09-26 06:10:00.6603 (61252) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 06:10:00.6736 (61252) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 06:10:00.6736 (61252) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 06:10:00.6737 (61252) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
06:10:00 (61252): Can't acquire lockfile (-154) - waiting 35s
06:10:35 (61252): Can't acquire lockfile (-154) - exiting
2011-09-26 06:51:11.6103 (61675) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 06:51:11.6110 (61675) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 06:51:11.6110 (61675) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
I had THOUGHT that I was only experiencing this on one computer, but it now looks like ALL of my computers are showing this. It appears over half of my work units are not validating.
If I need to do ANYTHING else so that the community can look at these, and help find out out what is going on; PLEASE let me know. [in as much detail as possible, as I have not interacted much with the message boards.]
I am getting this message every time BOINC suspends my Test4Theory@home task to run other projects. Then it restarts it in due time, with PYTHIA jobs from CERN running in the BOINC_VM virtual machine. I am getting credits. so everything seems OK.
Tullio
The URL string must not contain blanks and you had started with the end-flag. :-)
Those "Can't acquire lockfile" messages from your previous post hint at problems with AV software. You should except the BOINC data directory from active scans to avoid those concurrent access problems.
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
RE: Well, if your videocard
)
Don't have another one that would run GPU units. Don't see a heat issue, runs at 55C with or without doing the work.
I've been getting these
)
I've been getting these errors for a while now [intermittently] on CPU tasks:
Mon Sep 12 20:29:14 2011 | Einstein@Home | Task h1_0315.05_S6GC1__167_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:14 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:15 2011 | Einstein@Home | Task h1_0315.05_S6GC1__72_S6BucketA_1 exited with zero status but no 'finished' file
Mon Sep 12 20:29:15 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:16 2011 | Einstein@Home | Task h1_0300.65_S6GC1__339_S6BucketA_2 exited with zero status but no 'finished' file
Mon Sep 12 20:29:16 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:17 2011 | Einstein@Home | Task h1_0315.05_S6GC1__35_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:17 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:18 2011 | Einstein@Home | Task h1_0315.05_S6GC1__45_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:18 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:19 2011 | Einstein@Home | Task h1_0315.05_S6GC1__71_S6BucketA_1 exited with zero status but no 'finished' file
Mon Sep 12 20:29:19 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:20 2011 | Einstein@Home | Task h1_0315.05_S6GC1__166_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:20 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
Mon Sep 12 20:29:21 2011 | Einstein@Home | Task h1_0315.05_S6GC1__168_S6BucketA_0 exited with zero status but no 'finished' file
Mon Sep 12 20:29:21 2011 | Einstein@Home | If this happens repeatedly you may need to reset the project.
I've Started just A: Aborting as soon as I notice or B: letting them run.
The ones above went through at least twice before I aborted them [past deadline].
The first time I got these error messages I; set get no new tasks, let all tasks finish, reset the project, and rebooted. This was 6 weeks ago at least, been getting them off and on since.
Russell
ANYONE know what's going on.
)
ANYONE know what's going on. I'm still getting these 'errors'
Russell
Are there any hints in the
)
Are there any hints in the stderr.txt file? For running tasks, that file is in the corresponding slots directory, for reported tasks in the online tasks details. You could also post a link to the task details page, since it's a little bit cumbersome to search your ten machines for such a file.
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
My own solution was to
)
My own solution was to suspend GPU, no more errors...
Russell McGaha doesn't have
)
Russell McGaha doesn't have any GPUs (recognised by BOINC) in any host. :-)
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)
I've bee away from that
)
I've bee away from that computer for the last several days.
I have no GPU's.
I have multiple Stderr.txt (one for each active 'slot') files. They all have entries similar to this:
2011-09-25 21:29:11.5086 (83374) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
21:29:11 (83374): Can't acquire lockfile (-154) - waiting 35s
21:29:46 (83374): Can't acquire lockfile (-154) - exiting
2011-09-25 22:08:10.2882 (83741) [normal]: This program is published under the GNU General Public License, version 2
2011-09-25 22:08:10.2890 (83741) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-25 22:08:10.2890 (83741) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-25 22:08:10.2890 (83741) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
22:08:10 (83741): Can't acquire lockfile (-154) - waiting 35s
22:08:45 (83741): Can't acquire lockfile (-154) - exiting
2011-09-25 23:29:21.1113 (19744) [normal]: This program is published under the GNU General Public License, version 2
2011-09-25 23:29:21.1240 (19744) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-25 23:29:21.1240 (19744) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-25 23:29:21.1240 (19744) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
23:29:21 (19744): Can't acquire lockfile (-154) - waiting 35s
23:29:56 (19744): Can't acquire lockfile (-154) - exiting
2011-09-26 00:30:57.6972 (58081) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 00:30:57.6994 (58081) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 00:30:57.6994 (58081) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 00:30:57.6995 (58081) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
00:30:57 (58081): Can't acquire lockfile (-154) - waiting 35s
00:31:32 (58081): Can't acquire lockfile (-154) - exiting
2011-09-26 03:52:51.2470 (59921) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 03:52:51.2473 (59921) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 03:52:51.2473 (59921) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 03:52:51.2473 (59921) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
03:52:51 (59921): Can't acquire lockfile (-154) - waiting 35s
03:53:26 (59921): Can't acquire lockfile (-154) - exiting
2011-09-26 04:24:55.4955 (60250) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 04:24:55.4959 (60250) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 04:24:55.4959 (60250) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 04:24:55.4960 (60250) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
04:24:55 (60250): Can't acquire lockfile (-154) - waiting 35s
04:25:30 (60250): Can't acquire lockfile (-154) - exiting
2011-09-26 05:31:58.5673 (60899) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 05:31:58.5678 (60899) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 05:31:58.5678 (60899) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 05:31:58.5678 (60899) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
05:31:58 (60899): Can't acquire lockfile (-154) - waiting 35s
05:32:33 (60899): Can't acquire lockfile (-154) - exiting
2011-09-26 06:10:00.6603 (61252) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 06:10:00.6736 (61252) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 06:10:00.6736 (61252) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 06:10:00.6737 (61252) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
06:10:00 (61252): Can't acquire lockfile (-154) - waiting 35s
06:10:35 (61252): Can't acquire lockfile (-154) - exiting
2011-09-26 06:51:11.6103 (61675) [normal]: This program is published under the GNU General Public License, version 2
2011-09-26 06:51:11.6110 (61675) [normal]: For details see http://einstein.phys.uwm.edu/license.php
2011-09-26 06:51:11.6110 (61675) [normal]: This Einstein@home App was built at: May 5 2011 14:43:42
2011-09-26 06:51:11.6110 (61675) [normal]: Start of BOINC application 'einstein_S6Bucket_1.01_i686-apple-darwin__SSE2'.
06:51:11 (61675): Can't acquire lockfile (-154) - waiting 35s
06:51:46 (61675): Can't acquire lockfile (-154) - exiting
What/Where else should I look?
Russell
I had THOUGHT that I was only
)
I had THOUGHT that I was only experiencing this on one computer, but it now looks like ALL of my computers are showing this. It appears over half of my work units are not validating.
I THINK this is the link to the computer that first showed it happening:
[ /url ] http://einsteinathome.org/host/3659195 [/url]
If I need to do ANYTHING else so that the community can look at these, and help find out out what is going on; PLEASE let me know. [in as much detail as possible, as I have not interacted much with the message boards.]
Russell
I am getting this message
)
I am getting this message every time BOINC suspends my Test4Theory@home task to run other projects. Then it restarts it in due time, with PYTHIA jobs from CERN running in the BOINC_VM virtual machine. I am getting credits. so everything seems OK.
Tullio
RE: I THINK this is the
)
The URL string must not contain blanks and you had started with the end-flag. :-)
Those "Can't acquire lockfile" messages from your previous post hint at problems with AV software. You should except the BOINC data directory from active scans to avoid those concurrent access problems.
Gruß,
Gundolf
Computer sind nicht alles im Leben. (Kleiner Scherz)