What in the Hell is going on?

Dr. Ronald C. Spencer
Dr. Ronald C. S...
Joined: 20 Feb 05
Posts: 18
Credit: 831130
RAC: 0
Topic 189350

I finished another data packet you sent me. Then it was uploaded. Einstein@HOME then sent me 3 packets which were empty and 3 that said I have a claimed credit of 0.01 Now that was bright. Now it says I can't get more work for 21 hours,25 minutes and 43 seconds. If that isn't a royal flaw in the system , I don't know what is. I would like to continue to help you for years but I can't help you if you send me repeated data packets with nothing in them and then because of that I can't get viable work to do for almost 24 hours.

Blank Reg
Blank Reg
Joined: 18 Jan 05
Posts: 228
Credit: 40599
RAC: 0

What in the Hell is going on?

What version of Boinc do you have.

alex
alex
Joined: 18 Jan 05
Posts: 33
Credit: 7515
RAC: 0

RE: What version of Boinc

Message 12516 in response to message 12515

Quote:
What version of Boinc do you have.

He's got 4.45 since June 12.
http://einsteinathome.org/host/154257/tasks

Looks like the last set of work units had errors in them.

What's more interesting is the Unhandled Exception error that preceeded the string of 'zero' work units.
http://einsteinathome.org/task/5395603

Could be a Boinc issue.

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5874
Credit: 118405965297
RAC: 25774352

RE: I finished another data

Quote:
I finished another data packet you sent me. Then it was uploaded. Einstein@HOME then sent me 3 packets which were empty and 3 that said I have a claimed credit of 0.01 Now that was bright. Now it says I can't get more work for 21 hours,25 minutes and 43 seconds. If that isn't a royal flaw in the system , I don't know what is. I would like to continue to help you for years but I can't help you if you send me repeated data packets with nothing in them and then because of that I can't get viable work to do for almost 24 hours.

Actually, you appear to be misunderstanding what is going on. The data file you are working on is very large and you have been returning successful results from it for some time. It has a name starting H1_1013.0xxxxxx and your last successful result has a work unit ID of 1298116. When you receive new work you actually get small packets of data which tell the science app how to "slice up" the large data file for further calculations. In your case the last successful result is a bit odd because it suddenly took a much larger amount of time to complete compared with the previous results. However the result was validated so all must have been OK.

From that point on you have received 8 further "work units" or sets of instructions on how to process H1_1013.0xxxxxxx. These 8 were completed very rapidly, 5 taking 0 seconds and 3 taking 7 seconds. The claim of 0.01 is simply because the time was only 7 seconds. Obviously something has gone very wrong with the processing of the large data file even though the science app thinks all is OK because of the zero exit status. Why this is happening is a mystery but it is most likely that something has gone drastically wrong on your computer. You have to look in the log files on your computer to see if there is any output from BOINC or the science app to indicate the problem. Has your machine crashed at all lately? Do you leave it running 24/7? How often do you have to reboot? I notice you run WinMe. That in itself is a bit of a cause for concern as Me is not renowned for stability.

Do you actually have 4 computers as listed? If you only have one, you should get rid of the redundant instances by using the "merge" function on the website or simply deleting the old IDs. If you do only have one, is there any particular reason why you keep needing to get a new ID? It may have something to do with your current problem.

From what your successful results are claiming, you would appear to be running BOINC 4.19. Although old, that version is fine and shouldn't be the cause of your problems. You'll have to give a lot more information about how you run things and what messages are showing up in the BOINC window if you want others to help with the problem. Please realise that E@H is simply protecting itself by limiting you to 8 work units per day. If a work unit is trashed every few seconds, imagine how much could be wasted in a day if there wasn't some limit.

Please try to gather as much information as possible about what is happening on your computer and then give that detail here. Good luck!!

Edit: I've just seen Alex's post where he points out that the version of BOINC is visible in the stderr.txt output reported on the Result ID page. Didn't realise that so I've learnt something new, thanks Alex. It appears that you have just recently upgraded from 4.19 to 4.45. That puts a whole new perspective on things. 4.45 is quite new and, with the recent history of very rapid version changes, quite likely to have unforseen bugs. If I were you, I'd consider going back to 4.19 (which is very easy to do) until 4.45 has had a bit more time to mature.

Cheers,
Gary.

alex
alex
Joined: 18 Jan 05
Posts: 33
Credit: 7515
RAC: 0

If it's a boinc thing, the

If it's a boinc thing, the programmers may be able to fix this if they were to get some details from the DrWatson logs.

Perhaps he could search his pc for the drwatson log
Clicking Start/Run drwtsn32 will give a popup which shows where drwatson leaves it's logs. (it's probably in the windows folder, C drive root, or the user data folder)
The DR watson logs can be pretty big, so cutting and pasting the top 6 or 7 pages of info to a developer may help.

Dr. Ronald C. Spencer
Dr. Ronald C. S...
Joined: 20 Feb 05
Posts: 18
Credit: 831130
RAC: 0

RE: RE: I finished

Message 12519 in response to message 12517

Quote:
Quote:
I finished another data packet you sent me. Then it was uploaded. Einstein@HOME then sent me 3 packets which were empty and 3 that said I have a claimed credit of 0.01 Now that was bright. Now it says I can't get more work for 21 hours,25 minutes and 43 seconds. If that isn't a royal flaw in the system , I don't know what is. I would like to continue to help you for years but I can't help you if you send me repeated data packets with nothing in them and then because of that I can't get viable work to do for almost 24 hours.

Actually, you appear to be misunderstanding what is going on. The data file you are working on is very large and you have been returning successful results from it for some time. It has a name starting H1_1013.0xxxxxx and your last successful result has a work unit ID of 1298116. When you receive new work you actually get small packets of data which tell the science app how to "slice up" the large data file for further calculations. In your case the last successful result is a bit odd because it suddenly took a much larger amount of time to complete compared with the previous results. However the result was validated so all must have been OK.

From that point on you have received 8 further "work units" or sets of instructions on how to process H1_1013.0xxxxxxx. These 8 were completed very rapidly, 5 taking 0 seconds and 3 taking 7 seconds. The claim of 0.01 is simply because the time was only 7 seconds. Obviously something has gone very wrong with the processing of the large data file even though the science app thinks all is OK because of the zero exit status. Why this is happening is a mystery but it is most likely that something has gone drastically wrong on your computer. You have to look in the log files on your computer to see if there is any output from BOINC or the science app to indicate the problem. Has your machine crashed at all lately? Do you leave it running 24/7? How often do you have to reboot? I notice you run WinMe. That in itself is a bit of a cause for concern as Me is not renowned for stability.

Do you actually have 4 computers as listed? If you only have one, you should get rid of the redundant instances by using the "merge" function on the website or simply deleting the old IDs. If you do only have one, is there any particular reason why you keep needing to get a new ID? It may have something to do with your current problem.

From what your successful results are claiming, you would appear to be running BOINC 4.19. Although old, that version is fine and shouldn't be the cause of your problems. You'll have to give a lot more information about how you run things and what messages are showing up in the BOINC window if you want others to help with the problem. Please realise that E@H is simply protecting itself by limiting you to 8 work units per day. If a work unit is trashed every few seconds, imagine how much could be wasted in a day if there wasn't some limit.

Please try to gather as much information as possible about what is happening on your computer and then give that detail here. Good luck!!

Edit: I've just seen Alex's post where he points out that the version of BOINC is visible in the stderr.txt output reported on the Result ID page. Didn't realise that so I've learnt something new, thanks Alex. It appears that you have just recently upgraded from 4.19 to 4.45. That puts a whole new perspective on things. 4.45 is quite new and, with the recent history of very rapid version changes, quite likely to have unforseen bugs. If I were you, I'd consider going back to 4.19 (which is very easy to do) until 4.45 has had a bit more time to mature.


Dr. Ronald C. Spencer
Dr. Ronald C. S...
Joined: 20 Feb 05
Posts: 18
Credit: 831130
RAC: 0

RE: RE: RE: I finished

Message 12520 in response to message 12519

Quote:
Quote:
Quote:
I finished another data packet you sent me. Then it was uploaded. Einstein@HOME then sent me 3 packets which were empty and 3 that said I have a claimed credit of 0.01 Now that was bright. Now it says I can't get more work for 21 hours,25 minutes and 43 seconds. If that isn't a royal flaw in the system , I don't know what is. I would like to continue to help you for years but I can't help you if you send me repeated data packets with nothing in them and then because of that I can't get viable work to do for almost 24 hours.

Actually, you appear to be misunderstanding what is going on. The data file you are working on is very large and you have been returning successful results from it for some time. It has a name starting H1_1013.0xxxxxx and your last successful result has a work unit ID of 1298116. When you receive new work you actually get small packets of data which tell the science app how to "slice up" the large data file for further calculations. In your case the last successful result is a bit odd because it suddenly took a much larger amount of time to complete compared with the previous results. However the result was validated so all must have been OK.

From that point on you have received 8 further "work units" or sets of instructions on how to process H1_1013.0xxxxxxx. These 8 were completed very rapidly, 5 taking 0 seconds and 3 taking 7 seconds. The claim of 0.01 is simply because the time was only 7 seconds. Obviously something has gone very wrong with the processing of the large data file even though the science app thinks all is OK because of the zero exit status. Why this is happening is a mystery but it is most likely that something has gone drastically wrong on your computer. You have to look in the log files on your computer to see if there is any output from BOINC or the science app to indicate the problem. Has your machine crashed at all lately? Do you leave it running 24/7? How often do you have to reboot? I notice you run WinMe. That in itself is a bit of a cause for concern as Me is not renowned for stability.

Do you actually have 4 computers as listed? If you only have one, you should get rid of the redundant instances by using the "merge" function on the website or simply deleting the old IDs. If you do only have one, is there any particular reason why you keep needing to get a new ID? It may have something to do with your current problem.

From what your successful results are claiming, you would appear to be running BOINC 4.19. Although old, that version is fine and shouldn't be the cause of your problems. You'll have to give a lot more information about how you run things and what messages are showing up in the BOINC window if you want others to help with the problem. Please realise that E@H is simply protecting itself by limiting you to 8 work units per day. If a work unit is trashed every few seconds, imagine how much could be wasted in a day if there wasn't some limit.

Please try to gather as much information as possible about what is happening on your computer and then give that detail here. Good luck!!

Edit: I've just seen Alex's post where he points out that the version of BOINC is visible in the stderr.txt output reported on the Result ID page. Didn't realise that so I've learnt something new, thanks Alex. It appears that you have just recently upgraded from 4.19 to 4.45. That puts a whole new perspective on things. 4.45 is quite new and, with the recent history of very rapid version changes, quite likely to have unforseen bugs. If I were you, I'd consider going back to 4.19 (which is very easy to do) until 4.45 has had a bit more time to mature.



Dr. Ronald C. Spencer
Dr. Ronald C. S...
Joined: 20 Feb 05
Posts: 18
Credit: 831130
RAC: 0

Thanks Gary & Alex. I am

Thanks Gary & Alex. I am going back to th eolder version I had which was working fine. Thanks for your help and advice. Much appreciated. Ron

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.