December 5, 2005
We are experiencing heavy traffic on our data server. This is preventing some result uploads/workunit downloads. We are working on the problem. More in Technical News.
A snippet of the News: Meanwhile, we are still dropping connections on the upload server. But the good news is that we are successfully handling about 4 result uploads for every workunit download, which means the upload server is indeed catching up.
We're getting about 35 results a second and sending out about 8 workunits a second at the time of writing.
So 35 results at 10.5KB/result = 367.5KB + (8 times 354KB = 2,832KB) = 3,199.5KB per second... (that's 25.596 Mbit on the connection)
You call that down? I guess you're not satisfied easily. ;)
Sorry Jord. It was a poor choice of words on my part. I don't run SETI but I know enough from reading the BOINC boards that the servers are getting hammered and are having a hard time keeping up. I'll choose my words more carefully in the future.
I KNEW I should have just copied them all to a text file... but I didn't. Sorry. At least from my mistake I can proudly say "I am experienced!" ;0)
Jim,
You haven't lost any of that stuff. You will find it all in files in your BOINC folder. The files are stdout*.txt and stderr*.txt where * means either gui or dae, ie GUI or Daemon messages respectively. There are 4 files altogether.
However, a lot has happened in the last 24 hours that has made me aware of why you are not getting EAH work. It is due to those pesky Seti server overload problems and the fact that you probably have work that is stuck trying to download from Seti. I think your original comment was about not getting EAH work and your machine being idle.
Here's what you need to do. Start up the BOINC Manager and make sure you have an internet connection established. Go to your projects tab and click on the seti project to select it. On the left, various controls will become active and you need to "Suspend" seti by hitting the appropriate control. With seti suspended, you should get new EAH work downloading pretty soon thereafter. If nothing starts in a couple of minutes, select the EAH project and "Update" it. In each case you can watch the action in the messages window.
You will need to leave Seti "suspended" for the duration of their server problems. If you see any announcement on the front page of the Seti website to the effect that they have resolved their server issues, you could try "unsuspend"ing Seti to see if your stuck transfers get cleared. If they don't and it seems to do the endless retries thing, just "suspend" seti again for a while and try again later.
Please let us know if "suspend"ing Seti gets your problems sorted.
Quote:
Quote:
Please don't take this as some sort of inquisition - that's certainly not my intention. I'd just like to find out what is causing your problems when you shouldn't really be having any. Not from EAH anyway.
No offense taken - I'm very easy going, don't get offended easily and have LOTS of patience. That's why I don't take married life seriously. Oops! That remark also got me some sore ribs from my wife's elbow! :0|
I copied & printed the details you stated from another post that a user should provide to assist with any problems. I'll keep that in mind.
Thanks Gary.
Jim
Thanks for being so good about this. When I read back over all that I'd written I was concerned that you might have taken it the wrong way. I was struggling a bit to properly understand your problem :).
You call that down? I guess you're not satisfied easily. ;)
I don't know about Kathryn, but since _I_ still have the same 3 results sitting here that I had three days ago, as far as _my_ machines are concerned, SETI is down! :-P
The technical answer is the servers aren't "down", they're just overloaded. But that means that while some people can eventually get through, for others who can't, the "total system viewed at once" is down.
You call that down? I guess you're not satisfied easily. ;)
I don't know about Kathryn, but since _I_ still have the same 3 results sitting here that I had three days ago, as far as _my_ machines are concerned, SETI is down! :-P
The technical answer is the servers aren't "down", they're just overloaded. But that means that while some people can eventually get through, for others who can't, the "total system viewed at once" is down.
I no longer receive work for Einstein or Rosetta. Is this part of the same problem? In fact if it weren't for Predictor my machine would be idle since all my SETI data is backed up waiting to upload and new stuff won't download.
Colin
When did you stall out. I have download new WUs as recently as 21:57 utc.
It looks like you like you stall at 17:55 utc. I have pulled 9 new WUs since then. Anything in your meesages?
Colin
When did you stall out. I have download new WUs as recently as 21:57 utc.
It looks like you like you stall at 17:55 utc. I have pulled 9 new WUs since then. Anything in your meesages?
I stalled yesterday. Messages include: "Cannot connect to hostname [setiboincdata.ssl.berkely.edu" and "Temporarily failed upload (or download) of ......: system I/O". Predictor is running fine however. I just attached it a short time ago. I tried resetting projects. No help. I have 3 SETI's queued for upload and one for download. Nothing from Rosetta or Einstein.
Colin
Sorry, I don't know anything about multi-project configuration. Gary has posted a recommendation to suspend Seti for the time being. I suggest you look at his thread pinned to the top of this forum.
Gary has posted a recommendation to suspend Seti for the time being.
Colin,
That's exactly what you need to do. Suspend Seti (on the projects tab) and then the other projects should get work. Tell us how you get on. Read the whole thread that Mark referred to if you want the gory details.
This almost never helps, except in certain very limited cases (corrupt application, entire string of corrupt WUs issued by project, no work but extreme LTD...) but does destroy any work you have on hand, and adds to the load on the servers, as they have to resend you all the application files, etc.
Not to pick on you, as I see "oh, just reset the project" advised all the time. But PLEASE DON'T DO THAT, not until _EVERY_ other option has been tried, and failed. If I were put in charge of redesigning BOINC Manager, the first thing I'd do is make "reset" and "detach" much harder to get to...
Hi Kathryn, Thanks for
)
Hi Kathryn,
Thanks for that, I will do as you suggest
Lynette
RE: RE: Lynette. The
)
Sorry Jord. It was a poor choice of words on my part. I don't run SETI but I know enough from reading the BOINC boards that the servers are getting hammered and are having a hard time keeping up. I'll choose my words more carefully in the future.
kathryn
Kathryn :o)
Einstein@Home Moderator
RE: I KNEW I should have
)
Jim,
You haven't lost any of that stuff. You will find it all in files in your BOINC folder. The files are stdout*.txt and stderr*.txt where * means either gui or dae, ie GUI or Daemon messages respectively. There are 4 files altogether.
However, a lot has happened in the last 24 hours that has made me aware of why you are not getting EAH work. It is due to those pesky Seti server overload problems and the fact that you probably have work that is stuck trying to download from Seti. I think your original comment was about not getting EAH work and your machine being idle.
Here's what you need to do. Start up the BOINC Manager and make sure you have an internet connection established. Go to your projects tab and click on the seti project to select it. On the left, various controls will become active and you need to "Suspend" seti by hitting the appropriate control. With seti suspended, you should get new EAH work downloading pretty soon thereafter. If nothing starts in a couple of minutes, select the EAH project and "Update" it. In each case you can watch the action in the messages window.
You will need to leave Seti "suspended" for the duration of their server problems. If you see any announcement on the front page of the Seti website to the effect that they have resolved their server issues, you could try "unsuspend"ing Seti to see if your stuck transfers get cleared. If they don't and it seems to do the endless retries thing, just "suspend" seti again for a while and try again later.
Please let us know if "suspend"ing Seti gets your problems sorted.
Thanks for being so good about this. When I read back over all that I'd written I was concerned that you might have taken it the wrong way. I was struggling a bit to properly understand your problem :).
Good luck and let us know how you get on.
Cheers,
Gary.
RE: You call that down? I
)
I don't know about Kathryn, but since _I_ still have the same 3 results sitting here that I had three days ago, as far as _my_ machines are concerned, SETI is down! :-P
The technical answer is the servers aren't "down", they're just overloaded. But that means that while some people can eventually get through, for others who can't, the "total system viewed at once" is down.
RE: RE: You call that
)
I no longer receive work for Einstein or Rosetta. Is this part of the same problem? In fact if it weren't for Predictor my machine would be idle since all my SETI data is backed up waiting to upload and new stuff won't download.
Colin Jameson ---
Colin When did you stall out.
)
Colin
When did you stall out. I have download new WUs as recently as 21:57 utc.
It looks like you like you stall at 17:55 utc. I have pulled 9 new WUs since then. Anything in your meesages?
RE: Colin When did you
)
I stalled yesterday. Messages include: "Cannot connect to hostname [setiboincdata.ssl.berkely.edu" and "Temporarily failed upload (or download) of ......: system I/O". Predictor is running fine however. I just attached it a short time ago. I tried resetting projects. No help. I have 3 SETI's queued for upload and one for download. Nothing from Rosetta or Einstein.
Colin Sorry, I don't know
)
Colin
Sorry, I don't know anything about multi-project configuration. Gary has posted a recommendation to suspend Seti for the time being. I suggest you look at his thread pinned to the top of this forum.
RE: Gary has posted a
)
Colin,
That's exactly what you need to do. Suspend Seti (on the projects tab) and then the other projects should get work. Tell us how you get on. Read the whole thread that Mark referred to if you want the gory details.
Cheers,
Gary.
RE: I tried resetting
)
This almost never helps, except in certain very limited cases (corrupt application, entire string of corrupt WUs issued by project, no work but extreme LTD...) but does destroy any work you have on hand, and adds to the load on the servers, as they have to resend you all the application files, etc.
Not to pick on you, as I see "oh, just reset the project" advised all the time. But PLEASE DON'T DO THAT, not until _EVERY_ other option has been tried, and failed. If I were put in charge of redesigning BOINC Manager, the first thing I'd do is make "reset" and "detach" much harder to get to...