Downtime Tuesday July 9

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250354652
RAC: 35916
Topic 231271

Next Tuesday, July 9, we will stop the project for migrating our DBs to new server hardware. The downtime will start at 8 AM UTC and shouldn't last longer than a few hours, certainly not extend the business day (4 PM UTC).

BM

GWGeorge007
GWGeorge007
Joined: 8 Jan 18
Posts: 3060
Credit: 4960994353
RAC: 1382875

 Bernd,Thanks for the

 

Bernd,

Thanks for the heads up!  We'll be prepared for your down time.

 

George

Proud member of the Old Farts Association

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250354652
RAC: 35916

We're back!

We're back!

BM

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4963
Credit: 18696120051
RAC: 6195352

Yes, the project is back but

Yes, the project is back but is NOT working correctly now.

All hosts are receiving an error message about missing scheduler URL's in the master_einstein.phys.uwm.edu.xml file.

Computer: Numbskull

8147    Einstein@Home    Jul 09, 2024, 02:57:03 PM    [error] No scheduler URLs found in master file

 

Computer: Numbskull

8149    Einstein@Home    Jul 09, 2024, 02:57:03 PM    [sched_op] Reason: 727 consecutive failures fetching scheduler list
 

 

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 984
Credit: 25171376
RAC: 50

Thanks. Looking into it...

Thanks. Looking into it...

Einstein@Home Project

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4963
Credit: 18696120051
RAC: 6195352

Thanks the current master.xml

Thanks the current master.xml file being sent out is still the maintenance outage notice.

Site off-line

Einstein@Home is currently under maintenance. We should be back shortly. Thank you for your patience.

Copyright © 2024 Einstein@Home. All rights reserved.

 

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 984
Credit: 25171376
RAC: 50

Yep, I noticed that but I

Yep, I noticed that but I can't figure out why. The page isn't configured anymore and even now got moved away entirely, followed by multiple web server restarts. This doesn't make any sense. Also I can't reproduce this outside of BOINC...

Einstein@Home Project

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3942
Credit: 46577142642
RAC: 64183848

Maybe local issues or issues

Maybe local issues or issues with site communication through certain nodes? 
 

All of my hosts are working normally. 

_________________________________________________________________________

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 984
Credit: 25171376
RAC: 50

Darn, I found it. It's the

Darn, I found it. It's the hideous on-disk cache used by our web server. A simple server restart doesn't purge it, so this needed manual treatment. I tested it locally and a simple "Update" in the BOINC Manager should now do the trick.

Sorry for the hickup!

Oliver

Einstein@Home Project

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 4963
Credit: 18696120051
RAC: 6195352

Thanks for the successful

Thanks for the successful bug-hunt, Oliver.  All my hosts are communicating again.

 

Ian&Steve C.
Ian&Steve C.
Joined: 19 Jan 20
Posts: 3942
Credit: 46577142642
RAC: 64183848

another artifact from the

another artifact from the project migration seems to be that stats exporting is no longer happening.

_________________________________________________________________________

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.