Web replica down

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250468812
RAC: 35303
Topic 196836

It looks like the server that runs our replica DB went down this morning (UWM time). So far we (at AEI) don't have any other information from UWM, local admins have been notified.

The master DB is working ok, so the only things affected are the web pages (server status page, display of tasks and computers in "your account" etc.).

We'll keep you posted.

BM

BM

astro-marwil
astro-marwil
Joined: 28 May 05
Posts: 532
Credit: 645316543
RAC: 1109179

Web replica down

Thankyou for information!
Martin

ggesmundo
ggesmundo
Joined: 3 Jun 12
Posts: 31
Credit: 18699116
RAC: 0

Is this the reason that there

Is this the reason that there have been no stats reported today?

MAGIC Quantum Mechanic
MAGIC Quantum M...
Joined: 18 Jan 05
Posts: 1886
Credit: 1407471243
RAC: 1141946

Thanks Bernd, Everything


Thanks Bernd,

Everything is working at my part of the planet.

-Samson

Nobody316
Nobody316
Joined: 14 Jan 13
Posts: 141
Credit: 2008126
RAC: 0

RE: Is this the reason that

Quote:
Is this the reason that there have been no stats reported today?

I think so as mine has not updated through BOINCstats but on site here under my account it has. Also http://einstein.phys.uwm.edu/server_status.html only shows half a page and at the top says The database server is not accessible. Give them some time and everything should work out and be back up and running.

PC setup MSI-970A-G46 AMD FX-8350 8 core OC'd 4.45GHz 16GB ram PC3-10700 Geforce GTX 650Ti Windows 7 x64 Einstein@Home

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250468812
RAC: 35303

A central fileserver crashed

A central fileserver crashed @UWM over the weekend, affecting many internal and external services, including communication (mail, user accounts, software repositories etc.). Admins there are busy to get that working first, I guess they don't have much time to spend on Einstein@Home today.

BM

BM

TPCBF
TPCBF
Joined: 24 Nov 12
Posts: 17
Credit: 216190468
RAC: 1554616

Well, any updates on when

Well, any updates on when this issue might be resolved?

Neil Newell
Neil Newell
Joined: 20 Nov 12
Posts: 176
Credit: 169699457
RAC: 0

There's some recent info in

There's some recent info in this post (apparently it's still down while they rebuild a large ZFS storage appliance).

The issue doesn't seem to be having much impact; I imagine there's going to be a blip for e@h volunteers on the stats sites when it comes back up, but otherwise is it affecting anything?

TPCBF
TPCBF
Joined: 24 Nov 12
Posts: 17
Credit: 216190468
RAC: 1554616

RE: There's some recent

Quote:
There's some recent info in this post (apparently it's still down while they rebuild a large ZFS storage appliance).

Well, that was kind of a "someone heard that someone else said that..."

Quote:
The issue doesn't seem to be having much impact; I imagine there's going to be a blip for e@h volunteers on the stats sites when it comes back up, but otherwise is it affecting anything?

Correct, so far basic operation of E@H seems to continue as usual.
Don't really mind any "blip effect" on the stats, beside that I usually use the stats (BOINCstats in particular) as an indicator if everything is running smoothly or if there are any issues, either project/server or client wise.
Without the stats pages updating properly, I have to check on each and every project now...

Ralf

Bernd Machenschalk
Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4312
Credit: 250468812
RAC: 35303

RE: Well, any updates on

Quote:
Well, any updates on when this issue might be resolved?

Not really.

These days we don't hear much from UWM, and every time we do it's rather bad news (more data being affected and possibly lost, one more service not working and can't be revived before yet another one is working etc.).

The last thing I heard was that they are trying to collect some storage they could back up some 90TB of data to from the crashed fileserver before trying various things to revive it.

They are all flattened out to keep the chaos within bounds. My rough estimate is that getting this fixed will take the rest of the week.

As Einstein@Home is working relatively good and reliably atm. I don't think they will bother spending a minute on it.

I'll try one or two things tomorrow to get at least the stats updated again.

BM

BM

Nobody316
Nobody316
Joined: 14 Jan 13
Posts: 141
Credit: 2008126
RAC: 0

RE: RE: Well, any updates

Quote:
Quote:
Well, any updates on when this issue might be resolved?

Not really.

These days we don't hear much from UWM, and every time we do it's rather bad news (more data being affected and possibly lost, one more service not working and can't be revived before yet another one is working etc.).

The last thing I heard was that they are trying to collect some storage they could back up some 90TB of data to from the crashed fileserver before trying various things to revive it.

They are all flattened out to keep the chaos within bounds. My rough estimate is that getting this fixed will take the rest of the week.

As Einstein@Home is working relatively good and reliably atm. I don't think they will bother spending a minute on it.

I'll try one or two things tomorrow to get at least the stats updated again.

BM

Thanks for the update BM. Will just have to sit and wait until they get it fixed which is fine by me though others worry about stats being updated. "updated threw boinc stats not so much on E@H site stats"

PC setup MSI-970A-G46 AMD FX-8350 8 core OC'd 4.45GHz 16GB ram PC3-10700 Geforce GTX 650Ti Windows 7 x64 Einstein@Home

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.