Pending tasks

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1639761715

RAC: 478385

7 May 2017 0:53:24 UTC

Topic 207383

Has anyone else noticed an increase in pending tasks? The reason I ask is in the last 2 days my pendings have gone from a typical 70 or so to over 150. Only 4 are inconclusive so I'm pretty sure my boxes are OK. I know pendings go up and down but this is very abnormal.

juan BFP

Joined: 18 Nov 11

Posts: 839

Credit: 421443712

RAC: 0

7 May 2017 1:27:36 UTC

Message 157585

Something is really wrong besides the pendings: if you look at the daily production of the top hosts, they are all down about 40%. That can't be just a coincidence.

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1639761715

RAC: 478385

7 May 2017 1:38:47 UTC

Message 157586 in response to message 157585

That is consistent with what I'm seeing.

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5887

Credit: 119496921813

RAC: 25442147

7 May 2017 2:18:04 UTC

Message 157587 in response to message 157585

There's really no need to be alarmist about it.

juan BFP wrote:

Something is really wrong besides the pendings...

You can't really draw that conclusion. It's pretty clear that there has been some sort of drop in the rate at which results are being validated. It's likely that is just the same thing as the increase in pendings. Perhaps there is some blockage in the pipeline that gets returned results to the validator. There is no backlog showing on the server status page so it is perhaps before the point where queued numbers are measured. I imagine at some point someone will come along and release the blockage :-). This is not a major crisis (yet) :-). It seems to be a drop in apparent production which, later on, will probably turn into a spike in apparent production. I think we should just wait calmly until the Devs resolve the issue.

juan BFP wrote:

... if you look at the daily production of the top hosts, they are all down about 40%. That can't be just a coincidence.

I don't know where you get the 40% from - I haven't even tried to check. In my case, my fleet has shed about 7% so far which seems to be just a bit less than 40% :-).

Cheers,
Gary.

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1639761715

RAC: 478385

7 May 2017 2:36:27 UTC

Message 157588 in response to message 157587

"This is not a major crisis (yet) :-). It seems to be a drop in apparent production which, later on, will probably turn into a spike in apparent production. I think we should just wait calmly until the Devs resolve the issue."

Gary, since that is our only choice, that makes sense.

juan BFP

Joined: 18 Nov 11

Posts: 839

Credit: 421443712

RAC: 0

7 May 2017 12:04:44 UTC

Message 157601 in response to message 157587

Quote:

There's really no need to be alarmist about it.

My apologies if my bad English led you to that conclusion. I was not in <Panic> mode, just showing another side of the problem.

Quote:

I don't know where you get the 40% from - I haven't even tried to check. In my case, my fleet has shed about 7% so far which seems to be just a bit less than 40% :-).

Maybe that's because I look at the daily production of the host, not the RAC. Yes, the RAC drops a little (I can't say how much; I don't follow the RAC numbers).

What I can tell you is what the numbers say. A few examples can easily be found in the latest Free DC stats.

At the top of the list, the E@H top host is a 4 MM/day cruncher; its production yesterday was about 2 MM.

My own main cruncher is a 1.8-2 MM/day host, and yesterday's production was in the range of 1.3 MM.

The same is happening with most of the hosts on the list, excluding of course the newly rising hosts whose averages are not accurate.

Do the simple math and you will see the approximately 40% I was talking about. But it seems some hosts are more affected than others. Why, I have no idea.

Please be assured it was not my intention to be alarmist or to raise any problems or questions; I was just reporting something I was seeing.

That was my mistake. I forgot I was in an open (not our team) thread with people who don't know my difficulty in sometimes choosing the right words in English.

I wish you all a good weekend and happy crunching.

archae86

Joined: 6 Dec 05

Posts: 3165

Credit: 7381161687

RAC: 2112148

7 May 2017 13:08:27 UTC

Message 157604 in response to message 157587

Gary Roberts wrote:

I don't know where you get the 40% from - I haven't even tried to check. In my case, my fleet has shed about 7% so far which seems to be just a bit less than 40% :-).

Gary, as it happens I daily update a spreadsheet which gives me daily credit award rate for the last day and the last five days.

In the wonderful days when the website provided summary counts of pending jobs I also maintained an alternate column which added in the pending work, which aids detecting problems in my systems by smoothing out variation just attributable to luck of quorum partners. But that has not been available for about nine months.

Watching for unexpected changes in these results is a primary tool to alert me to problems in my small fleet, but it also can show me trouble at home base.

Based on numbers from that spreadsheet, I support Juan's observation of roughly a 40% reduction in credit award rate in the last couple of days. To be specific, I've been running at about 2,000,000/day, and the last two daily lines were first 1,293,208, then 1,214,236. These are wildly out of pattern, so something is going on somewhere. As all three of my hosts are similarly affected, it seems likely that "home base" rather than my systems is the somewhere.
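As a rough check of the figures quoted above, the drop can be worked out in a few lines. This is only a sketch: the 2,000,000/day baseline is the approximate "about 2,000,000/day" figure from the post, and the helper name `percent_drop` is made up for illustration.

```python
def percent_drop(baseline, observed):
    """Percentage by which `observed` falls short of `baseline`."""
    return 100.0 * (baseline - observed) / baseline

# Approximate normal daily credit quoted in the post (an assumption, not an exact value).
baseline = 2_000_000
# The two most recent daily credit lines quoted in the post.
daily = [1_293_208, 1_214_236]

drops = [percent_drop(baseline, credit) for credit in daily]
for credit, drop in zip(daily, drops):
    print(f"{credit:>9,} credits/day is {drop:.1f}% below baseline")
```

Both days come out between 35% and 40% below the baseline, which is in line with the "roughly 40%" description.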

Christian Beer

Joined: 9 Feb 05

Posts: 595

Credit: 197611612

RAC: 21898

7 May 2017 17:00:40 UTC

Message 157611

There are currently two things happening. One is that EinsteinAtHome is a BOINC Pentathlon project, which means that a lot of users are downloading work but not reporting it back (known as bunkering). The number of GPU tasks in progress has doubled over the last few days, which means the throughput has declined. As a result, not a lot of validation is happening for GPUs right now, and that will continue until next Tuesday, when the challenge starts and Pentathlon users report those tasks back.

The other thing is the new gravitational wave test run, which lacks Windows support right now. The high rate of pending tasks there is expected and will resolve itself once the Windows app is active and more users are able to crunch gravitational wave tasks.

Gary Roberts

Moderator

Joined: 9 Feb 05

Posts: 5887

Credit: 119496921813

RAC: 25442147

7 May 2017 22:18:57 UTC

Message 157616 in response to message 157601

juan BFP wrote:

Quote:
There's really no need to be alarmist about it.

My apologies if my bad English led you to that conclusion. I was not in <Panic> mode, just showing another side of the problem.

There's no need to apologise and your English is fine. If anyone should apologise, it is me :-).

I'm in the middle of commissioning a 34kW solar array (100 x 340W panels) and was concerned that statements like, "there must be other things as well as just an increase in pendings" might lead a whole lot of people to jump to unwarranted conclusions and start making a lot of noise like, "get this fixed now or I'm leaving", sort of thing. I was just keen to nip any such reaction 'in the bud' as I don't have time to get distracted at the moment. So I'm sorry for over-reacting and I really wasn't intending to 'pick on you'.

I'm fully aware of how slow RAC is to react to a sudden change. My RAC had dropped 7% so I did realise that the instantaneous drop would be a lot bigger. I was hoping the 'blockage' might be fairly quickly fixed so that most people probably would not have even noticed. A statement of "40% drop" does tend to grab attention however :-).
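The lag between RAC and the instantaneous rate can be illustrated with a small sketch. It assumes, beyond what is stated in this thread, that RAC behaves like an exponentially weighted average with a half-life of one week, that the host was at steady state beforehand, and that the daily credit rate fell by a fixed fraction and stayed there. The function name `rac_drop_fraction` is hypothetical.

```python
HALF_LIFE_DAYS = 7.0  # assumption: RAC decays with a one-week half-life

def rac_drop_fraction(daily_drop, days, half_life=HALF_LIFE_DAYS):
    """Fractional fall in RAC after `days` of a sustained fractional
    drop `daily_drop` in the daily credit rate (continuous approximation)."""
    # The old average decays toward the new, lower rate by a factor of
    # one half for every `half_life` days that pass.
    return daily_drop * (1.0 - 0.5 ** (days / half_life))

# A 40% drop in daily credit, sustained for two days.
drop = rac_drop_fraction(0.40, 2.0)
print(f"RAC is down about {drop:.1%}")
```

Under those assumptions, a 40% fall in daily credit sustained for two days moves RAC by only about 7%, so the two figures quoted in this thread are not in conflict.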

Now that Christian has explained the cause, we can all relax. I've never participated in any of these 'challenges' or whatever they call them. I know nothing about them and I really don't want to know either :-). It's the first time I've heard this term 'bunkering'. I understand why it's an appropriate term - a coal or oil fired power plant taking on fuel :-). However it seems to me that 'bunkering' is not really a good analogy. Taking on fuel doesn't give a power plant the ability to produce a greater instantaneous output. Pre-crunching tasks and releasing them only when they can be measured seems to be an artificial way of making your system appear to be more powerful than it really is. I really don't get the point of that?? It must play havoc with the orderly running of a project.

Cheers,
Gary.

Der Mann mit de...

Joined: 12 Dec 05

Posts: 151

Credit: 302594178

RAC: 0

8 May 2017 10:08:47 UTC

Message 157634 in response to message 157604

BTW: soon can be a rubberband! :-(

https://einsteinathome.org/content/report-web-site-problems-here?page=5#comment-155007

Greetings from the North

Betreger

Joined: 25 Feb 05

Posts: 992

Credit: 1639761715

RAC: 478385

9 May 2017 1:24:20 UTC

Message 157660 in response to message 157611

Christian, you seem to have hit the nail on the head. Now that the disruptive BOINC Pentathlon has moved over to this project, my RAC is starting to go up and the pendings are going down.
