new units not downloading

Sharky T
Sharky T
Joined: 19 Feb 05
Posts: 159
Credit: 1187722
RAC: 0

RE: However, it's not my

Quote:
However, it's not my day today :). I took your advice and cancelled running work that was in many cases 80-90% complete!!! And I'm still not mad at you in the slightest :). I'd rather lose the credits than hold up the science by doing work that will only have to be repeated anyway so my cancelling the partly completed work was still the right thing to do.

I aborted 1 ongoing h1_WU and its been granted the claimed credit,so I don't think you loose those credits. :)

Edit: Hmm.. 4.19.. Was there a abort/cancel-button on those?
Hope they got reported.(Haven't read all posts here.(too long))


Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5874
Credit: 117975698290
RAC: 21944466

RE: Edit: Hmm.. 4.19.. Was

Message 13547 in response to message 13546

Quote:

Edit: Hmm.. 4.19.. Was there a abort/cancel-button on those?
Hope they got reported.(Haven't read all posts here.(too long))

Yep, you worked it out exactly!! There is no abort button in 4.19 which is why I reported my procedure earlier thinking I might be helping other 4.19ers. The computation on the WU gets zeroed when BOINC restarts after deleting the h1_nnnn file. So no credit will be coming for those.

However it doesn't matter in the slightest as it would be a waste of science to keep spending cycles on a WU that wont contribute.

Cheers,
Gary.

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1119
Credit: 172127663
RAC: 0

RE: RE: [Edit added 30

Message 13548 in response to message 13545

Quote:
Quote:


[Edit added 30 min later]
I found a script that I have used before, which I can use to grant credit to users/hosts/teams for workunits which I have cancelled. I am going to use this to grant credit to people who have had the misfortune of getting and doing work then having it cancelled.

Bruce

I'm very pleased that you have done that and it will be good for the silent majority who probably aren't even aware of the problem yet.

However, it's not my day today :). I took your advice and cancelled running work that was in many cases 80-90% complete!!! And I'm still not mad at you in the slightest :). I'd rather lose the credits than hold up the science by doing work that will only have to be repeated anyway so my cancelling the partly completed work was still the right thing to do.

Good news -- I'm giving credit for cancelled and 'download error' work as well as successful and valid results. Since these problems were my fault it seems the least I can do.

Quote:

It must have been one of those nightmare days (and nights) for you :).

I confess to being in a pretty foul mood for most of the day today!

Director, Einstein@Home

hih_tv-Greg
hih_tv-Greg
Joined: 11 Feb 05
Posts: 94
Credit: 31815
RAC: 0

I just aborted

I just aborted "h1_0118.0__0118.1_0.1_T00_S4ha_0" from my machine, 06/28/2005 08:11:06 PM|Einstein@Home|Starting result l1_0315.5__0315.9_0.1_T00_S4lA_0 using einstein version 4.79.

Greg

Mahray
Mahray
Joined: 11 Nov 04
Posts: 43
Credit: 137174464
RAC: 138018

I'd also like to say thanks

I'd also like to say thanks for keeping us informed. Screw-ups happen, and I'm quite happy as long as I'm reasonably well informed.

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5874
Credit: 117975698290
RAC: 21944466

RE: I confess to being in

Message 13551 in response to message 13548

Quote:

I confess to being in a pretty foul mood for most of the day today!

Actually you deserve heaps of praise for the way you handled everything. I don't think you could have done more and the issue was completely defused before there were any nasty surprises and the accompanying flood of complaints that would normally be expected to follow.

It is this kind of professionalism that makes me proud to give my full support to this project. Well done, and many thanks for all your efforts!!

Cheers,
Gary.

Ananas
Ananas
Joined: 22 Jan 05
Posts: 272
Credit: 2500681
RAC: 0

I agree, good work from the

I agree, good work from the country of cheese and packers :-)

Especially for the good communication I'll give an A++

gravywavy
gravywavy
Joined: 22 Jan 05
Posts: 392
Credit: 68962
RAC: 0

RE: Actually you deserve

Message 13553 in response to message 13551

Quote:
Actually you deserve heaps of praise for the way you handled everything. I don't think you could have done more

it wasn't till I saw this wu that I realised just how much Bruce had done to defuse anger: he has set things up so that people get credit for the part worked wu they cancel part way through - at least I think that is what this wu is telling us

Quote:

It is this kind of professionalism that makes me proud to give my full support to this project. Well done, and many thanks for all your efforts!!


agreed^2

~~gravywavy

Bruce Allen
Bruce Allen
Moderator
Joined: 15 Oct 04
Posts: 1119
Credit: 172127663
RAC: 0

RE: it wasn't till I saw

Message 13554 in response to message 13553

Quote:

it wasn't till I saw this wu that I realised just how much Bruce had done to defuse anger: he has set things up so that people get credit for the part worked wu they cancel part way through - at least I think that is what this wu is telling us

Your interpretation is entirely correct. I am giving credit for partial/aborted/failed/completed h1_* workunits. Note that this is not instantaneous and may take a few hours. I have to run the script by hand and only do it a few times per day.

Bruce

Director, Einstein@Home

gravywavy
gravywavy
Joined: 22 Jan 05
Posts: 392
Credit: 68962
RAC: 0

RE: RE: it wasn't till I

Message 13555 in response to message 13554

Quote:
Quote:

it wasn't till I saw this wu that I realised just how much Bruce had done to defuse anger: he has set things up so that people get credit for the part worked wu they cancel part way through - at least I think that is what this wu is telling us

Your interpretation is entirely correct. I am giving credit for partial/aborted/failed/completed h1_* workunits. Note that this is not instantaneous and may take a few hours. I have to run the script by hand and only do it a few times per day.

Bruce

Gary has pointed out to me that credit is not granted for wu that are killed by stealing their files. On consideration this makes sense if the xml that held the cpu time has gone. If the client re-starts the download when the files vanish, presumably it also deletes/overwrites the file that remembers the cpu time so far?

My thought is that it may be better, if running 4.19, to kill those wu from the operating system while BOINC is actually crunching them. This assumes the OS has some kind of task manager (eg not Win-98).

On win-XP for example, hit ctrl-alt-del and the task manager comes up. Highlight the Einstein task, right click, and kill process. The wu will report to BOINC that it ended with some error code that means killed. I think that this means that BOINC will report it back with a 'client error' message and they will get credit.

On linux: you probably already know how to use top or ps to get the pid, and how to use kill to abort. If not, I recommend the man pages on top, ps, kill.

Note: I have tried the win-xp method in the past, but not on these wu. If my suggestion won't work, please say so!

~~gravywavy

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.