Unsent WU

Pete
Pete
Joined: 31 Jul 10
Posts: 14
Credit: 1020243718
RAC: 0
Topic 198573

I wonder if anyone can tell me why a recent WU that I have processed has not been resent to another co worker? WU 243672480 was sent to me on the 8th April and to another on the 9th. On the 13th my co worker had an error whilst computing and since then the WU has not been resent although a new task number was generated.
Regards Peter

Holmis
Joined: 4 Jan 05
Posts: 1118
Credit: 1055935564
RAC: 0

Unsent WU

This is a side effect of how the scheduler here at Einstein works.

For every Gravity Wave workunit you need a set of large data files, this set of files can be used by several hundred workunits and the scheduler tries to send workunits to hosts that already have the required data files or that only has to download a few files to complete the set required.
This might sometimes lead to workunits remaining unsent for several days as the scheduler waits for a suitable host to come as for work. If enough time passes the workunit will be sent out to the next host asking for this kind of work even though that host might not have any data files required and so will have to download the complete set.

So you just need to have a bit more patience and the workunit will be sent out again.

Christian Beer
Christian Beer
Joined: 9 Feb 05
Posts: 595
Credit: 196961446
RAC: 202093

There was a problem with the

There was a problem with the locality scheduler that assigns work to hosts. Because of that the number of unsent tasks was increasing over the last 2 weeks. It took some time to understand what was happening but we finally fixed this last Friday. Since then the locality scheduler is assigning the already generated but still unsent tasks to requesting hosts instead of generating new work.

It took three days to go from 210k unsent to 150k now. So it will take until next week for the queue to empty.

Edit: What Holmis said is correct. But in this special case until last Friday the locality scheduler was not behaving like it should.

Pete
Pete
Joined: 31 Jul 10
Posts: 14
Credit: 1020243718
RAC: 0

Thank you both for the

Thank you both for the explanation.
Pete

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.