WU stopped processing

Daniel Fontenot
Joined: 13 Aug 10
Posts: 3
Credit: 1893075
RAC: 0
Topic 195478

Quote:

I have had several WUs stop processing and never finish.
No error messages show up.
Other WUs run to 100% and are transferred.
This is the last message on the WU.
11/26/2010 2:16:18 PM Einstein@Home Restarting task h1_1213.15_S5R4__138_S5GC1a_0 using einstein_S5GC1 version 302

The "Properties" of the WU are:
Application Global Correlations S5 search #1 3.02(S5GCESSE2)
Workunit name h1_1213.15_S5R4__138_S5GC1a
State Waiting to run
Received 11/21/2010 4:42:11 AM
Report deadline 12/5/2010 4:42:10 AM
CPU time at last checkpoint 14:49:49
CPU time 14:56:36
Elapsed time 18:32:01
Estimated time remaining 01:59:59
Fraction done 90.288 %
Virtual memory size 191.51 MB
Working set size 204.00 MB
Directory slots/4
Process ID 3264

Can anyone tell me what is wrong?

BilBg
Joined: 27 May 07
Posts: 56
Credit: 23998
RAC: 0

Quote:

I have had several WUs stop processing and never finish.
No error messages show up.
Other WUs run to 100% and are transferred.
This is the last message on the WU.
11/26/2010 2:16:18 PM Einstein@Home Restarting task h1_1213.15_S5R4__138_S5GC1a_0 using einstein_S5GC1 version 302

The "Properties" of the WU are:
Application Global Correlations S5 search #1 3.02(S5GCESSE2)
Workunit name h1_1213.15_S5R4__138_S5GC1a
State Waiting to run
Received 11/21/2010 4:42:11 AM
Report deadline 12/5/2010 4:42:10 AM
CPU time at last checkpoint 14:49:49
CPU time 14:56:36
Elapsed time 18:32:01
Estimated time remaining 01:59:59
Fraction done 90.288 %
Virtual memory size 191.51 MB
Working set size 204.00 MB
Directory slots/4
Process ID 3264

Can anyone tell me what is wrong?

"State: Waiting to run" means there is no problem. Another task is running in "High priority" mode at the moment, and this task will be resumed later (in hours or days).

You aborted a task after more than 17 hours of computation:
http://einsteinathome.org/host/3256335/tasks&offset=0&show_names=0&state=5

Why?!
It is absolutely normal for BOINC to pause some tasks automatically ("Waiting to run")
and resume them after some time, continuing the computation from the last checkpoint instead of starting over.

BOINC does this when it sees that some other task may miss its deadline:
the current task is paused, the "High priority" tasks (those at risk of missing their deadline) are run,
and the paused task is resumed when BOINC decides the time is appropriate.
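To make the idea concrete, here is a minimal sketch of deadline-driven preemption. It is not BOINC's actual scheduler, which is considerably more elaborate; the `Task` class, the `at_risk` rule, and the second task are assumptions made up for illustration. The first task uses the numbers from the first post (about 2 hours remaining and roughly 206 hours until the report deadline at the time of the last log message).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Task:
    name: str
    hours_remaining: float    # estimated compute hours still needed
    hours_to_deadline: float  # wall-clock hours until the report deadline


def at_risk(task: Task, on_fraction: float = 1.0) -> bool:
    """Assumed rule: a task is at risk if its remaining work, stretched by the
    fraction of time the host actually computes, does not fit before the deadline."""
    return task.hours_remaining / on_fraction > task.hours_to_deadline


def pick_next(running: Task, waiting: List[Task]) -> Task:
    """Return the task that should hold the CPU next."""
    urgent = [t for t in waiting if at_risk(t)]
    if urgent:
        # Preempt: the at-risk task with the nearest deadline goes first.
        return min(urgent, key=lambda t: t.hours_to_deadline)
    return running  # nothing is urgent, so the current task keeps running


paused = Task("h1_1213.15_S5R4__138_S5GC1a_0", 2.0, 206.0)
other = Task("hypothetical_task_near_deadline", 20.0, 15.0)  # made-up task
print(pick_next(paused, [other]).name)
```

Under these assumptions the made-up task is chosen, and the nearly finished task sits in "Waiting to run" even though it is 90% done.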

P.S.
Just for testing purposes (there is no need to do this regularly, and it is not recommended),
you can force BOINC to continue a task that is in the "Waiting to run" state.

Suspend all tasks except the one you want to run (resume/start):

Select all tasks (click on any task, then press [Home], then [Shift]-[End]).
Deselect the task you want to resume ([Ctrl]-click on it).
Press the [Suspend] button on the left.

(Of course, once the "wanted" task has finished, you have to [Resume] all the suspended tasks in the same way.)
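The same steps can probably be done from the command line with `boinccmd`, whose `--task URL task_name suspend|resume` operation acts on a single task. The sketch below only builds and prints the command lines; it does not run anything. The project URL is a placeholder (use the project URL your client shows), the two extra task names are invented, and the helper `task_commands` is hypothetical.

```python
from typing import List


def task_commands(project_url: str, task_names: List[str],
                  keep: str, op: str) -> List[List[str]]:
    """Build one boinccmd command per task, skipping the task named in `keep`."""
    if op not in ("suspend", "resume"):
        raise ValueError("op must be 'suspend' or 'resume'")
    return [["boinccmd", "--task", project_url, name, op]
            for name in task_names if name != keep]


PROJECT_URL = "http://project.example/"  # placeholder, not the real project URL
tasks = [
    "h1_1213.15_S5R4__138_S5GC1a_0",  # the task that should keep running
    "hypothetical_task_A",            # made-up names for the other tasks
    "hypothetical_task_B",
]

# Print the commands that would suspend everything except the first task.
for cmd in task_commands(PROJECT_URL, tasks, keep=tasks[0], op="suspend"):
    print(" ".join(cmd))
```

Calling the helper again with `op="resume"` gives the commands that undo the suspension afterwards.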

- ALF - "Find out what you don't do well ..... then don't do it!" :)

Daniel Fontenot
Joined: 13 Aug 10
Posts: 3
Credit: 1893075
RAC: 0

I have suspended all other tasks and this task will get no further than 90.288% even after hours of running.

Gundolf Jahn
Joined: 1 Mar 05
Posts: 1079
Credit: 341280
RAC: 0

Did you try a reboot?

Regards,
Gundolf

Computers aren't everything in life. (Just kidding)

Daniel Fontenot
Joined: 13 Aug 10
Posts: 3
Credit: 1893075
RAC: 0

Yes, I did reboot. That did not get the WU restarted.

All is working now.
I shut down all possible applications and eventually the WU did complete.
Thanks for the help.

Gundolf Jahn
Joined: 1 Mar 05
Posts: 1079
Credit: 341280
RAC: 0

Quote:
I shut down all possible applications and eventually the WU did complete.


Is it possible that the WU was "waiting for memory"? From your first post:

Quote:
The "Properties" of the WU are:
Application Global Correlations S5 search #1 3.02(S5GCESSE2)
Workunit name h1_1213.15_S5R4__138_S5GC1a
State Waiting to run
Received 11/21/2010 4:42:11 AM
Report deadline 12/5/2010 4:42:10 AM
CPU time at last checkpoint 14:49:49
CPU time 14:56:36
Elapsed time 18:32:01
Estimated time remaining 01:59:59
Fraction done 90.288 %
Virtual memory size 191.51 MB
Working set size 204.00 MB

Directory slots/4
Process ID 3264


Regards,
Gundolf

Computers aren't everything in life. (Just kidding)

BilBg
Joined: 27 May 07
Posts: 56
Credit: 23998
RAC: 0

I think "Waiting for memory" would be shown as the State of the task.

Also, even if you free memory by exiting unneeded programs, this will not make the task run.
The way to give BOINC more memory is to change these BOINC preferences:
Memory: when computer is in use, use at most 70% of total
Memory: when computer is not in use, use at most 80% of total

(With 750.8 MB of memory, try at least 50%.)
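As a rough check of those numbers, the sketch below computes the memory budget that a given percentage allows on this host and compares it with the task's working set of 204.00 MB from the first post. It assumes the budget is simply the percentage times total RAM and that only this one task is running; the real client shares the budget among all running tasks, so treat this as arithmetic only. The function names are made up.

```python
def memory_budget_mb(total_mb: float, percent: float) -> float:
    """RAM that BOINC may use under a 'use at most N% of total' preference."""
    return total_mb * percent / 100.0


def min_percent_needed(total_mb: float, working_set_mb: float) -> float:
    """Smallest percentage whose budget still covers the working set."""
    return 100.0 * working_set_mb / total_mb


TOTAL_MB = 750.8         # host memory quoted in this post
WORKING_SET_MB = 204.00  # "Working set size" from the task properties

for pct in (20, 50, 70):
    budget = memory_budget_mb(TOTAL_MB, pct)
    fits = WORKING_SET_MB <= budget
    print(f"{pct}% of {TOTAL_MB} MB = {budget:.1f} MB, task fits: {fits}")

print(f"minimum setting: {min_percent_needed(TOTAL_MB, WORKING_SET_MB):.1f}%")
```

On these figures a 50% setting leaves about 375 MB, comfortably more than the task needs, while anything below roughly 27% would not be enough.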

Or maybe the problem is this preference:
Suspend work if CPU usage is above --- %

(Change it to 0 (zero) to switch it off.)

- ALF - "Find out what you don't do well ..... then don't do it!" :)

Gundolf Jahn
Joined: 1 Mar 05
Posts: 1079
Credit: 341280
RAC: 0

Quote:
Also, even if you free memory by exiting unneeded programs, this will not make the task run.


Why not? I think that would be the primary function of the client in this respect.

Regards,
Gundolf

Computers aren't everything in life. (Just kidding)

BilBg
Joined: 27 May 07
Posts: 56
Credit: 23998
RAC: 0

Usually only a few MB of RAM are really free.
If BOINC checked for free RAM, it would (almost) never run any app.

If you tell BOINC that it can use up to 50% of 1 GB of RAM (about 500 MB),
BOINC will allocate the needed RAM (e.g. 200 MB) even if the OS reports only 1 MB of free RAM.

(Windows will shrink the disk cache and swap parts of the RAM used by idle programs out to disk in order to free 200 MB of RAM.)

You can check this: set BOINC to use only 1% of RAM and it will wait for memory even if you exit all unneeded programs and the OS reports hundreds of MB of RAM free.
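The difference between the two rules can be sketched as follows. Both functions are illustrations of the argument in this post, not BOINC code, and 1 GB is rounded to 1000 MB as in the example above.

```python
def runs_if_checking_free_ram(needed_mb: float, os_free_mb: float) -> bool:
    """Hypothetical rule: start the task only if the OS reports enough free RAM."""
    return needed_mb <= os_free_mb


def runs_under_preference(needed_mb: float, total_mb: float,
                          max_percent: float) -> bool:
    """Rule described in this post: compare against the configured share of total RAM."""
    return needed_mb <= total_mb * max_percent / 100.0


NEEDED_MB = 200.0
TOTAL_MB = 1000.0

# 50% preference, OS reports only 1 MB free: the preference rule still runs the task.
print(runs_if_checking_free_ram(NEEDED_MB, os_free_mb=1.0))
print(runs_under_preference(NEEDED_MB, TOTAL_MB, max_percent=50))

# 1% preference (10 MB budget): the task waits no matter how much RAM is free.
print(runs_under_preference(NEEDED_MB, TOTAL_MB, max_percent=1))
```

Note that the OS free-RAM figure does not appear in the second rule at all, which is why closing other programs would not change its outcome.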

- ALF - "Find out what you don't do well ..... then don't do it!" :)
