Unusually Fast O1 All Sky Completions on Problem Machine

Gamboleer
Joined: 5 Dec 10
Posts: 173
Credit: 168389195
RAC: 0
Topic 198599

Hello,

I have a dual E5-2670 machine with an AMD 7970 inside. The 7970 had previously been giving me frequent validate errors and invalid results on BRP6 tasks. I had it fixed by underclocking it, but the problem started again.

At the same time, I noticed that many of my O1 all-sky tasks were taking about 30% longer than normal (normal time, ~55k seconds). And a few took only 15k seconds, 350% faster than normal, yet were validated as good results. I thought perhaps these were "small" tasks, but comparing times to my wingmen, their times appear normal for their CPUs. I am posting here so these can be examined to be sure the validator is working correctly.

Sample work units that completed 350% faster than normal, yet validated:

244989928, 244286371, 244989703, 244581088, 244989369, 244989364, 244273986, 244970188, 244970148

The computer is 12232333. O1 tasks are run hyperthreaded. I have removed the AMD card and the O1 result times seem to be normalizing again.
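To put the timing figures in this post on a common footing, here is a minimal sketch of the arithmetic. It uses only the approximate runtimes quoted above (about 55k seconds normal, 15k seconds for the fast tasks, 30% longer for the slow ones); the function names are illustrative and not part of any BOINC tool.

```python
def speedup_factor(normal_s: float, observed_s: float) -> float:
    """How many times faster the observed run is than a normal run."""
    return normal_s / observed_s


def time_change_percent(normal_s: float, observed_s: float) -> float:
    """Signed percent change in runtime relative to normal (negative = shorter)."""
    return (observed_s - normal_s) / normal_s * 100.0


NORMAL_S = 55_000           # approximate normal runtime from the post
FAST_S = 15_000             # runtime reported for the unusually fast tasks
SLOW_S = NORMAL_S * 1.30    # tasks reported as about 30% longer than normal

print(f"fast tasks: {speedup_factor(NORMAL_S, FAST_S):.2f}x normal speed")
print(f"fast tasks: {time_change_percent(NORMAL_S, FAST_S):.0f}% runtime change")
print(f"slow tasks: about {SLOW_S:.0f} s")
```

By this measure the fast tasks report a runtime roughly 3.7 times shorter than normal, which is the figure worth comparing against wingmen.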

AgentB
Joined: 17 Mar 12
Posts: 915
Credit: 513211304
RAC: 0

Quote:

At the same time, I noticed that many of my O1 all-sky tasks were taking about 30% longer than normal (normal time, ~55k seconds). And a few took only 15k seconds, 350% faster than normal, yet were validated as good results.

Looking at task 557462853, we see:

2016-05-05 11:26:57.4354 (5688) [debug]: Successfully read checkpoint:24326

So it looks like the reported task times are wrong after a restart from a checkpoint. I looked at a couple of the others, and they also restarted from a checkpoint at that time.

I'm not 100% sure but there is something for the devs to look at.
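For anyone who wants to check their own tasks for the same pattern, here is a rough sketch that scans a task's stderr text for checkpoint-read lines like the one quoted above. The regular expression is modelled on that single line only, so other application versions may format it differently. Treating the number in parentheses as a process ID is my assumption, and the function and field names are made up for illustration.

```python
import re

# Modelled on the one line quoted above:
# 2016-05-05 11:26:57.4354 (5688) [debug]: Successfully read checkpoint:24326
CHECKPOINT_RE = re.compile(
    r"^(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\s+"
    r"\((?P<proc_id>\d+)\)\s+\[debug\]:\s+"
    r"Successfully read checkpoint:\s*(?P<value>\d+)"
)


def find_checkpoint_restarts(stderr_text):
    """Return (timestamp, proc_id, checkpoint value) for each checkpoint-read line.

    proc_id is ASSUMED to be a process ID; the log itself does not say so.
    """
    restarts = []
    for line in stderr_text.splitlines():
        match = CHECKPOINT_RE.match(line.strip())
        if match:
            restarts.append(
                (match["timestamp"], int(match["proc_id"]), int(match["value"]))
            )
    return restarts


sample = (
    "2016-05-05 11:26:57.4354 (5688) [debug]: Successfully read checkpoint:24326"
)
print(find_checkpoint_restarts(sample))
```

A task whose stderr contains such a line and whose reported runtime is far below normal would be a candidate for the behaviour described here.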

Christian Beer
Joined: 9 Feb 05
Posts: 595
Credit: 196961446
RAC: 202093

Yes, I saw something similar on the dedicated cluster machines too: after a checkpoint, the runtime is also zeroed and reported incorrectly. I wonder if this is a problem with a specific client version.
