exit code 21 on linux

Hartmut Geissbauer
Joined: 5 Jan 06
Posts: 31
Credit: 152,941,307
RAC: 0
Topic 191877

For the last two weeks I haven't been able to process any WU.

My system:
Athlon XP 2700+
Linux 2.6.18
2 GB RAM
BOINC 5.4.9

Mostly the calculation fails like this:

2006-09-28 01:15:06 [Einstein@Home] Starting task l1_0206.5_S5R1__519_S5R1a_0 using einstein_S5R1 version 417
Archive: ../../projects/einstein.phys.uwm.edu/grid_0210_l_T01_S5R1.dat
inflating: ./grid_0210_l_T01_S5R1.dat
2006-09-28 01:15:49 [Einstein@Home] Unrecoverable error for result l1_0206.5_S5R1__519_S5R1a_0 (process exited with code 21 (0x15))

2006-09-28 01:15:06.9125 [normal]: Start of BOINC application 'einstein_S5R1_4.17_i686-pc-linux-gnu'.
2006-09-28 01:15:06.9418 [normal]: Started search at lalDebugLevel = 0
2006-09-28 01:15:07.1598 [normal]: Checkpoint-file 'Fstat.out.ckp' not found.
2006-09-28 01:15:07.1599 [normal]: No usable checkpoint found, starting from beginning.
Detected CPU type 1
xTemp >= 0 failed
DEBUG: loop=154, xTemp=-26179.802602, f=206.947044, alpha=46, tempInt1[alpha]=185
DEBUG: skyConst[ tempInt1[ alpha ] ] = 1799.976566, xSum[ alpha ]=-0.409293

What does "xTemp >= 0 failed" mean?

I've reset the project --> no success
I've moved the boinc folder and created it from scratch --> no success.

Any help would be appreciated.

Hartmut

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4,312
Credit: 250,559,893
RAC: 34,635

Quote:
What does "xTemp >= 0 failed" mean?


This is a sanity check in the program. A failure here usually points to a hardware problem: most often an overheating CPU (overclocking, a failing fan, or similar), less often faulty RAM. Check your hardware.
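
To illustrate what such a sanity check looks like, here is a minimal Python sketch. It is not the Einstein@Home source; the function names and the mapping of the failed check to exit code 21 are assumptions taken from the log output in this thread. The idea is simply that an intermediate value which can never be negative on healthy hardware is checked, and the process aborts with a distinctive exit code if it is.

```python
import sys

# Assumption: 21 is the exit code seen in the log above; how the real
# application maps failed checks to exit codes is not confirmed here.
SANITY_CHECK_EXIT_CODE = 21


class SanityCheckError(Exception):
    """Raised when an intermediate value violates an invariant."""


def check_non_negative(name, value):
    # "not value >= 0" (rather than "value < 0") also rejects NaN,
    # which compares false against everything.
    if not value >= 0:
        raise SanityCheckError(f"{name} >= 0 failed")


def run(x_temp):
    """Hypothetical computation step; returns a process exit code."""
    try:
        check_non_negative("xTemp", x_temp)
    except SanityCheckError as err:
        print(err, file=sys.stderr)
        return SANITY_CHECK_EXIT_CODE
    return 0


if __name__ == "__main__":
    # Value copied from the DEBUG line in the first post.
    sys.exit(run(-26179.802602))
```

Because the input data and the code are the same on every host, a check like this that fails on only one machine suggests the machine computed a wrong value, not that the program is wrong.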

BM

BM

Hartmut Geissbauer
Joined: 5 Jan 06
Posts: 31
Credit: 152,941,307
RAC: 0

Thank you for your response.
I've done some testing by running the mprime (Linux version of Prime95) torture test.
The test failed after a few seconds, so I suppose the problem is a defective CPU.
If it were a thermal problem, the test would run a little longer before failing.
Now I'm trying to get a replacement CPU for my system.
Unfortunately, this type is not sold on every corner nowadays, so I have to try the used-parts market, with no guarantee of getting an error-free one.
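
For readers without mprime at hand, the principle behind such a test can be sketched in a few lines of Python: run the same deterministic calculation several times and compare the results. This is not mprime's algorithm (mprime compares against known-good results and loads the machine far more heavily); the function names and the workload are made up for illustration, so a pass here proves very little, while a mismatch would be a strong hint of faulty hardware.

```python
import hashlib
import math


def workload_digest(n=200_000):
    """Run a deterministic floating-point workload and hash the
    intermediate results. Hypothetical stand-in for a real stress test."""
    digest = hashlib.sha256()
    acc = 0.0
    for i in range(1, n + 1):
        acc += math.sin(i) * math.sqrt(i)
        if i % 1000 == 0:
            # repr() of a float round-trips exactly, so any bit-level
            # difference in acc changes the digest.
            digest.update(repr(acc).encode("ascii"))
    return digest.hexdigest()


def is_consistent(runs=5, n=200_000):
    """Return True if every run produces the same digest."""
    reference = workload_digest(n)
    return all(workload_digest(n) == reference for _ in range(runs - 1))


if __name__ == "__main__":
    if is_consistent():
        print("all runs agree")
    else:
        print("MISMATCH: same input gave different results")
```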

Hartmut

Desti
Joined: 20 Aug 05
Posts: 117
Credit: 23,762,214
RAC: 0

Message 47323 in response to message 47322

A defective CPU is relatively unlikely.
I've seen that you have 2 GiB of RAM. That's a lot for most Socket A mainboards, and I would recommend checking the RAM first before you spend money on a new CPU.

Hartmut Geissbauer
Joined: 5 Jan 06
Posts: 31
Credit: 152,941,307
RAC: 0

The RAM is brand new; I replaced it 14 days ago because the old RAM was defective.
That was when I received 0.00 credit for my results; they were sent to a third computer for recalculation.
The new RAM has been checked with memtest86 and seems to be OK.
BTW: I bought two used CPUs on eBay. They'll be delivered next week. Then I'll do some more checking.

Hartmut

Hartmut Geissbauer
Joined: 5 Jan 06
Posts: 31
Credit: 152,941,307
RAC: 0

Problem solved!

Here are my results from testing:

CPU ok
RAM ok
Motherboard defective

While swapping the CPUs I discovered a bunch of burst electrolytic capacitors on the motherboard.
It was an ASUS A7N8X. Does anyone else have a similar problem?

Hartmut

Desti
Joined: 20 Aug 05
Posts: 117
Credit: 23,762,214
RAC: 0

Message 47326 in response to message 47325

Failed electrolytic capacitors on mainboards are a known problem, but I have never heard of it happening on an A7N8X.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4,312
Credit: 250,559,893
RAC: 34,635

Message 47327 in response to message 47325

Quote:
While swapping the CPUs I discovered a bunch of burst electrolytic capacitors on the motherboard.
It was an ASUS A7N8X. Does anyone else have a similar problem?


Yep. In fact 180 of them.

BM

BM

Desti
Joined: 20 Aug 05
Posts: 117
Credit: 23,762,214
RAC: 0

Message 47328 in response to message 47327

What mainboards were affected?
