w/u fail at end of processing

nairb
nairb
Joined: 21 Feb 05
Posts: 22
Credit: 6489225
RAC: 3327
Topic 193080

Hi, I have a linux machine running for ages processing w/u just fine.
I have now had 5 or 6 w/u end with
(process exited with code 193 (0xc1)) right at the end of processing.
The app ver is einst_s5r2 4.35.
Is this a machine prob?. Cannot find what code 193 is.

Ta
Nairb

Stick
Stick
Joined: 24 Feb 05
Posts: 790
Credit: 33456032
RAC: 13270

w/u fail at end of processing

Quote:

Hi, I have a linux machine running for ages processing w/u just fine.
I have now had 5 or 6 w/u end with
(process exited with code 193 (0xc1)) right at the end of processing.
The app ver is einst_s5r2 4.35.
Is this a machine prob?. Cannot find what code 193 is.

Ta
Nairb

The BOINC Error Codes page (from the BOINC Wiki) says 193 is an ERR_INVALID_EVENT. Unfortunately, I don't know what that might mean relative to Linux and can't help you any further.

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 778721168
RAC: 1192385

RE: Hi, I have a linux

Quote:

Hi, I have a linux machine running for ages processing w/u just fine.
I have now had 5 or 6 w/u end with
(process exited with code 193 (0xc1)) right at the end of processing.
The app ver is einst_s5r2 4.35.
Is this a machine prob?. Cannot find what code 193 is.

Ta
Nairb

The science client terminates with segmentation violations, in different stages of the computation. I would not rule out a problem with the hardware. If it's overclocked, you might consider running at design speed for some time to see if that fixes the problem.

CU

H-BE

nairb
nairb
Joined: 21 Feb 05
Posts: 22
Credit: 6489225
RAC: 3327

Thanks for the replies. I

Thanks for the replies. I did check on the boinc wiki.... but plainly not in the right place. Each w/u fails at the end of processing. Which is frustrating on a 44hr w/u. I guess the science is lost as well as the credits.
So I have suspended ein for a while and downloaded some seti w/u to see if the same thing happens.
Maybe its time to run a mem checker for several hrs. This machine has been totally solid for years. This running flat out 24*7 sorts the good kit from the duff.

Nairb

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 778721168
RAC: 1192385

RE: Thanks for the replies.

Message 72702 in response to message 72701

Quote:

Thanks for the replies. I did check on the boinc wiki.... but plainly not in the right place. Each w/u fails at the end of processing. Which is frustrating on a 44hr w/u. I guess the science is lost as well as the credits.
So I have suspended ein for a while and downloaded some seti w/u to see if the same thing happens.
Maybe its time to run a mem checker for several hrs. This machine has been totally solid for years. This running flat out 24*7 sorts the good kit from the duff.

Nairb

Not all fail at the end, some fail at the very beginning. Hmmmm....could be related to zipping/unzipping files ?? And it started with the new app version ....

You know, there's one other thing you should try: install the latest beta test version from the beta test page. Something related to zipping/unzipping was fixed in this beta release, it could well apply to your problem!

Good luck!

H-BE

archae86
archae86
Joined: 6 Dec 05
Posts: 3161
Credit: 7307201689
RAC: 2300376

RE: Not all fail at the

Message 72703 in response to message 72702

Quote:

Not all fail at the end, some fail at the very beginning. Hmmmm....could be related to zipping/unzipping files ?? And it started with the new app version ....


Is it by any chance overclocked? It could easily be that for your particular machine, the new ap version by bad luck represented a slightly more challenging proposition--so if you had adjusted to very little margin before, it could have gone negative.

For my Core 2 machines, the Einstein ap then current when I was adjusting them (in April and in early August) definitely was more demanding from a speed/voltage point of view than was the SETI ap or just staying alive in Windows, so this can happen.

I mention it mainly because your symptoms are a pretty good fit. When I was two voltage increments too low for Einstein success I got a failure about once a day. When I was one increment too low I got one failure in two weeks. In weeks of running, I had no SETI or system failures, even a couple of increments lower voltage than any of those.

Your system's error-reports are utterly different than mine, but you are running a different architecture CPU on a different OS, so that is hardly surprising.

Jord
Joined: 26 Jan 05
Posts: 2952
Credit: 5893653
RAC: 1

RE: The BOINC Error Codes

Message 72704 in response to message 72699

Quote:
The BOINC Error Codes page (from the BOINC Wiki) says 193 is an ERR_INVALID_EVENT. Unfortunately, I don't know what that might mean relative to Linux and can't help you any further.


Stick, positive error numbers are a problem with the application.
Negative error numbers are a problem in BOINC.

Error -193 is a BOINC error. This error number is no longer used in the present code.

Error 193 is a SIGSEGV (segmentation violation) error in the application, pointing to memory problems or the Einstein application making invalid memory references.

nairb, besides checking the memory, also check if you still got enough paging file left.

Stick
Stick
Joined: 24 Feb 05
Posts: 790
Credit: 33456032
RAC: 13270

RE: Stick, positive error

Message 72705 in response to message 72704

Quote:
Stick, positive error numbers are a problem with the application.
Negative error numbers are a problem in BOINC.

Jord,

Thanks for correcting my screw-up!

Stick

th3
th3
Joined: 24 Aug 06
Posts: 208
Credit: 2208434
RAC: 0

Your comp was running OK with

Your comp was running OK with 4.21, then with 4.35 all your WUs are failing, then i have one more reason to dislike 4.35, LoL (not performing as well as 4.21 on Core 2)

Archae coud be right about the vcore thing, i had some Thoroughbred Athlons fail at stock volts/clock as they got older, they might require a voltage bump even without overclocking. Yours is a Barton core anyway (i think), they were considered to be better. Got a Barton 2500+ still going strong at stock volts, but it was never used for crunching, maybe thats how it survived this long? :)

nairb
nairb
Joined: 21 Feb 05
Posts: 22
Credit: 6489225
RAC: 3327

Well I think Archae is on the

Well I think Archae is on the right track here. This machine is the only overclocked pc I have. Its a barton core amd overclocked from 2500 to a 2800 and has been for the last 3 or 4 years. Its always been very stable with a stick of corsair xms memory. I never changed the core voltage as it did seti w/u fine. Its now done 4 seti w/u without fault. Its only with the new ein app that its started failing. Its not using any swap space either, so me thinks its time to retire this pc from ein work. I have put 2 older pc's on the job to see if they are ok with the new app.

Nairb

nairb
nairb
Joined: 21 Feb 05
Posts: 22
Credit: 6489225
RAC: 3327

So I decided to try 2 older

So I decided to try 2 older pc's...... 1 of them is a dual 700mhz xeon machine.
Never been overclocked and has run without any problem on any project for at least 5 years. Except that is for the latest downloaded ein app.
The first 2 wu quit after 44 hrs with the good old 193 error. The last 2 quit at the end of processing 106 hrs!. The final wu I aborted!. This poor old machine is still doing rosetta/seti etc without fault.

The other machine is running win98. And has done 2 or 3 ein wu fine.

So the conclusion is... ein dosent like any of my linux machines any more.

So when it becomes less fussy I will join in again.

Nairb

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.