linux 4.21 doesn't validate against windows 4.17?

Alf da Dinnamint
Joined: 21 Feb 05
Posts: 4
Credit: 499600
RAC: 0
Topic 192810

Over the last few weeks, I've had a number of results which didn't validate.

In every case, my host (linux, running 4.21) was validating against windows hosts running 4.17, and the windows hosts seem to have won. I have never seen a problem when my host is validating against other linux hosts; my results are always valid otherwise.

An example of such a workunit is http://einsteinathome.org/workunit/33809133

Is there some sort of cross-platform validation issue going on, or should I suspect my host?

Bikeman (Heinz-Bernd Eggenstein)
Moderator
Joined: 28 Aug 06
Posts: 3522
Credit: 811313109
RAC: 1293288

Quote:

Over the last few weeks, I've had a number of results which didn't validate.

In every case, this has been where my host (linux, running 4.21) was validating against windows hosts, running 4.17; the windows hosts seem to have won. I have never seen a problem where my host is validating against other linux hosts, my results are always valid otherwise.

An example of such a workunit is http://einsteinathome.org/workunit/33809133

Is there some sort of cross-platform validation issue going on, or should I suspect my host?

There is a known cross-platform validation issue as you suspected, no need to mistrust your host :-)
CU

BRM

Alf da Dinnamint
Joined: 21 Feb 05
Posts: 4
Credit: 499600
RAC: 0

Oh... and I see I could have found that out myself if I'd looked a bit harder in these forums. Doh. Thanks very much for the reply, BRM.

Conan
Joined: 19 Jun 05
Posts: 172
Credit: 8689330
RAC: 5927

Here's another one that lost out in the Linux (me) versus Windows validation problem. I have been lucky so far as this is the first one I seem to recall, but with about a dozen still pending I could get others.

33948225

Hope it helps.

Admin
Joined: 4 Sep 06
Posts: 1
Credit: 26999
RAC: 0

Message 67963 in response to message 67962

Here is a result where I lost out, with me running Windows app 4.17. The other two results, which validated to make quorum, were obtained from Linux app 4.21.

http://einsteinathome.org/workunit/33926687

We should be getting credit for these "invalid" results, since the problem appears to be with the Linux app 4.21 beta.

LP.

leg
Joined: 25 Feb 05
Posts: 7
Credit: 1050093693
RAC: 454130

I've had 3-4 linux units thrown out with the same problem in the last few months. Most recent will be 33926016 (http://einsteinathome.org/workunit/33926016). You can tell because as soon as you report the result it is sent to a third machine. Actually the problem goes both ways. The subsequent results have to match the first machine to complete successfully. I had one result where my linux machine finished first, a windows machine finishing 2nd received no credit, and the unit was sent to a 3rd machine which was running linux.

This seems to affect linux machines more because I believe the linux machines are generally older and slower and more heavily loaded. The linux machines will thus finish second more frequently. This is true in my case at least.

Admittedly my coding days are long past, but it does not seem to me it would be that difficult to program the E@H servers to check the operating system of clients and only send a particular WU to machines with the same operating system. I don't know if this is a boinc problem or an E@H problem since I only run E@H. It does make you wonder if it is worthwhile in terms of cost to client and project to run E@H on these older linux machines.
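
For readers following along, here is a rough Python sketch of the quorum behaviour described in the posts above: two results that agree make quorum, and a mismatch means the unit goes out to a third machine. This is not the project's validator. The `Result` class, the `agree` tolerance check, the host names and the numbers are all made up for illustration, and the thread does not say how results are actually compared.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    host: str        # hypothetical host label
    platform: str    # "linux" or "windows"
    value: float     # stand-in for the real science output (assumption)


def agree(a: Result, b: Result, rel_tol: float = 1e-5) -> bool:
    """Assumed comparison rule: two results agree if their values are close."""
    return math.isclose(a.value, b.value, rel_tol=rel_tol)


def check_quorum(results, quorum=2):
    """Return (valid, invalid) once `quorum` results agree, else None.

    None means no quorum yet, so another copy of the unit must be sent out.
    """
    for candidate in results:
        group = [r for r in results if agree(candidate, r)]
        if len(group) >= quorum:
            losers = [r for r in results if r not in group]
            return group, losers
    return None


# Illustrative values only: the two platforms differ slightly.
linux_a = Result("host-a", "linux", 1.000000)
windows_b = Result("host-b", "windows", 1.001000)
linux_c = Result("host-c", "linux", 1.000000)

# Two results that disagree: no quorum, so a third machine is needed.
first_pass = check_quorum([linux_a, windows_b])

# The third result sides with one of the first two; the odd one out loses.
second_pass = check_quorum([linux_a, windows_b, linux_c])
```

In this toy run the third host happens to be linux, so the windows result is the one marked invalid. Swap the platform of the third host and the outcome flips, which fits the observation that the problem goes both ways.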

Alinator
Joined: 8 May 05
Posts: 927
Credit: 9352143
RAC: 0

Message 67965 in response to message 67964

Quote:

I've had 3-4 linux units thrown out with the same problem in the last few months. Most recent will be 33926016 (http://einsteinathome.org/workunit/33926016). You can tell because as soon as you report the result it is sent to a third machine. Actually the problem goes both ways. The subsequent results have to match the first machine to complete successfully. I had one result where my linux machine finished first, a windows machine finishing 2nd received no credit, and the unit was sent to a 3rd machine which was running linux.

This seems to affect linux machines more because I believe the linux machines are generally older and slower and more heavily loaded. The linux machines will thus finish second more frequently. This is true in my case at least.

Admittedly my coding days are long past, but it does not seem to me it would be that difficult to program the E@H servers to check the operating system of clients and only send a particular WU to machines with the same operating system. I don't know if this is a boinc problem or an E@H problem since I only run E@H. It does make you wonder if it is worthwhile in terms of cost to client and project to run E@H on these older linux machines.

Actually the ability to check host platforms is already built into the BOINC framework. It's called Homogeneous Redundancy, but the issue is it was never required on EAH before. The problem has been with the new app being tried out now on this de facto beta S5R2 run.

Other than that, EAH is one of the most old-host-friendly projects around, since they don't issue a useless result by default just to keep the DIGs (Demands Instant Gratification) happy. This means when things are 'normal' around here you are reasonably assured that if your host completes the result successfully and on time, the effort it put in was actually helping the science.

Alinator
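
To make the Homogeneous Redundancy idea from the post above a bit more concrete, here is a minimal Python sketch of the general principle: the first host to receive a workunit fixes its platform class, and later copies only go to hosts in the same class. This is only an illustration, not BOINC's scheduler code. `Workunit`, `platform_class` and `try_send` are hypothetical names, and the coarse OS-only classes are an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Workunit:
    wu_id: int                      # hypothetical ID, not a real workunit
    hr_class: Optional[str] = None  # unset until the first copy is sent


def platform_class(os_name: str) -> str:
    """Map an OS string to a coarse class (assumed: OS family only)."""
    name = os_name.lower()
    for known in ("windows", "linux", "darwin"):
        if known in name:
            return known
    return "other"


def try_send(wu: Workunit, host_os: str) -> bool:
    """Send a copy of `wu` only if the host matches its platform class.

    The first host to get the workunit pins the class for all later copies.
    """
    host_class = platform_class(host_os)
    if wu.hr_class is None:
        wu.hr_class = host_class
        return True
    return wu.hr_class == host_class


wu = Workunit(wu_id=1)
# A linux host asks first, then a windows host, then another linux host.
sent = [try_send(wu, os_name)
        for os_name in ("Linux", "Microsoft Windows XP", "Linux")]
```

With a rule like this, a linux result would only ever be compared against other linux results, which is the fix suggested earlier in the thread. The trade-off is that a workunit may wait longer for a matching host.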
