Optomized S5 SSE3

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

RE: RE: I think if the

Message 39169 in response to message 39166

Quote:
Quote:
I think if the validator doesn't accept one digit fault then it will not accept 7 digits fault too...

I compare result which produce some opti app. (I write here only first row)

WU (output file): l1_0229.0_S5R1__2611_S5R1a_1_0
Original app:
229.499118078 0.010001 -0.843352 -1.7676e-009 32.6857
S5S007:
229.499118078 0.010001 -0.843352 -1.7676e-009 32.6857
S5T0308:
229.499115044 4.87049 0.871399 6.8924e-011 654.035

That isn't problem of one digit. S5T0308 produce totally incompatible results! I don't know whay?

There are only one important difference between S5S0007 and S5T0308, the rounding of the last bits. I think you can find out that how can S5T0308 produces these results. It isn't too difficult...

LiborA
LiborA
Joined: 8 Dec 05
Posts: 74
Credit: 337135
RAC: 0

RE: RE: RE: I think if

Message 39170 in response to message 39169

Quote:
Quote:
Quote:
I think if the validator doesn't accept one digit fault then it will not accept 7 digits fault too...

I compare result which produce some opti app. (I write here only first row)

WU (output file): l1_0229.0_S5R1__2611_S5R1a_1_0
Original app:
229.499118078 0.010001 -0.843352 -1.7676e-009 32.6857
S5S007:
229.499118078 0.010001 -0.843352 -1.7676e-009 32.6857
S5T0308:
229.499115044 4.87049 0.871399 6.8924e-011 654.035

That isn't problem of one digit. S5T0308 produce totally incompatible results! I don't know whay?

There are only one important difference between S5S0007 and S5T0308, the rounding of the last bits. I think you can find out that how can S5T0308 produces these results. It isn't too difficult...

Maybe for you it is easy, for me is incomprehensible how can rounding of the last bit produce so diferent results.

There is the first row of results from S5T0711:
229.499118078 0.010001 -0.843352 -1.7676e-009 32.6857

It is the same as afficial app, so it seem that S5T0711 produce outcome, which will be valid :)

Athlonheizer
Athlonheizer
Joined: 3 Jun 06
Posts: 33
Credit: 513937
RAC: 0

RE: standard app / 3140,11

Message 39171 in response to message 39168

Quote:

standard app / 3140,11 sec / 52,3min / result
S5T0709 SSE3 / 2385,27 sec / 39,8min / result
S5T0709 SSE3 / 2415,67 sec / 40,3min / result

S5T0711 SSE3 / 2277,33 sec / 38,0min / result
S5T0711 SSE3 / 2334,70 sec / 38,9min / result

All Results are valid.

EDIT: my gain calculation was wrong, here the correct

gain:
S5T0709 = 23-24%
S5T0711 = 25-28%

Okay next week wie arrive the 50%

Athlon

Stay tuned and keep crunching

Joined: 28 Mar 06
Posts: 21
Credit: 395978
RAC: 0

RE: RE: My results vary a

Message 39172 in response to message 39153

Quote:
Quote:

My results vary a bit, don't know why though. The direction the results are going is pretty clear:
Standard: 29073
S5T0003: 34150 (valid)
S5T0709: 14650 (initial, would make around 50% savings compared to standard app!)

I will post further resuts of S5T0709 and see if that monster-speed-up keeps up.

Many thanks to aksof for his work and effort!!

Hey man what kind of hell computer you use?

Athlon

Some Intel with real 3GHz and more with S5T0709 over 30000 Sec.
My AMD 3500+ @ 2650MHz with S5T0709 under 20000 sec.

Using the same CPU than you. AMD 64 Athlon 3500+ but stock, since I have new memory and haven't had time to do some OC and check the stability. I think the 14650 seconds was a faster WU though. I have checked the other result of that WU and the guy using a P4 3Ghz also has some increase in times, without using optimized app. From 70K down to 45K. So I assume it's the WUs.

different WUs I've seen and had so far:
h1_0436.5_S5R1__80_S5R1a slower (~14500)
h1_0436.5_S5R1__76_S5R1a faster (~26700)

Edit: S5T0709 produces valid results with my machine

Nuadormrac
Nuadormrac
Joined: 9 Feb 05
Posts: 76
Credit: 229259947
RAC: 45

OK, grabbing s5to711... Will

OK, grabbing s5to711... Will see how it goes. I'm crunching under Vista Ultimate 64, beta 2. Don't s'pose I could get anymore in a test environment then running a beta operating system and all :D

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

Bernd told me some

Bernd told me some things.

He would like that we use the 4.10 version number for patched files, so you should rename your files and built an app_info.xml file in the future. The next patches will search einstein_S5R1_4.10_windows_intelx86.exe in their directory.

The other thing is that the validator doesn't drop out those results that have only a small difference in the values, like as S4 validation. So, there is possible to built ( a bit ) faster codes!

Athlonheizer
Athlonheizer
Joined: 3 Jun 06
Posts: 33
Credit: 513937
RAC: 0

RE: The other thing is that

Message 39175 in response to message 39174

Quote:
The other thing is that the validator doesn't drop out those results that have only a small difference in the values, like as S4 validation. So, there is possible to built ( a bit ) faster codes!


That sounds very good!

Quote:
Okay next week wie arrive the 50%

Athlon

Other Question: How long does it take to crunch a WU without optimised standard App and without optimised Apps here?
Days - Weeks - Month?

Stay tuned and keep crunching

Akos Fekete
Akos Fekete
Joined: 13 Nov 05
Posts: 561
Credit: 4527270
RAC: 0

S5T0712.dat - eliminated

Message 39176 in response to message 39162

S5T0712.dat

- eliminated double jumps
- reduced amount of FPU macro ops
- removed double loads on general purpose registers

- better SSE register usage
- reduced memory and integer register usage
- optimized branch structure
- faster FPU comparisons

- SSE3 truncation
- some reordered instructions
- automatic SSE/SSE3 usage
- less FPU-memory-FPU operation

CPU: ALL
app: einstein_S5R1_4.10_windows_intelx86.exe

Svenie25
Svenie25
Joined: 21 Mar 05
Posts: 139
Credit: 2436862
RAC: 0

RE: Bernd told me some

Message 39177 in response to message 39174

Quote:

Bernd told me some things.

He would like that we use the 4.10 version number for patched files, so you should rename your files and built an app_info.xml file in the future. The next patches will search einstein_S5R1_4.10_windows_intelx86.exe in their directory.

Could someone please write, how such an app_info.xml have to look? I don´t know much about such things... ;)

Pepperammi
Pepperammi
Joined: 20 Feb 05
Posts: 131
Credit: 437943
RAC: 0

I wrote one but it didn't

I wrote one but it didn't work. don't know why. I've done them before for seti and had no prob.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.