GPU compuation error, OpenSuse, GTX1050

John
John
Joined: 17 Jan 18
Posts: 7
Credit: 3275058426
RAC: 10323597
Topic 213067

All GPU WUs on this machine are failing, with error code 2:

LATeah0051L_1084.0_0_0.0_12139615_1

Workunit ID: 334617487
Created: 17 Jan 2018 15:32:18 GMT
Sent: 17 Jan 2018 16:06:15 GMT
Report deadline: 31 Jan 2018 16:06:15 GMT
Received: 17 Jan 2018 17:12:06 GMT
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 2 (0x00000002) Unknown error code
Computer: 12617713
Run time (sec): 0.00
CPU time (sec): 0.00
Peak working set size (MB): 0
Peak swap size (MB): 0
Peak disk usage (MB): 0
Validation state: Invalid
Granted credit: 0
Application: Gamma-ray pulsar binary search #1 on GPUs v1.20 (FGRPopencl1K-nvidia) x86_64-pc-linux-gnu

 

mikey
mikey
Joined: 22 Jan 05
Posts: 12783
Credit: 1873219624
RAC: 1892407

John_328 wrote:All GPU WUs on

John_328 wrote:

All GPU WUs on this machine are failing, with error code 2:

LATeah0051L_1084.0_0_0.0_12139615_1

Your latest one finished validated!!

 

Gary Roberts
Gary Roberts
Moderator
Joined: 9 Feb 05
Posts: 5874
Credit: 118443431730
RAC: 25897315

John_328 wrote:All GPU WUs on

John_328 wrote:

All GPU WUs on this machine are failing, with error code 2:

....

16:19:23 (2947): [debug]: Flags: X64 SSE SSE2 GNUC X86 GNUX86
16:19:23 (2947): [debug]: glibc version/release: 2.22/stable
16:19:23 (2947): [debug]: Set up communication with graphics process.
Could not create topdir for cache
</stderr_txt>
]]>

 

The task seems to fail immediately, and the error message, "Could not create ..." makes it look very much like a 'permissions' problem.  Do you run BOINC as your normal user or was a special boinc:boinc user and group set up for the purpose?

You should browse the complete BOINC directory structure looking for anything unusual in the ownership/permissions of all files and subdirectories.  During startup, slot directories are created/populated, one for each separate task that is running.  You might be able to find the particular slot directory for the last GPU task that crashed and examine all the remnants there to see if you can get any clues.  I've never felt the need to delve into the slot directories in detail (other than an occasional brief look) so I don't have any experience with which to guide you, unfortunately.  Maybe someone else with more knowledge about this might be able to chime in.

 

Cheers,
Gary.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.