Nvidia GPU - X server found. dri2 connection failed! -- signal handler called: signal 6

rjs5
Joined: 3 Jul 05
Posts: 32
Credit: 656940790
RAC: 1429390
Topic 213958

I can run PrimeGrid, SETI@home, and MilkyWay@home tasks without problems. Einstein@Home is the only project whose tasks fail.

Does anyone see anything in my configuration that would only cause a problem with Einstein?

I don't use the screen saver, and this machine has plenty of resources available for compute.

file hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia
hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia: ELF 64-bit LSB executable, x86-64, version 1 (SYSV), dynamically linked, interpreter /lib64/ld-linux-x86-64.so.2, for GNU/Linux 2.6.8, with debug_info, not stripped

ldd hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia
    linux-vdso.so.1 (0x00007ffffb9c1000)
    libOpenCL.so.1 => /lib64/libOpenCL.so.1 (0x00007f5b0650b000)
    libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f5b062ec000)
    libdl.so.2 => /lib64/libdl.so.2 (0x00007f5b060e8000)
    libm.so.6 => /lib64/libm.so.6 (0x00007f5b05d93000)
    libc.so.6 => /lib64/libc.so.6 (0x00007f5b059b0000)
    /lib64/ld-linux-x86-64.so.2 (0x00007f5b0672b000)
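
The `ldd` output shows that the app links against the generic `libOpenCL.so.1` loader, not against a vendor library directly, so the loader decides at run time which OpenCL implementations to open. A minimal sketch for listing what is registered with the loader follows. It assumes the loader reads the conventional `/etc/OpenCL/vendors` directory, which may not hold on every system, and the function name `list_opencl_icds` is only illustrative.

```python
from pathlib import Path


def list_opencl_icds(vendors_dir="/etc/OpenCL/vendors"):
    """Return (icd file name, library it names) pairs, sorted by file name.

    Assumption: vendors_dir is the directory the ICD loader scans.
    """
    root = Path(vendors_dir)
    if not root.is_dir():
        return []
    entries = []
    for icd in sorted(root.glob("*.icd")):
        # Each .icd file is a short text file naming the vendor library
        # that the loader will dlopen().
        library = icd.read_text(errors="replace").strip()
        entries.append((icd.name, library))
    return entries


if __name__ == "__main__":
    for name, library in list_opencl_icds():
        print(f"{name}: {library}")
```

Each line printed is one OpenCL platform that the loader will try to open when an app calls `clGetPlatformIDs`, which makes it easy to compare against the platforms BOINC reports at startup.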

| Starting BOINC client version 7.9.2 for x86_64-pc-linux-gnu
| This a development version of BOINC and may not function properly
| log flags: file_xfer, sched_ops, task
| Libraries: libcurl/7.55.1 OpenSSL/1.1.0g zlib/1.2.11 libidn2/2.0.4 libpsl/0.18.0 (+libidn2/2.0.3) libssh2/1.8.0 nghttp2/1.25.0
| Data directory: /home/rjs
| CUDA: NVIDIA GPU 0: GeForce GTX 1080 (driver version 390.25, CUDA version 9.1, compute capability 6.1, 4096MB, 3980MB available, 8876 GFLOPS peak)
| OpenCL: NVIDIA GPU 0: GeForce GTX 1080 (driver version 390.25, device version OpenCL 1.2 CUDA, 8119MB, 3980MB available, 8876 GFLOPS peak)
| OpenCL CPU: pthread-Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz (OpenCL driver vendor: The pocl project, driver version 0.15-pre, device version OpenCL 1.2 pocl HSTR: pthread-x86_64-unknown-linux-gnu-haswell)
| [libc detection] gathered: 2.26, GNU libc
| Host name: sky1151
| Processor: 12 GenuineIntel Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz [Family 6 Model 158 Stepping 10]
| Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb invpcid_single pti tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
| OS: Linux Fedora: Fedora 27 (Workstation Edition) [4.15.7-300.fc27.x86_64|libc 2.26 (GNU libc)]
| Memory: 15.59 GB physical, 7.86 GB virtual
| Disk: 857.95 GB total, 804.87 GB free

Task 735524172
Name: LATeah0056L_892.0_0_0.0_2279080_1
Workunit ID: 342784517
Created: 4 Mar 2018 19:32:23 GMT
Sent: 4 Mar 2018 20:29:37 GMT
Report deadline: 18 Mar 2018 20:29:37 GMT
Received: 4 Mar 2018 20:31:46 GMT
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 6 (0x00000006) Unknown error code
Computer: 12631633
Run time (sec): 2.06
CPU time (sec): 0.11
Peak working set size (MB): 0
Peak swap size (MB): 0
Peak disk usage (MB): 0.03
Validation state: Invalid
Granted credit: 0
Application: Gamma-ray pulsar binary search #1 on GPUs v1.20 (FGRPopencl1K-nvidia) x86_64-pc-linux-gnu
Stderr output

<core_client_version>7.8.4</core_client_version>
<![CDATA[
<message>
process exited with code 6 (0x6, -250)</message>
<stderr_txt>
12:28:27 (31287): [normal]: This Einstein@home App was built at: Feb 15 2017 10:50:14

12:28:27 (31287): [normal]: Start of BOINC application '../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia'.
12:28:27 (31287): [debug]: 1e+16 fp, 4.8e+09 fp/s, 2173102 s, 603h38m21s76
12:28:27 (31287): [normal]: % CPU usage: 1.000000, GPU usage: 1.000000
command line: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia --inputfile ../../projects/einstein.phys.uwm.edu/LATeah0056L.dat --alpha 4.42281478648 --delta -0.0345027837249 --skyRadius 2.152570e-06 --ldiBins 15 --f0start 884.0 --f0Band 8.0 --firstSkyPoint 0 --numSkyPoints 1 --f1dot -1e-13 --f1dotBand 1e-13 --df1dot 3.344368011e-15 --ephemdir ../../projects/einstein.phys.uwm.edu/JPLEPH --Tcoh 2097152.0 --toplist 10 --cohFollow 10 --numCells 1 --useWeights 1 --Srefinement 1 --CohSkyRef 1 --cohfullskybox 1 --mmfu 0.1 --reftime 56100 --model 0 --f0orbit 0.005 --mismatch 0.1 --demodbinary 1 --BinaryPointFile ../../projects/einstein.phys.uwm.edu/templates_LATeah0056L_0892_2279080.dat --debug 1 --device 0 -o LATeah0056L_892.0_0_0.0_2279080_1_0.out
output files: 'LATeah0056L_892.0_0_0.0_2279080_1_0.out' '../../projects/einstein.phys.uwm.edu/LATeah0056L_892.0_0_0.0_2279080_1_0' 'LATeah0056L_892.0_0_0.0_2279080_1_0.out.cohfu' '../../projects/einstein.phys.uwm.edu/LATeah0056L_892.0_0_0.0_2279080_1_1'
12:28:27 (31287): [debug]: Flags: X64 SSE SSE2 GNUC X86 GNUX86
12:28:27 (31287): [debug]: glibc version/release: 2.26/stable
12:28:27 (31287): [debug]: Set up communication with graphics process.
X server found. dri2 connection failed!
Device open failed, aborting...
free(): invalid pointer

-- signal handler called: signal 6
1 stack frames obtained for this thread:
Frame 31:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x48b261)
    Source file: hs_boinc_extras.c (Function: sighandler / Line: 290)
Frame 30:
    Binary file: /lib64/libc.so.6 (0x7f178e8bb66b)
    Offset info: gsignal+0xcb
Frame 29:
    Binary file: /lib64/libc.so.6 (0x7f178e8bb66b)
    Offset info: gsignal+0xcb
Frame 28:
    Binary file: /lib64/libc.so.6 (0x7f178e8bd381)
    Offset info: abort+0x141
Frame 27:
    Binary file: /lib64/libc.so.6 (0x7f178e905a57)
    Offset info: +0x81a57
Frame 26:
    Binary file: /lib64/libc.so.6 (0x7f178e90c9aa)
    Offset info: +0x889aa
Frame 25:
    Binary file: /lib64/libc.so.6 (0x7f178e90f47c)
    Offset info: +0x8b47c
Frame 24:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x6a7598)
    Offset info: _ZNSt13runtime_errorD2Ev+0x58
    Source file: basic_string.h (Function: S= / Line: 249)
    Source file: basic_string.h (Function: ~basic_string / Line: 539)
    Source file: stdexcept.cc (Function: S= / Line: 68)
Frame 23:
    Binary file: /lib64/libMesaOpenCL.so.1 (0x7f1783b989be)
    Offset info: +0x209be
Frame 22:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x69992f)
    Source file: eh_throw.cc (Function:  / Line: 52)
Frame 21:
    Binary file: /lib64/libMesaOpenCL.so.1 (0x7f1783c1aa3f)
    Offset info: +0xa2a3f
Frame 20:
    Binary file: /lib64/libMesaOpenCL.so.1 (0x7f1783bc48e4)
    Offset info: +0x4c8e4
Frame 19:
    Binary file: /lib64/libMesaOpenCL.so.1 (0x7f1783bc4914)
    Offset info: +0x4c914
Frame 18:
    Binary file: /lib64/ld-linux-x86-64.so.2 (0x7f178f60fc13)
    Offset info: +0x10c13
Frame 17:
    Binary file: /lib64/ld-linux-x86-64.so.2 (0x7f178f614b6a)
    Offset info: +0x15b6a
Frame 16:
    Binary file: /lib64/libc.so.6 (0x7f178e9dffff)
    Offset info: _dl_catch_error+0x8f
Frame 15:
    Binary file: /lib64/ld-linux-x86-64.so.2 (0x7f178f614079)
    Offset info: +0x15079
Frame 14:
    Binary file: /lib64/libdl.so.2 (0x7f178efbcf96)
    Offset info: +0xf96
Frame 13:
    Binary file: /lib64/libc.so.6 (0x7f178e9dffff)
    Offset info: _dl_catch_error+0x8f
Frame 12:
    Binary file: /lib64/libdl.so.2 (0x7f178efbd715)
    Offset info: +0x1715
Frame 11:
    Binary file: /lib64/libdl.so.2 (0x7f178efbd021)
    Offset info: dlopen+0x41
Frame 10:
    Binary file: /lib64/libOpenCL.so.1 (0x7f178f3e4a82)
    Offset info: +0x5a82
Frame 9:
    Binary file: /lib64/libOpenCL.so.1 (0x7f178f3e6a74)
    Offset info: clGetPlatformIDs+0x114
Frame 8:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x5baf44)
    Offset info: _Z24boinc_get_opencl_ids_auxPciiPP13_cl_device_idPP15_cl_platform_id+0x74
    Source file: unknown (Function:  / Line: 0)
Frame 7:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x5bb46a)
    Offset info: _Z20boinc_get_opencl_idsPP13_cl_device_idPP15_cl_platform_id+0xe6
    Source file: unknown (Function:  / Line: 0)
Frame 6:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x48bc66)
    Offset info: eah_boinc_get_opencl_ids+0x26
    Source file: hs_boinc_options.cpp (Function: eah_boinc_get_opencl_ids / Line: 136)
Frame 5:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x48dcf4)
    Offset info: gen_fft_get_ctx+0x44
    Source file: unknown (Function: gen_fft_get_ctx / Line: 0)
Frame 4:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x47975c)
    Offset info: MAIN+0x15c
    Source file: HSgammaPulsar.c (Function: MAIN / Line: 4251)
Frame 3:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x46c0ff)
    Offset info: main+0x5ff
    Source file: hs_boinc_extras.c (Function: worker / Line: 832)
    Source file: hs_boinc_extras.c (Function: main / Line: 1038)
Frame 2:
    Binary file: /lib64/libc.so.6 (0x7f178e8a500a)
    Offset info: __libc_start_main+0xea
Frame 1:
    Binary file: ../../projects/einstein.phys.uwm.edu/hsgamma_FGRPB1G_1.20_x86_64-pc-linux-gnu__FGRPopencl1K-nvidia (0x46e5f9)
    Source file: unknown (Function: _start / Line: 0)

End of stcaktrace
12:28:28 (31287): called boinc_finish

</stderr_txt>
]]>
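
The trace is long, so it can help to reduce it to the distinct binaries involved. In the frames above, `libMesaOpenCL.so.1` appears between `clGetPlatformIDs` and `abort`. The sketch below pulls the `Binary file:` paths out of a trace like this one and looks up the name of signal 6. The helper name `libraries_in_trace` is illustrative, and `sample` is a shortened excerpt of the frames above, not the full output.

```python
import re
import signal


def libraries_in_trace(stderr_text):
    """Return the distinct 'Binary file' paths in order of first appearance."""
    seen = []
    for match in re.finditer(r"Binary file:\s+(\S+)", stderr_text):
        path = match.group(1)
        if path not in seen:
            seen.append(path)
    return seen


# Shortened excerpt of the stack trace above (three frames only).
sample = """\
Frame 28:
    Binary file: /lib64/libc.so.6 (0x7f178e8bd381)
    Offset info: abort+0x141
Frame 23:
    Binary file: /lib64/libMesaOpenCL.so.1 (0x7f1783b989be)
    Offset info: +0x209be
Frame 9:
    Binary file: /lib64/libOpenCL.so.1 (0x7f178f3e6a74)
    Offset info: clGetPlatformIDs+0x114
"""

if __name__ == "__main__":
    for path in libraries_in_trace(sample):
        print(path)
    # Signal numbers are platform specific; on Linux, 6 is SIGABRT.
    print(signal.Signals(6).name)
```

Running the same function on the full `stderr_txt` shows at a glance which OpenCL libraries were loaded before the abort, without reading every frame.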