I haven't seen any mention of MSI motherboards for kernel HWMON updates for sensor drivers.
Main activity for the past year has been for better sensor support for Asus and Gigabyte boards of their discrete SIO monitoring chips.
Latest update is to the generic k10temp driver but that is the one that I am waiting to be merged into the 6.3 kernel at the end of this month.
A lot of motherboards got sensor support in that driver, but I never have seen any MSI motherboards in the driver k10temp code.
Your board may be supported by one of the old ITE SIO chips or one of the Nuvoton SIO chips already. So may be covered by the old it87 driver or the nct6775 driver.
Mesa pieces other than opencl is probably ok. You can uninstall mesa opencl or just disable it by removing mesa.icd from /etc/OpenCL/vendors/. To uninstall dpkg -l | grep -i opencl and you will probably see something named mesa-opencl-icd. You can then use sudo dpkg -P mesa-opencl-icd (or whatever the correct named is) followed by sudo apt autoremove and a reboot.
Install (as before) with: sudo amdgpu-install -y --usecase=opencl --opencl=legacy Check /etc/OpenCL/vendors and you should see a file named something like amdocl64.icd with contents that point to the correct amd lib for opencl, libamdocl64.so.
*** corrected amd icd file/lib names
Wondering if you have any suggestions. I powered down the box, pulled the RX 580 out and replaced it with an RX 6600. As near as I can tell from the release notes it uses the same amdgpu-install_5.4.50403-1_all driver package. But BOINC is squawking about missing GPU.
Wed 19 Apr 2023 04:21:24 PM CDT | | No usable GPUs found
Wed 19 Apr 2023 04:21:24 PM CDT | | app version refers to missing GPU type ATI
For now I'll put the RX 580 back in. Any suggestions much appreciated.
$ clinfo
Number of platforms 1
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.1 AMD-APP (3380.4)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Host timer resolution 1ns
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Number of devices 0
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
clCreateContext(NULL, ...) [default] No platform
clCreateContext(NULL, ...) [other] No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No devices found in platform
RX580 is legacy, RX6600 is newer tech and uses rocm. So re-install driver with: sudo amdgpu-install -y --usecase=opencl --opencl=legacy,rocm --accept-eula
RX580 is legacy, RX6600 is newer tech and uses rocm. So re-install driver with: sudo amdgpu-install -y --usecase=opencl --opencl=legacy,rocm --accept-eula
Is this telling me I can reinstall the driver tonight with the RX 580 still in, per above and swap the cards tomorrow? Or in other words, with rocr added in there, it'll run either the Polaris or the Navi cards?
Yes, you can install opencl for either card separately (--opencl=legacy or --opencl=rocr) or as above (--opencl=legacy,rocr) for both. Doing this will allow changing cards (or running both together) without having to do a driver install for each.
At some point earlier yesterday I turned my iGPU back on in BIOS and set it to "forced" and with a small udma buffer (was 64mb, now @ 256mb) and hooked the KVM switch to it instead of the RX 580. No problems with it at this point.
Last night I pulled out the RX 580 and put the RX 6600 in then did a
I had some issues with BOINC now seeing another ATI device that I assume must have been the iGPU in the Ryzen 5 5600G. All GPU WUs were erroring out. U suspended GPU usage and went off to investigate
<ignore_ati_dev>0</ignore_ati_dev>
I didn't get things working so un-installed amdgpu again and reinstalled with just opencl=legacy with the RX 580 back in.
This morning I shut that box down again, swapped the RX 580 for the RX 6600 and did the uninstall and amdgpu-install. This time I JUST specified
Radeon-Profile comes up showing the two devices but the 1st shows a 0 MHZ clock, 875 Mhz mem clock, 51C and 0 rpm:
VGA: Advanced Micro Devices, Inc. [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] (rev c7)
VGA: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne [Radeon Vega Series / Radeon Vega Mobile Series] (rev c8)
OpenGL driver: radeonsi
Kernel driver: amdgpu
name of display: :0
display: :0 screen: 0
direct rendering: Yes
and
$ clinfo
Number of platforms 1
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.1 AMD-APP (3513.0)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback
Platform Host timer resolution 1ns
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Number of devices 2
Device Name gfx1032
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 2.0
Driver Version 3513.0 (HSA1.1,LC)
Device OpenCL C Version OpenCL C 2.0
Device Type GPU
Device Board Name (AMD) AMD Radeon RX 6600
Device Topology (AMD) PCI-E, 12:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 14
SIMD per compute unit (AMD) 4
SIMD width (AMD) 32
SIMD instruction width (AMD) 1
Max clock frequency 2750MHz
Graphics IP (AMD) 10.3
Device Partition (core)
Max number of sub-devices 14
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple 32
Wavefront width (AMD) 32
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (cl_khr_fp16)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs No
Round to nearest No
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 8573157376 (7.984GiB)
Global free memory (AMD) 8372224 (7.984GiB)
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 4
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 7287183768 (6.787GiB)
Unified memory for Host and Device No
Shared Virtual Memory (SVM) capabilities (core)
Coarse-grained buffer sharing Yes
Fine-grained buffer sharing Yes
Fine-grained system sharing No
Atomics No
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Preferred alignment for atomics
SVM 0 bytes
Global 0 bytes
Local 0 bytes
Max size for global variable 7287183768 (6.787GiB)
Preferred total size of global vars 8573157376 (7.984GiB)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 29695
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 8192 images
Base address alignment for 2D image buffers 256 bytes
Pitch alignment for 2D image buffers 256 pixels
Max 2D image size 16384x16384 pixels
Max 3D image size 16384x16384x8192 pixels
Max number of read image args 128
Max number of write image args 8
Max number of read/write image args 64
Max number of pipe args 16
Max active pipe reservations 16
Max pipe packet size 2992216472 (2.787GiB)
Local memory type Local
Local memory size 65536 (64KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 7287183768 (6.787GiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties (on host)
Out-of-order execution No
Profiling Yes
Queue properties (on device)
Out-of-order execution Yes
Profiling Yes
Preferred size 262144 (256KiB)
Max size 8388608 (8MiB)
Max queues on device 1
Max events on device 1024
Prefer user sync for interop Yes
Number of P2P devices (AMD) 1
P2P devices (AMD) <printDeviceInfo:147: get number of CL_DEVICE_P2P_DEVICES_AMD : error -30>
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 0ns (Wed Dec 31 18:00:00 1969)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) No
Number of async queues (AMD) 8
Max real-time compute queues (AMD) 8
Max real-time compute units (AMD) 14
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_media_ops cl_amd_media_ops2 cl_khr_image2d_from_buffer cl_khr_subgroups cl_khr_depth_images cl_amd_copy_buffer_p2p cl_amd_assembly_program
Device Name gfx90c:xnack-
Device Vendor Advanced Micro Devices, Inc.
Device Vendor ID 0x1002
Device Version OpenCL 2.0
Driver Version 3513.0 (HSA1.1,LC)
Device OpenCL C Version OpenCL C 2.0
Device Type GPU
Device Board Name (AMD) (n/a)
Device Topology (AMD) PCI-E, 30:00.0
Device Profile FULL_PROFILE
Device Available Yes
Compiler Available Yes
Linker Available Yes
Max compute units 8
SIMD per compute unit (AMD) 4
SIMD width (AMD) 16
SIMD instruction width (AMD) 1
Max clock frequency 2000MHz
Graphics IP (AMD) 9.0
Device Partition (core)
Max number of sub-devices 8
Supported partition types None
Supported affinity domains (n/a)
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 256
Preferred work group size (AMD) 256
Max work group size (AMD) 1024
Preferred work group size multiple <getWGsizes:1200: create kernel : error -6>
Wavefront width (AMD) 64
Preferred / native vector sizes
char 4 / 4
short 2 / 2
int 1 / 1
long 1 / 1
half 1 / 1 (cl_khr_fp16)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (cl_khr_fp16)
Denormals No
Infinity and NANs No
Round to nearest No
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Address bits 64, Little-Endian
Global memory size 536870912 (512MiB)
Global free memory (AMD) 524288 (512MiB)
Global memory channels (AMD) 4
Global memory banks per channel (AMD) 4
Global memory bank width (AMD) 256 bytes
Error Correction support No
Max memory allocation 456340272 (435.2MiB)
Unified memory for Host and Device No
Shared Virtual Memory (SVM) capabilities (core)
Coarse-grained buffer sharing Yes
Fine-grained buffer sharing Yes
Fine-grained system sharing No
Atomics No
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Preferred alignment for atomics
SVM 0 bytes
Global 0 bytes
Local 0 bytes
Max size for global variable 456340272 (435.2MiB)
Preferred total size of global vars 536870912 (512MiB)
Global Memory cache type Read/Write
Global Memory cache size 16384 (16KiB)
Global Memory cache line size 64 bytes
Image support Yes
Max number of samplers per kernel 5688
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 8192 images
Base address alignment for 2D image buffers 256 bytes
Pitch alignment for 2D image buffers 256 pixels
Max 2D image size 16384x16384 pixels
Max 3D image size 16384x16384x8192 pixels
Max number of read image args 128
Max number of write image args 8
Max number of read/write image args 64
Max number of pipe args 16
Max active pipe reservations 16
Max pipe packet size 456340272 (435.2MiB)
Local memory type Local
Local memory size 65536 (64KiB)
Local memory syze per CU (AMD) 65536 (64KiB)
Local memory banks (AMD) 32
Max number of constant args 8
Max constant buffer size 456340272 (435.2MiB)
Preferred constant buffer size (AMD) 16384 (16KiB)
Max size of kernel argument 1024
Queue properties (on host)
Out-of-order execution No
Profiling Yes
Queue properties (on device)
Out-of-order execution Yes
Profiling Yes
Preferred size 262144 (256KiB)
Max size 8388608 (8MiB)
Max queues on device 1
Max events on device 1024
Prefer user sync for interop Yes
Number of P2P devices (AMD) 0
P2P devices (AMD) <printDeviceInfo:147: get number of CL_DEVICE_P2P_DEVICES_AMD : error -30>
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 0ns (Wed Dec 31 18:00:00 1969)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Thread trace supported (AMD) No
Number of async queues (AMD) 8
Max real-time compute queues (AMD) 8
Max real-time compute units (AMD) 8
printf() buffer size 4194304 (4MiB)
Built-in kernels (n/a)
Device Extensions cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_media_ops cl_amd_media_ops2 cl_khr_image2d_from_buffer cl_khr_subgroups cl_khr_depth_images cl_amd_copy_buffer_p2p cl_amd_assembly_program
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
clCreateContext(NULL, ...) [default] No platform
clCreateContext(NULL, ...) [other] Success [AMD]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) Success (1)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx1032
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) Success (2)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx1032
Device Name gfx90c:xnack-
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) Success (2)
Platform Name AMD Accelerated Parallel Processing
Device Name gfx1032
Device Name gfx90c:xnack-
Just to see if anything changes and to simplify the picture I'm going to go back into BIOS and disable the iGPU, plug the monitor back into the RX 6600.
Result edit pending. -- No DIFF. RX 6600 now getting comp errors on MW WUs.
Only option I see left is to uninstall the driver and reinstall with both legacy & rocm.
Disabling the iGPU might be ok for boinc, but at the system level what's happening? Disable iGPU in the bios to get it out of the picture. You have another system info utility with rocr, rocminfo - you could run this...
Back to /etc/OpenCL/vendors/: What .icd files are now present and what libs do they point to?
The wu errors you are getting seem to be inside a GW app. Might be a good time to try a FGRP task and see if that works.
I haven't seen any mention of
)
I haven't seen any mention of MSI motherboards for kernel HWMON updates for sensor drivers.
Main activity for the past year has been for better sensor support for Asus and Gigabyte boards of their discrete SIO monitoring chips.
Latest update is to the generic k10temp driver but that is the one that I am waiting to be merged into the 6.3 kernel at the end of this month.
A lot of motherboards got sensor support in that driver, but I never have seen any MSI motherboards in the driver k10temp code.
Your board may be supported by one of the old ITE SIO chips or one of the Nuvoton SIO chips already. So may be covered by the old it87 driver or the nct6775 driver.
github lm-sensors
)
github lm-sensors
Yes, looks like lm-sensors
)
Yes, looks like lm-sensors gets a sensor configuration entry for proper scaling.
But it still needs to be supported either in the k10temp or nct6775 driver.
The nct6775 driver has had a lot of activitity lately adding in all the new permutations of the new NCT SIO chips.
But none of those changes have made it into kernel updates yet.
mountkidd wrote:Mesa pieces
)
Wondering if you have any suggestions. I powered down the box, pulled the RX 580 out and replaced it with an RX 6600. As near as I can tell from the release notes it uses the same amdgpu-install_5.4.50403-1_all driver package. But BOINC is squawking about missing GPU.
For now I'll put the RX 580 back in. Any suggestions much appreciated.
Skip
PS:
$ inxi -G
Graphics: Device-1: Advanced Micro Devices [AMD/ATI] Navi 23 [Radeon RX 6600/6600 XT/6600M] driver: amdgpu v: 5.18.13
Display: x11 server: X.Org 1.20.13 driver: amdgpu,ati unloaded: fbdev,modesetting,radeon,vesa
resolution: 1920x1080~60Hz
OpenGL: renderer: AMD Radeon RX 6600 (DIMGREY_CAVEFISH DRM 3.48.0 5.15.0-69-generic LLVM 12.0.0) v: 4.6 Mesa 21.2.6
PPS:
$ clinfo
Number of platforms 1
Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.1 AMD-APP (3380.4)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Host timer resolution 1ns
Platform Extensions function suffix AMD
Platform Name AMD Accelerated Parallel Processing
Number of devices 0
NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
clCreateContext(NULL, ...) [default] No platform
clCreateContext(NULL, ...) [other] No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No devices found in platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No devices found in platform
RX580 is legacy, RX6600 is
)
RX580 is legacy, RX6600 is newer tech and uses rocm. So re-install driver with: sudo amdgpu-install -y --usecase=opencl --opencl=legacy,rocm --accept-eula
mountkidd wrote:RX580 is
)
Is this telling me I can reinstall the driver tonight with the RX 580 still in, per above and swap the cards tomorrow? Or in other words, with rocr added in there, it'll run either the Polaris or the Navi cards?
And thank you for the quick reply,
Skip
Yes, you can install opencl
)
Yes, you can install opencl for either card separately (--opencl=legacy or --opencl=rocr) or as above (--opencl=legacy,rocr) for both. Doing this will allow changing cards (or running both together) without having to do a driver install for each.
At some point earlier
)
At some point earlier yesterday I turned my iGPU back on in BIOS and set it to "forced" and with a small udma buffer (was 64mb, now @ 256mb) and hooked the KVM switch to it instead of the RX 580. No problems with it at this point.
Last night I pulled out the RX 580 and put the RX 6600 in then did a
and then
sudo reboot
I didn't get things working so un-installed amdgpu again and reinstalled with just opencl=legacy with the RX 580 back in.
This morning I shut that box down again, swapped the RX 580 for the RX 6600 and did the uninstall and amdgpu-install. This time I JUST specified
Rebooted and this resulted in the iGPU showing up again but it was taken care of for now with
It became device 1 this time and the RX 6600 was 0. Never the less...
All the E@H WUs (only ones tried) got comp errors after a few seconds with some info in the task detail:
https://einsteinathome.org/workunit/724282248
Radeon-Profile comes up showing the two devices but the 1st shows a 0 MHZ clock, 875 Mhz mem clock, 51C and 0 rpm:
and
Just to see if anything changes and to simplify the picture I'm going to go back into BIOS and disable the iGPU, plug the monitor back into the RX 6600.
Result edit pending. -- No DIFF. RX 6600 now getting comp errors on MW WUs.
Only option I see left is to uninstall the driver and reinstall with both legacy & rocm.
you need to look at the first
)
you need to look at the first error in the chain, not the last one.
"OpenCL CreateCommandQueue failed with OpenCL error: CL_OUT_OF_HOST_MEMORY"
that's your error.
_________________________________________________________________________
Disabling the iGPU might be
)
Disabling the iGPU might be ok for boinc, but at the system level what's happening? Disable iGPU in the bios to get it out of the picture. You have another system info utility with rocr, rocminfo - you could run this...
Back to /etc/OpenCL/vendors/: What .icd files are now present and what libs do they point to?
The wu errors you are getting seem to be inside a GW app. Might be a good time to try a FGRP task and see if that works.