Hello everyone, I returned to my hometown where I have a small headless Intel NUC that runs OMV 5.6.26-1 (i.e. latest).
I performed a BIOS update to this NUC (model NUC6CAYH) and upon plugging it back to a monitor+keyboard, I saw an error that keeps repeating upon every boot, that I had not noticed before (4 entries in each boot, not anymore later):
$ grep -E "error" /var/log/syslog
May 8 11:48:43 NUC-NAS watchdog[1357]: error retry time-out = 60 seconds
May 8 11:50:33 NUC-NAS kernel: [ 9.562856] EXT4-fs (sdb1): re-mounted. Opts: errors=remount-ro
May 8 11:50:33 NUC-NAS kernel: [ 11.463860] EDAC pnd2: Failed to register device with error -22.
May 8 11:50:33 NUC-NAS kernel: [ 11.502157] EDAC pnd2: Failed to register device with error -22.
May 8 11:50:33 NUC-NAS kernel: [ 11.566280] EDAC pnd2: Failed to register device with error -22.
May 8 11:50:33 NUC-NAS kernel: [ 11.618265] EDAC pnd2: Failed to register device with error -22.
May 8 11:50:53 NUC-NAS watchdog[1187]: error retry time-out = 60 seconds
[...]
May 8 14:24:24 NUC-NAS kernel: [ 8.800715] EXT4-fs (sdb1): re-mounted. Opts: errors=remount-ro
May 8 14:24:24 NUC-NAS kernel: [ 10.580472] EDAC pnd2: Failed to register device with error -22.
May 8 14:24:24 NUC-NAS kernel: [ 10.615192] EDAC pnd2: Failed to register device with error -22.
May 8 14:24:24 NUC-NAS kernel: [ 10.675325] EDAC pnd2: Failed to register device with error -22.
May 8 14:24:24 NUC-NAS kernel: [ 10.767136] EDAC pnd2: Failed to register device with error -22.
May 8 14:24:44 NUC-NAS watchdog[1178]: error retry time-out = 60 seconds
[...]
May 8 14:40:34 NUC-NAS kernel: [ 9.218320] EXT4-fs (sdb1): re-mounted. Opts: errors=remount-ro
May 8 14:40:34 NUC-NAS kernel: [ 11.254685] EDAC pnd2: Failed to register device with error -22.
May 8 14:40:34 NUC-NAS kernel: [ 11.297516] EDAC pnd2: Failed to register device with error -22.
May 8 14:40:34 NUC-NAS kernel: [ 11.345416] EDAC pnd2: Failed to register device with error -22.
May 8 14:40:34 NUC-NAS kernel: [ 11.393533] EDAC pnd2: Failed to register device with error -22.
May 8 14:40:45 NUC-NAS watchdog[1181]: error retry time-out = 60 seconds
Alles anzeigen
I tried to find out on other Linux-related forums for information on identifying EDAC and pnd2 device, I only found out that EDAC stands for Error Detection And Correction (EDAC). Most people talk about ECC memory and of course, this Intel NUC does not have ECC memory inside (nor supporting it).
The Intel NUC Visual BIOS has no settings for memory, as perhaps on typical motherboards. And unfortunately, Intel does not allow rolling-back i.e. downgrade BIOS.
The advice to resolve this "EDAC pnd2 failed to register error -22" found was a) update BIOS first, then b) run a previous kernel. The output of my hardware configurations is the following:
$ lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
Address sizes: 39 bits physical, 48 bits virtual
CPU(s): 4
On-line CPU(s) list: 0-3
Thread(s) per core: 1
Core(s) per socket: 4
Socket(s): 1
NUMA node(s): 1
Vendor ID: GenuineIntel
CPU family: 6
Model: 92
Model name: Intel(R) Celeron(R) CPU J3455 @ 1.50GHz
Stepping: 9
CPU MHz: 798.873
CPU max MHz: 2300.0000
CPU min MHz: 800.0000
BogoMIPS: 2995.20
Virtualization: VT-x
L1d cache: 24K
L1i cache: 32K
L2 cache: 1024K
NUMA node0 CPU(s): 0-3
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology tsc_reliable nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 ds_cpl vmx est tm2 ssse3 sdbg cx16 xtpr pdcm sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave rdrand lahf_lm 3dnowprefetch cpuid_fault cat_l2 ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust smep erms mpx rdt_a rdseed smap clflushopt intel_pt sha_ni xsaveopt xsavec xgetbv1 xsaves dtherm ida arat pln pts md_clear arch_capabilities
$ dmidecode --type memory
Getting SMBIOS data from sysfs.
SMBIOS 3.0.0 present.
Handle 0x002E, DMI type 16, 23 bytes
Physical Memory Array
Location: System Board Or Motherboard
Use: System Memory
Error Correction Type: None
Maximum Capacity: 32 GB
Error Information Handle: Not Provided
Number Of Devices: 4
Handle 0x002F, DMI type 17, 40 bytes
Memory Device
Array Handle: 0x002E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: SODIMM
Set: None
Locator: ChannelA-DIMM0
Bank Locator: BANK 0
Type: DDR3
Type Detail: Synchronous
Speed: 1866 MT/s
Manufacturer: Kingston
Part Number: KHX1866C11S3L/4G
Rank: Unknown
Configured Memory Speed: 1866 MT/s
Minimum Voltage: 44.975 V
Maximum Voltage: 44.975 V
Configured Voltage: 1.5 V
Handle 0x0030, DMI type 17, 40 bytes
Memory Device
Array Handle: 0x002E
Error Information Handle: Not Provided
Total Width: 64 bits
Data Width: 64 bits
Size: 4096 MB
Form Factor: SODIMM
Set: None
Locator: ChannelB-DIMM0
Bank Locator: BANK 1
Type: DDR3
Type Detail: Synchronous
Speed: 1866 MT/s
Manufacturer: Kingston
Part Number: KHX1866C11S3L/4G
Rank: Unknown
Configured Memory Speed: 1866 MT/s
Minimum Voltage: 1.35 V
Maximum Voltage: 1.5 V
Configured Voltage: 1.5 V
$ lsmem
RANGE SIZE STATE REMOVABLE BLOCK
0x0000000000000000-0x000000007fffffff 2G online yes 0-15
0x0000000100000000-0x000000027fffffff 6G online yes 32-79
Memory block size: 128M
Total online memory: 8G
Total offline memory: 0B
Alles anzeigen
My Intel NUC has 2x4GB Kingston memory modules (same mode each). Not sure why lsmem reports this as 2+6GB...
I could not find to what relates this pnd2 device, but if anyone from you has encountered this error and understands it or can confirm it's related to ECC memory checking, can you kindly help me fix this error?
Thank you all in advance.