I had a short circuit in my house. After fixing it, the power supply of my OMV machine was dead. I have now replaced it and see random errors on the SSD. The errors may or may not be connected to the short circuit; it is quite possible that I just did not notice them before.
The machine ran for over 24 hours without issues, but today the drive/controller errors reappeared, so they aren't very frequent.
I wonder whether the SATA controller or the SSD is producing these. Does anyone have a clue how to identify the source of the issue?
Here are some logs:
Code
Jul 2 20:41:01 debNAS kernel: [ 220.797519] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:01 debNAS kernel: [ 220.800311] ata2.00: cmd 60/08:88:08:0e:84/00:00:18:00:00/40 tag 17 ncq dma 4096 in
Jul 2 20:41:01 debNAS kernel: [ 220.800311] res 40/00:00:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jul 2 20:41:01 debNAS kernel: [ 220.805974] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.808781] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.811585] ata2.00: cmd 60/08:90:08:76:84/00:00:18:00:00/40 tag 18 ncq dma 4096 in
Jul 2 20:41:02 debNAS kernel: [ 220.811585] res 40/00:00:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.817269] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.820074] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.822894] ata2.00: cmd 60/80:98:e0:3e:ed/00:00:07:00:00/40 tag 19 ncq dma 65536 in
Jul 2 20:41:02 debNAS kernel: [ 220.822894] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.828548] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.831363] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.834169] ata2.00: cmd 60/08:a0:40:13:8c/00:00:13:00:00/40 tag 20 ncq dma 4096 in
Jul 2 20:41:02 debNAS kernel: [ 220.834169] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.839824] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.842633] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.845437] ata2.00: cmd 60/08:a8:28:2a:c1/00:00:01:00:00/40 tag 21 ncq dma 4096 in
Jul 2 20:41:02 debNAS kernel: [ 220.845437] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.851080] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.853893] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.856678] ata2.00: cmd 60/08:e8:f8:85:d2/00:00:10:00:00/40 tag 29 ncq dma 4096 in
Jul 2 20:41:02 debNAS kernel: [ 220.856678] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.862321] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.865127] ata2.00: failed command: READ FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.867931] ata2.00: cmd 60/08:f0:a0:cb:a3/00:00:10:00:00/40 tag 30 ncq dma 4096 in
Jul 2 20:41:02 debNAS kernel: [ 220.867931] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.873575] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.876423] ata2.00: failed command: WRITE FPDMA QUEUED
Jul 2 20:41:02 debNAS kernel: [ 220.879328] ata2.00: cmd 61/08:f8:40:ca:0f/00:00:11:00:00/40 tag 31 ncq dma 4096 out
Jul 2 20:41:02 debNAS kernel: [ 220.879328] res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jul 2 20:41:02 debNAS kernel: [ 220.885213] ata2.00: status: { DRDY }
Jul 2 20:41:02 debNAS kernel: [ 220.888121] ata2: hard resetting link
Jul 2 20:41:02 debNAS kernel: [ 221.204674] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 2 20:41:02 debNAS kernel: [ 221.206726] ata2.00: supports DRM functions and may not be fully accessible
Jul 2 20:41:02 debNAS kernel: [ 221.209820] ata2.00: supports DRM functions and may not be fully accessible
Jul 2 20:41:02 debNAS kernel: [ 221.212463] ata2.00: configured for UDMA/133
Jul 2 20:41:02 debNAS kernel: [ 221.212488] ahci 0000:00:13.0: port does not support device sleep
Jul 2 20:41:02 debNAS kernel: [ 221.212620] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212641] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212649] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212662] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212669] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212677] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212685] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212692] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212700] ata2.00: device reported invalid CHS sector 0
Jul 2 20:41:02 debNAS kernel: [ 221.212787] sd 1:0:0:0: [sda] tag#17 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jul 2 20:41:02 debNAS kernel: [ 221.212803] sd 1:0:0:0: [sda] tag#17 Sense Key : Illegal Request [current]
Jul 2 20:41:02 debNAS kernel: [ 221.212818] sd 1:0:0:0: [sda] tag#17 Add. Sense: Unaligned write command
Jul 2 20:41:02 debNAS kernel: [ 221.212834] sd 1:0:0:0: [sda] tag#17 CDB: Read(10) 28 00 18 84 0e 08 00 00 08 00
Jul 2 20:41:02 debNAS kernel: [ 221.212843] print_req_error: I/O error, dev sda, sector 411307528
Jul 2 20:41:02 debNAS kernel: [ 221.225557] sd 1:0:0:0: [sda] tag#18 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jul 2 20:41:02 debNAS kernel: [ 221.225562] sd 1:0:0:0: [sda] tag#18 Sense Key : Illegal Request [current]
Jul 2 20:41:02 debNAS kernel: [ 221.225566] sd 1:0:0:0: [sda] tag#18 Add. Sense: Unaligned write command
Jul 2 20:41:02 debNAS kernel: [ 221.225576] sd 1:0:0:0: [sda] tag#18 CDB: Read(10) 28 00 18 84 76 08 00 00 08 00
Jul 2 20:41:02 debNAS kernel: [ 221.225582] print_req_error: I/O error, dev sda, sector 411334152
Jul 2 20:41:02 debNAS kernel: [ 221.228491] sd 1:0:0:0: [sda] tag#19 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jul 2 20:41:02 debNAS kernel: [ 221.228494] sd 1:0:0:0: [sda] tag#19 Sense Key : Illegal Request [current]
Jul 2 20:41:02 debNAS kernel: [ 221.228498] sd 1:0:0:0: [sda] tag#19 Add. Sense: Unaligned write command
Jul 2 20:41:02 debNAS kernel: [ 221.228501] sd 1:0:0:0: [sda] tag#19 CDB: Read(10) 28 00 07 ed 3e e0 00 00 80 00
Jul 2 20:41:02 debNAS kernel: [ 221.228503] print_req_error: I/O error, dev sda, sector 132988640
Jul 2 20:41:02 debNAS kernel: [ 221.231389] sd 1:0:0:0: [sda] tag#30 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jul 2 20:41:02 debNAS kernel: [ 221.231392] sd 1:0:0:0: [sda] tag#30 Sense Key : Illegal Request [current]
Jul 2 20:41:02 debNAS kernel: [ 221.231395] sd 1:0:0:0: [sda] tag#30 Add. Sense: Unaligned write command
Jul 2 20:41:02 debNAS kernel: [ 221.231399] sd 1:0:0:0: [sda] tag#30 CDB: Read(10) 28 00 10 a3 cb a0 00 00 08 00
Jul 2 20:41:02 debNAS kernel: [ 221.231401] print_req_error: I/O error, dev sda, sector 279169952
Jul 2 20:41:02 debNAS kernel: [ 221.234260] ata2: EH complete
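To make the log above easier to read, here is a minimal Python sketch that decodes the `cmd` field of the `ata2.00` lines. It assumes the usual libata layout (command / feature:nsect:lbal:lbam:lbah / the five matching high-order bytes / device) and the NCQ convention that the feature bytes carry the sector count while the upper five bits of nsect carry the tag. The function name `decode_ncq_taskfile` is made up for illustration.

```python
def decode_ncq_taskfile(field: str) -> dict:
    """Decode a libata 'cmd' field of an NCQ (FPDMA QUEUED) command.

    Assumed layout: cmd/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:
    hob_lbal:hob_lbam:hob_lbah/device, all bytes in hex.
    """
    command, low, high, _device = field.split("/")
    feature, nsect, lbal, lbam, lbah = (int(b, 16) for b in low.split(":"))
    hob_feature, _hob_nsect, hob_lbal, hob_lbam, hob_lbah = (
        int(b, 16) for b in high.split(":")
    )
    # 48-bit LBA: the high-order bytes form the upper 24 bits.
    lba = (
        hob_lbah << 40 | hob_lbam << 32 | hob_lbal << 24
        | lbah << 16 | lbam << 8 | lbal
    )
    return {
        "opcode": int(command, 16),        # 0x60 = read, 0x61 = write (FPDMA)
        "tag": nsect >> 3,                 # NCQ tag sits in bits 7:3
        "sectors": hob_feature << 8 | feature,
        "lba": lba,
    }


if __name__ == "__main__":
    # 'cmd' fields copied from the log above (tags 17 and 19).
    for field in (
        "60/08:88:08:0e:84/00:00:18:00:00/40",
        "60/80:98:e0:3e:ed/00:00:07:00:00/40",
    ):
        print(decode_ncq_taskfile(field))
```

For the first line this gives tag 17, 8 sectors (4096 bytes) and LBA 411307528, which is the same sector the kernel later reports as `I/O error, dev sda, sector 411307528`. So the sense data printed after the reset belongs to the commands that timed out, not to separate failures.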
Oh, and by the way: I already replaced the cable, switched from SATA3 to SATA2 in the BIOS, and moved the drive from mainboard SATA port 1 to port 2. No success.
Here is the SMART info:
Code
root@debNAS:~# smartctl -A /dev/sda
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.19.0-0.bpo.5-amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   097   097   000    Old_age   Always       -       14939
 12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       116
177 Wear_Leveling_Count     0x0013   097   097   000    Pre-fail  Always       -       13
179 Used_Rsvd_Blk_Cnt_Tot   0x0013   100   100   010    Pre-fail  Always       -       0
181 Program_Fail_Cnt_Total  0x0032   100   100   010    Old_age   Always       -       0
182 Erase_Fail_Count_Total  0x0032   100   100   010    Old_age   Always       -       0
183 Runtime_Bad_Block       0x0013   100   099   010    Pre-fail  Always       -       0
187 Uncorrectable_Error_Cnt 0x0032   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0032   071   061   000    Old_age   Always       -       29
195 ECC_Error_Rate          0x001a   200   200   000    Old_age   Always       -       0
199 CRC_Error_Count         0x003e   100   100   000    Old_age   Always       -       0
235 POR_Recovery_Count      0x0012   099   099   000    Old_age   Always       -       19
241 Total_LBAs_Written      0x0032   099   099   000    Old_age   Always       -       4165463797
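A small sketch for pulling the raw values out of the `smartctl -A` table above and flagging error counters that are not zero. The helper name `parse_smart_attributes` and the choice of attributes in `ERROR_COUNTERS` are my own assumptions for illustration, not something smartmontools defines; the sample text is the table from this post.

```python
SMART_OUTPUT = """\
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
9 Power_On_Hours 0x0032 097 097 000 Old_age Always - 14939
12 Power_Cycle_Count 0x0032 099 099 000 Old_age Always - 116
177 Wear_Leveling_Count 0x0013 097 097 000 Pre-fail Always - 13
179 Used_Rsvd_Blk_Cnt_Tot 0x0013 100 100 010 Pre-fail Always - 0
181 Program_Fail_Cnt_Total 0x0032 100 100 010 Old_age Always - 0
182 Erase_Fail_Count_Total 0x0032 100 100 010 Old_age Always - 0
183 Runtime_Bad_Block 0x0013 100 099 010 Pre-fail Always - 0
187 Uncorrectable_Error_Cnt 0x0032 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0032 071 061 000 Old_age Always - 29
195 ECC_Error_Rate 0x001a 200 200 000 Old_age Always - 0
199 CRC_Error_Count 0x003e 100 100 000 Old_age Always - 0
235 POR_Recovery_Count 0x0012 099 099 000 Old_age Always - 19
241 Total_LBAs_Written 0x0032 099 099 000 Old_age Always - 4165463797
"""

# Assumption: these are the counters worth watching for media or link trouble.
ERROR_COUNTERS = (
    "Reallocated_Sector_Ct",
    "Runtime_Bad_Block",
    "Uncorrectable_Error_Cnt",
    "ECC_Error_Rate",
    "CRC_Error_Count",
)


def parse_smart_attributes(text: str) -> dict:
    """Map attribute name -> raw value for every attribute row in the table."""
    attrs = {}
    for line in text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns.
        if len(fields) >= 10 and fields[0].isdigit():
            attrs[fields[1]] = int(fields[9])
    return attrs


attrs = parse_smart_attributes(SMART_OUTPUT)
flagged = {name: attrs[name] for name in ERROR_COUNTERS if attrs.get(name, 0) != 0}

if __name__ == "__main__":
    print("non-zero error counters:", flagged or "none")
    print("POR_Recovery_Count:", attrs["POR_Recovery_Count"])
```

With the values from this post, none of the listed counters is above zero, including `CRC_Error_Count`, so the attribute table on its own does not point to bad flash or a noisy link.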