hey,
ich hab ein problem mit meinem server. ich hab ein software raid in omv erstellt mit 2x 500gb hdds (raid1).
folgender fehler:
Code
Jun 4 16:28:24 Media-Server kernel: [22386.804871] ata17.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Jun 4 16:28:24 Media-Server kernel: [22386.804876] ata17.00: failed command: READ DMA EXT
Jun 4 16:28:24 Media-Server kernel: [22386.804879] ata17.00: cmd 25/00:00:a8:72:fe/00:01:8f:00:00/e0 tag 1 dma 131072 in
Jun 4 16:28:24 Media-Server kernel: [22386.804879] res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 4 16:28:24 Media-Server kernel: [22386.804880] ata17.00: status: { DRDY }
Jun 4 16:28:24 Media-Server kernel: [22386.804883] ata17: hard resetting link
Jun 4 16:28:25 Media-Server kernel: [22387.301126] ata17: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun 4 16:28:25 Media-Server kernel: [22387.303874] ata17.00: configured for UDMA/33
Jun 4 16:28:25 Media-Server kernel: [22387.317104] ata17.00: device reported invalid CHS sector 0
Jun 4 16:28:25 Media-Server kernel: [22387.317114] ata17: EH complete
die smart werte der beiden hdds:
Code
root@Media-Server:/home/karl-heinz# smartctl -A /dev/sda
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.16.0-0.bpo.4-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0003 158 152 021 Pre-fail Always - 5091
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 581
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x000e 200 200 051 Old_age Always - 0
9 Power_On_Hours 0x0032 092 092 000 Old_age Always - 5974
10 Spin_Retry_Count 0x0012 100 100 051 Old_age Always - 0
11 Calibration_Retry_Count 0x0012 100 100 051 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 149
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 19
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 149
194 Temperature_Celsius 0x0022 113 098 000 Old_age Always - 34
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0012 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 200 200 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 051 Old_age Offline - 0
root@Media-Server:/home/karl-heinz# smartctl -A /dev/sdb
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.16.0-0.bpo.4-amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 114 100 006 Pre-fail Always - 67892502
3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 143
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail Always - 353779
9 Power_On_Hours 0x0032 099 098 000 Old_age Always - 1553
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 112
183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 070 038 045 Old_age Always In_the_past 30 (14 244 30 23)
194 Temperature_Celsius 0x0022 030 062 000 Old_age Always - 30 (0 14 0 0)
195 Hardware_ECC_Recovered 0x001a 039 024 000 Old_age Always - 67892502
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 75892072121651
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 1634258357
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 1957154322
Alles anzeigen
zusätzlich (ich bin mir nicht sicher ob es an der hdd liegt) ist mein load average DERMASSEN hoch ... teilweise geht der auf 3, 4 oder 5 hoch. unter htop ist die cpu auslastung aber normal.
kann mir einer tipps geben?