The green hard drive death - Load Cycle Count

    • The green hard drive death - Load Cycle Count

      Hello,

      I stumbled across this page:
      linuxforen.de/forums/showthread.php?t=267786

      Now I am not sure how to deal with this in OMV in order to keep
      the drives from possibly dying.

      Source Code

      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD20EARX-00PASB0
      Serial Number:    WD-WCAZAH388201
      Firmware Version: 51.0AB51
      User Capacity:    2.000.398.934.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   8
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:20:10 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Disabled
      SMART Disabled. Use option -s with argument 'on' to enable it.

      root@Server:~# smartctl -a /dev/sdc
      smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD20EARX-00PASB0
      Serial Number:    WD-WCAZAH277130
      Firmware Version: 51.0AB51
      User Capacity:    2.000.398.934.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   8
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:20:13 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Disabled
      SMART Disabled. Use option -s with argument 'on' to enable it.

      root@Server:~# smartctl -a /dev/sdd
      smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD30EFRX-68EUZN0
      Serial Number:    WD-WMC4N1785106
      Firmware Version: 80.00A80
      User Capacity:    3.000.592.982.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   9
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:20:23 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Disabled
      SMART Disabled. Use option -s with argument 'on' to enable it.

      root@Server:~#


      So far I have actually kept everything disabled, but does that make sense?

      How can I tell in OMV whether my drives might be affected?

      Regards

      EDIT:
      Here I enabled SMART as a test, but then disabled it again right away.

      Source Code

      login as: root
      root@192.168.1.18's password:
      Linux Server 3.2.0-0.bpo.4-amd64 #1 SMP Debian 3.2.57-3+deb7u2~bpo60+1 x86_64

      The programs included with the Debian GNU/Linux system are free software;
      the exact distribution terms for each program are described in the
      individual files in /usr/share/doc/*/copyright.

      Debian GNU/Linux comes with ABSOLUTELY NO WARRANTY, to the extent
      permitted by applicable law.
      Last login: Fri Jun 20 20:30:01 2014 from buero.fritz.box
      root@Server:~# smartctl -a /dev/sdb
      smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD20EARX-00PASB0
      Serial Number:    WD-WCAZAH388201
      Firmware Version: 51.0AB51
      User Capacity:    2.000.398.934.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   8
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:30:21 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled

      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED

      General SMART Values:
      Offline data collection status:  (0x82) Offline data collection activity
                                              was completed without error.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                 (37080) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 255) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.

      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   165   156   021    Pre-fail  Always       -       6741
        4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2971
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   090   090   000    Old_age   Always       -       7932
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       1524
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       122
      193 Load_Cycle_Count        0x0032   193   193   000    Old_age   Always       -       22121
      194 Temperature_Celsius     0x0022   120   115   000    Old_age   Always       -       30
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

      SMART Error Log Version: 1
      No Errors Logged

      SMART Self-test log structure revision number 1
      No self-tests have been logged. [To run self-tests, use: smartctl -t]

      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.

      root@Server:~# smartctl -a /dev/sdc
      smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD20EARX-00PASB0
      Serial Number:    WD-WCAZAH277130
      Firmware Version: 51.0AB51
      User Capacity:    2.000.398.934.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   8
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:30:23 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled

      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED

      General SMART Values:
      Offline data collection status:  (0x82) Offline data collection activity
                                              was completed without error.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                 (39600) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 255) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x3035) SCT Status supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.

      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   166   156   021    Pre-fail  Always       -       6658
        4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2414
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   092   092   000    Old_age   Always       -       6569
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -       1524
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       121
      193 Load_Cycle_Count        0x0032   196   196   000    Old_age   Always       -       13557
      194 Temperature_Celsius     0x0022   121   115   000    Old_age   Always       -       29
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

      SMART Error Log Version: 1
      No Errors Logged

      SMART Self-test log structure revision number 1
      No self-tests have been logged. [To run self-tests, use: smartctl -t]

      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.

      root@Server:~# smartctl -a /dev/sdd
      smartctl 5.40 2010-07-12 r3124 [x86_64-unknown-linux-gnu] (local build)
      Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

      === START OF INFORMATION SECTION ===
      Device Model:     WDC WD30EFRX-68EUZN0
      Serial Number:    WD-WMC4N1785106
      Firmware Version: 80.00A80
      User Capacity:    3.000.592.982.016 bytes
      Device is:        Not in smartctl database [for details use: -P showall]
      ATA Version is:   9
      ATA Standard is:  Exact ATA specification draft version not indicated
      Local Time is:    Fri Jun 20 20:30:25 2014 CEST
      SMART support is: Available - device has SMART capability.
      SMART support is: Enabled

      === START OF READ SMART DATA SECTION ===
      SMART overall-health self-assessment test result: PASSED

      General SMART Values:
      Offline data collection status:  (0x80) Offline data collection activity
                                              was never started.
                                              Auto Offline Data Collection: Enabled.
      Self-test execution status:      (   0) The previous self-test routine completed
                                              without error or no self-test has ever
                                              been run.
      Total time to complete Offline
      data collection:                 (39360) seconds.
      Offline data collection
      capabilities:                    (0x7b) SMART execute Offline immediate.
                                              Auto Offline data collection on/off support.
                                              Suspend Offline collection upon new
                                              command.
                                              Offline surface scan supported.
                                              Self-test supported.
                                              Conveyance Self-test supported.
                                              Selective Self-test supported.
      SMART capabilities:            (0x0003) Saves SMART data before entering
                                              power-saving mode.
                                              Supports SMART auto save timer.
      Error logging capability:        (0x01) Error logging supported.
                                              General Purpose Logging supported.
      Short self-test routine
      recommended polling time:        (   2) minutes.
      Extended self-test routine
      recommended polling time:        ( 255) minutes.
      Conveyance self-test routine
      recommended polling time:        (   5) minutes.
      SCT capabilities:              (0x703d) SCT Status supported.
                                              SCT Error Recovery Control supported.
                                              SCT Feature Control supported.
                                              SCT Data Table supported.

      SMART Attributes Data Structure revision number: 16
      Vendor Specific SMART Attributes with Thresholds:
      ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
        1 Raw_Read_Error_Rate     0x002f   100   253   051    Pre-fail  Always       -       0
        3 Spin_Up_Time            0x0027   179   174   021    Pre-fail  Always       -       6050
        4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       610
        5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
        7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
        9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       587
       10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
       11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
       12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       610
      192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       2
      193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       703
      194 Temperature_Celsius     0x0022   122   116   000    Old_age   Always       -       28
      196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
      197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
      198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
      199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
      200 Multi_Zone_Error_Rate   0x0008   100   253   000    Old_age   Offline      -       0

      SMART Error Log Version: 1
      No Errors Logged

      SMART Self-test log structure revision number 1
      No self-tests have been logged. [To run self-tests, use: smartctl -t]

      SMART Selective self-test log data structure revision number 1
       SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
          1        0        0  Not_testing
          2        0        0  Not_testing
          3        0        0  Not_testing
          4        0        0  Not_testing
          5        0        0  Not_testing
      Selective self-test flags (0x0):
        After scanning selected spans, do NOT read-scan remainder of disk.
      If Selective self-test is pending on power-up, resume after 0 minute delay.

      root@Server:~#
      Files
      • smart1.png (55.65 kB)
      • hdd.png (23.16 kB)
      Server:
      BitFenix Phenom M - ASRock B75 Pro3-M - 8GB Crucial B.S. - Intel G2140 - Corsair CX430M - 1x 64GB SanDisk SDSSDP (system) - 2x 2TB WD20EARX (data) - 1x 3TB WD30EFRX (Snapraid) - TBS6981 DVB-S2 - OMV 0.5x
    • Re: The green hard drive death - Load Cycle Count

      No idea,

      I have not had any problems so far, though I assume my drives are always on.
      No increased rate of drive deaths has been reported here either.
      Apart from that, I find this funny: these are desktop drives, built to spin up and down constantly (power saving). So why should that be any worse in a NAS?

      Sorry for not answering in English.
      Everything is possible, sometimes it requires Google to find out how.
    • This is a problem specific to the WD Green hard drives and some WD 2.5" drives.
      The drives include a "great" feature called "IntelliPark", which moves the read/write heads into the park position every x seconds when the drive is idle.
      Each time the heads are brought back out and made ready again, it is counted under "Load_Cycle_Count".

      This really is a problem, but one that can and should be solved! The manufacturer rates the drive for a lifetime of 300,000 load cycles (see WD). With parking every 4 seconds, as in my case, that is 900 load cycles per hour, and in 24/7 operation the rated number is used up fairly quickly.
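To put numbers on this, here is a rough worked calculation in shell. It is only a sketch of the worst case: it assumes the heads really are parked and reloaded every 4 seconds around the clock, which in practice only happens if something keeps touching the drive. The function name hours_until_limit is made up for this example.

```shell
# Hours until a drive reaches its rated load-cycle limit at a constant rate.
# $1 = rated load cycles, $2 = load cycles per hour (integer division)
hours_until_limit() {
    echo $(( $1 / $2 ))
}

interval_s=4                         # seconds between head parks (from the post)
per_hour=$(( 3600 / interval_s ))    # 3600 s / 4 s = 900 load cycles per hour
hours=$(hours_until_limit 300000 "$per_hour")

echo "$per_hour cycles/h -> limit reached after $hours h (about $(( hours / 24 )) days)"
# prints: 900 cycles/h -> limit reached after 333 h (about 13 days)
```

The rated figure is not a hard failure point, so this is best read as an indication of how fast the counter can climb, not as a prediction of when a drive will die.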

      More information at ubuntuusers

      What is the best way to determine the number of load cycles?

      Source Code

      smartctl -A /dev/sdd | grep -E "ID|Load|Power"

      This gives you the values for power-on hours, power cycles and load cycles.
      They are the same RAW_VALUE entries as in the table above. smartctl already prints that column in decimal (only the FLAG column is hexadecimal), so no conversion is needed.
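A single snapshot becomes easier to judge when the load cycles are put in relation to the power-on hours. The following is a minimal sketch of that idea. The function name load_cycles_per_hour is hypothetical, and it assumes the attribute table layout shown earlier in this thread, with ATTRIBUTE_NAME in field 2 and RAW_VALUE in field 10.

```shell
# Print load cycles per power-on hour from `smartctl -A` output read on stdin.
# Field 2 is ATTRIBUTE_NAME and field 10 is RAW_VALUE in the attribute table.
load_cycles_per_hour() {
    LC_ALL=C awk '
        $2 == "Power_On_Hours"   { hours  = $10 }
        $2 == "Load_Cycle_Count" { cycles = $10 }
        END {
            if (hours > 0) printf "%.1f\n", cycles / hours
            else           print "n/a"
        }
    '
}

# On a live system (needs root): smartctl -A /dev/sdX | load_cycles_per_hour
# Sample using the two raw values of the first WD20EARX posted in this thread:
printf '%s\n' \
  '  9 Power_On_Hours   0x0032 090 090 000 Old_age Always - 7932' \
  '193 Load_Cycle_Count 0x0032 193 193 000 Old_age Always - 22121' \
  | load_cycles_per_hour
# prints: 2.8
```

For comparison, parking every 4 seconds would correspond to up to 900 load cycles per hour.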

      Do I have a problem?
      My drive, for example, has been power-cycled 120 times and has a good 1000 power-on hours behind it, but already 80,000 load cycles! So yes, that looks very much like a problem.
      It is also enough to run the query (see above) twice, 5 minutes apart: if the difference is more than 1 load cycle, you have a problem.
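The five-minute comparison described above could be scripted roughly as follows. This is only a sketch: the function names read_load_cycles and check_load_cycles are made up for this example, and it assumes that smartctl is installed, that the script runs as root, and that Load_Cycle_Count appears with its raw value in field 10, as in the outputs earlier in this thread.

```shell
# Read the raw Load_Cycle_Count of a drive via smartctl (needs root).
read_load_cycles() {
    smartctl -A "$1" | awk '$2 == "Load_Cycle_Count" { print $10 }'
}

# Sample the counter twice, $2 seconds apart (default 300 = 5 minutes), and
# report whether it grew by more than one load cycle in between.
check_load_cycles() {
    dev=$1
    wait_s=${2:-300}
    before=$(read_load_cycles "$dev")
    sleep "$wait_s"
    after=$(read_load_cycles "$dev")
    if [ -z "$before" ] || [ -z "$after" ]; then
        echo "$dev: could not read Load_Cycle_Count" >&2
        return 1
    fi
    delta=$(( after - before ))
    if [ "$delta" -gt 1 ]; then
        echo "$dev: +$delta load cycles in ${wait_s}s, probably affected"
    else
        echo "$dev: +$delta load cycles in ${wait_s}s, looks fine"
    fi
}

# Example call (replace sdX with the real device): check_load_cycles /dev/sdX
```

Keeping the smartctl call in its own small function makes it easy to swap in a different reader, for example for a drive behind a USB adapter that needs extra smartctl options.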

      What can I do?
      The first approach would be to switch off the IntelliPark nonsense entirely. For that there is a DOS tool from WD called wdidle3 and a Linux alternative, idle3-tools. See ubuntuusers above.
      Unfortunately that did not help with my drive (a 2.5-inch model); it was really developed more for the "Green" models.
      But you can also put an end to the problem from within Linux.
      One way is hdparm, in which case the setting has to be added to the hdparm startup script. In openmediavault you can also do it in the web GUI: go to Physical Disks, select the drive, click Edit, and set the Advanced Power Management value accordingly.
      Unfortunately that did not work for me, because my drive is also attached via a USB adapter and hdparm refuses to work with it at all.
      smartmontools, however, handles it fine.

      Source Code

      smartctl -g apm /dev/sdX       # shows the current power management (APM) value
      smartctl -s apm,254 /dev/sdX   # sets APM to the maximum: barely any power saving, longer life. APM can also be turned off entirely (apm,off in smartctl, 255 in hdparm), but that does not work on all drives
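With several drives it can help to wrap the call in a small function and review the commands before running them. The sketch below is illustrative only: set_apm and the DRY_RUN variable are names invented for this example, the device names are placeholders, and it assumes a smartmontools version that supports `-s apm,N`.

```shell
# Set the APM level on one or more drives. With DRY_RUN=1 the smartctl
# commands are only printed, so they can be reviewed before anything changes.
set_apm() {
    level=$1
    shift
    for dev in "$@"; do
        if [ "${DRY_RUN:-0}" = 1 ]; then
            echo "smartctl -s apm,$level $dev"
        else
            smartctl -s "apm,$level" "$dev"
        fi
    done
}

# Print what would be run for two example devices (placeholder names):
DRY_RUN=1 set_apm 254 /dev/sdb /dev/sdc
```

Without DRY_RUN=1 the function calls smartctl directly and therefore needs root.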

      I appended all of this to the corresponding drive entry in /etc/smartd.conf. CAUTION: in smartd.conf the directive is -e apm,254, not -s apm,xxx as with smartctl.
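For illustration, a complete smartd.conf entry with the directive appended might look like the line below. The device name /dev/sdX and the -a directive are placeholders assumed for this sketch, not taken from the post, so the existing entry for your drive will probably look different.

```
# /etc/smartd.conf (sketch): monitor the drive and set APM to 254 when smartd starts
/dev/sdX -a -e apm,254
```

smartd applies -e settings when it starts up, which is why a restart of the service is needed afterwards.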
      Now just reboot, or simply run

      Source Code

      service smartmontools restart

      and the Load_Cycle_Count should stop climbing.