HDD for virtual machines

  • Hi !


    At the moment I have an old 500 GB WD hard drive for my virtual machines in OMV, which run a mail server and a Nextcloud server.


    The HDD now has 30,000 power-on hours and I am currently getting SMART errors, but the drive is still working.


    But I think I have to replace it soon.


    I don't need much space; 200-250 GB would be enough.


    Would you buy another HDD (and if so, which one?) or would you go for a 250 GB SSD?


    The server is on 24/7 and the drive is backed up every night.


    Thank you for your replies.


    Greets Karl

  • Is it important which one?


    Are there any criteria for choosing one?


    Real enterprise SSDs are much too expensive.


    I have had a small system SSD (SanDisk) running for 3 or 4 years without problems.

  • Are there any criteria for choosing one?

    • Performance: the highest random IOPS and the lowest latency you are willing to pay for
    • Reliability: only buy from manufacturers that expose the SSD's wear-out indicator through SMART. That way you can watch the SSD wear out (slowly die) and replace it when the indicator drops below the 10% mark
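
As a rough illustration of the reliability criterion above, here is a minimal Python sketch that scans the attribute table of pasted smartctl output for a wear-out indicator. The function names and the list of attribute names are assumptions made for this example: the names are ones that smartmontools commonly shows for some vendors, the list is not exhaustive, and the sketch assumes the normalized VALUE column counts down from 100, which is not true for every drive.

```python
import re

# Assumed, non-exhaustive set of attribute names that smartmontools commonly
# displays for wear-out indicators. Names vary per vendor and drive model.
WEAR_ATTRIBUTE_NAMES = {
    "Media_Wearout_Indicator",
    "Wear_Leveling_Count",
    "SSD_Life_Left",
    "Percent_Lifetime_Remain",
}

# Matches the first columns of an attribute row: ID, name, flags, VALUE, WORST.
ATTR_LINE = re.compile(r"^\s*(\d+)\s+(\S+)\s+(\S+)\s+(\d+)\s+(\d+)\s")


def find_wear_attributes(smartctl_output):
    """Return {attribute_name: normalized VALUE} for known wear attributes."""
    found = {}
    for line in smartctl_output.splitlines():
        match = ATTR_LINE.match(line)
        if match and match.group(2) in WEAR_ATTRIBUTE_NAMES:
            found[match.group(2)] = int(match.group(4))
    return found


def needs_replacement(found, limit=10):
    """True if any wear indicator has dropped below the limit (default 10)."""
    return any(value < limit for value in found.values())


if __name__ == "__main__":
    # Made-up sample rows for illustration, not taken from a real drive.
    sample = (
        "  9 Power_On_Hours          -O---- 100 100 000 - 1234\n"
        "233 Media_Wearout_Indicator -O--CK 095 095 000 - 0\n"
    )
    wear = find_wear_attributes(sample)
    print(wear, needs_replacement(wear))
```

An empty result means none of the listed names appeared in the output. That does not prove the drive lacks a wear indicator, only that it does not use one of these names.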
  • Hi !


    OK, thank you. I have now ordered a Kingston UV400 SSD.


    My system SSD is a very cheap SanDisk ... according to the SMART information it now has 36,000 hours and still 90% of its life left.


    So that is really good I think.


    Hope the new one is as good as the old one.


    Greets Karl

  • Hi !


    I have the following one as system SSD:


    32GB SanDisk ReadyCache 2.5" SATA 6Gb -> bought in December 2013.


    Here are the SMART values:



    === START OF INFORMATION SECTION ===
    Model Family: SanDisk based SSDs
    Device Model: SanDisk SDSSDRC032G
    Serial Number: 134106401464
    LU WWN Device Id: 5 001b44 a44f246b8
    Firmware Version: 3.0.0
    User Capacity: 32,017,047,552 bytes [32.0 GB]
    Sector Size: 512 bytes logical/physical
    Rotation Rate: Solid State Device
    Form Factor: 1.8 inches
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: ACS-2 T13/2015-D revision 3
    SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is: Fri Mar 9 18:00:17 2018 CET
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    AAM feature is: Unavailable
    APM level is: 254 (maximum performance)
    Rd look-ahead is: Enabled
    Write cache is: Enabled
    ATA Security is: Disabled, frozen [SEC2]
    Wt Cache Reorder: Unavailable


    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED


    General SMART Values:
    Offline data collection status: (0x00) Offline data collection activity
    was never started.
    Auto Offline Data Collection: Disabled.
    Self-test execution status: ( 25) The self-test routine was aborted by
    the host.
    Total time to complete Offline
    data collection: ( 120) seconds.
    Offline data collection
    capabilities: (0x51) SMART execute Offline immediate.
    No Auto Offline data collection support.
    Suspend Offline collection upon new
    command.
    No Offline surface scan supported.
    Self-test supported.
    No Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 2) minutes.
    Extended self-test routine
    recommended polling time: ( 7) minutes.


    SMART Attributes Data Structure revision number: 1
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
    5 Reallocated_Sector_Ct -O---- 100 100 000 - 0
    9 Power_On_Hours -O---- 100 100 000 - 36633
    12 Power_Cycle_Count -O---- 100 100 000 - 81
    171 Program_Fail_Count -O---- 100 100 000 - 0
    172 Erase_Fail_Count -O---- 100 100 000 - 0
    173 Avg_Write/Erase_Count -O---- 100 100 000 - 219
    174 Unexpect_Power_Loss_Ct -O---- 100 100 000 - 32
    187 Reported_Uncorrect -O---- 100 100 000 - 0
    230 Perc_Write/Erase_Count -O---- 100 100 000 - 730
    232 Perc_Avail_Resrvd_Space PO---- 100 100 005 - 0
    234 Perc_Write/Erase_Ct_BC -O---- 100 100 000 - 364
    241 Total_LBAs_Written -O---- 100 100 000 - 4495811996
    242 Total_LBAs_Read -O---- 100 100 000 - 1548106641
    ||||||_ K auto-keep
    |||||__ C event count
    ||||___ R error rate
    |||____ S speed/performance
    ||_____ O updated online
    |______ P prefailure warning


    General Purpose Log Directory Version 1
    SMART Log Directory Version 1 [multi-sector log support]
    Address Access R/W Size Description
    0x00 GPL,SL R/O 1 Log Directory
    0x01 GPL,SL R/O 1 Summary SMART error log
    0x03 GPL,SL R/O 16 Ext. Comprehensive SMART error log
    0x04 GPL,SL R/O 8 Device Statistics log
    0x06 GPL,SL R/O 1 SMART self-test log
    0x09 GPL,SL R/W 1 Selective self-test log
    0x10 GPL,SL R/O 1 SATA NCQ Queued Error log
    0x11 GPL,SL R/O 1 SATA Phy Event Counters log
    0x30 GPL,SL R/O 9 IDENTIFY DEVICE data log
    0x80-0x9f GPL,SL R/W 16 Host vendor specific log
    0xa1 GPL,SL VS 1 Device vendor specific log
    0xa2 GPL,SL VS 2 Device vendor specific log
    0xa3 GPL,SL VS 1 Device vendor specific log
    0xa6-0xa7 GPL,SL VS 255 Device vendor specific log



    SMART Self-test log structure revision number 1
    Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
    # 1 Short offline Aborted by host 90% 36584 -

  • I think this should be the value

    Nope, you're making a dangerous mistake: you are taking a percentage value that happens to be displayed somewhere, and that actually indicates a problem, and interpreting it as something else:


    Code
    SMART Self-test log structure revision number 1
    Num  Test_Description    Status            Remaining    LifeTime(hours)   LBA_of_first_error
    # 1  Short offline       Aborted by host   90%          36584             -

    This 90% means that even a short SMART self-test was aborted after a few seconds (less than 10% finished), since 90% of the test remained unfinished. And the 'LifeTime(hours)' value only tells you when the SMART test was started; it will always be less than or equal to the 'Power_On_Hours' SMART attribute.
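
To make the reading of that log line concrete, here is a small Python sketch that splits the self-test entry into its fields and compares it with the Power_On_Hours raw value (36633) from the attribute table posted earlier. The function name, the field names and the regular expression are assumptions made for this example and are not part of smartmontools.

```python
import re

# One self-test log row: "# N", description and status, remaining percent,
# LifeTime(hours) at which the test started, and the LBA of the first error.
SELFTEST_LINE = re.compile(r"^\s*#\s*(\d+)\s+(.+?)\s+(\d+)%\s+(\d+)\s+(\S+)\s*$")


def parse_selftest_entry(line):
    """Parse one self-test log row into a dict, or return None if it differs."""
    match = SELFTEST_LINE.match(line)
    if match is None:
        return None
    return {
        "num": int(match.group(1)),
        # Description and status are kept together; runs of spaces collapsed.
        "description_and_status": " ".join(match.group(2).split()),
        "remaining_percent": int(match.group(3)),
        "started_at_hours": int(match.group(4)),
        "lba_of_first_error": match.group(5),
    }


entry = parse_selftest_entry(
    "# 1  Short offline       Aborted by host   90%          36584             -"
)
power_on_hours = 36633  # raw value of attribute 9 in the posted output

# "Remaining" describes the unfinished part of the TEST, not drive life.
completed = 100 - entry["remaining_percent"]
# The test start time can never exceed the current power-on hours.
hours_since_test = power_on_hours - entry["started_at_hours"]

print(entry["description_and_status"])
print("test completed (percent):", completed)
print("hours since test started:", hours_since_test)
```

With the posted values, this shows a test that got at most 10% of the way through and was started 49 power-on hours before the output was captured. Neither number says anything about flash wear.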


    Your SanDisk SSD does not provide any SMART attribute that serves as a 'wear-out indicator'. For me, that rules this SSD out for any important use: we don't use any SSD that cannot report its internal wear through SMART, because we want to replace it once only 10% of its life remains. I hope the Kingston you bought does provide such a SMART attribute (I don't know; I would never buy any flash memory product from Kingston or the other companies that just rehash things they bought somewhere else).
