RAID 5 with 3 Drives 3TB each - Two drives failed and RAID failed - help recover

  • Hello,


    I received a notification today from my OMV server that 2 drives failed.
    The RAID status in the WebGUI shows "clean, FAILED" and only drive /dev/sdd is listed.


    This is the info:


    cat /etc/mdadm/mdadm.conf

    Code
    ARRAY /dev/md/NAS metadata=1.2 spares=1 name=openmediavault:NAS UUID=332d8084:c2b3a139:44a4f8e1:6865cc49


    Please let me know what more info I need to provide.


    Thank you

    • Official post

    mdadm --stop /dev/md127
    mdadm --assemble --force --verbose /dev/md127 /dev/sd[dfg]
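
    Before forcing an assembly, it can help to compare the event counter that each member records in its superblock; a member whose counter lags behind the others is the out-of-date one. The following is only a sketch, assuming mdadm --examine prints an "Events :" line (as it does for 1.2 metadata). The helper names events_from_examine and show_member_events are made up for this example.

```shell
#!/bin/sh
# Hypothetical helper: read `mdadm --examine` output on stdin and
# print the value of its "Events" field.
events_from_examine() {
    awk -F: '/^[ \t]*Events[ \t]*:/ { gsub(/[ \t]/, "", $2); print $2; exit }'
}

# Hypothetical helper: print "<device> <events>" for every device passed in.
# Needs root and mdadm installed; it only reads the superblocks.
show_member_events() {
    for dev in "$@"; do
        printf '%s %s\n' "$dev" "$(mdadm --examine "$dev" 2>/dev/null | events_from_examine)"
    done
}

# Example call (device names are the ones used in this thread; adjust them):
#   show_member_events /dev/sdb /dev/sdc /dev/sdd
```

    If one device shows a much lower number than the rest, that is probably the drive mdadm will leave out when assembling with --force.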


    Normally when it says failed, that is bad though. I hope you have a backup.

    omv 7.0.5-1 sandworm | 64 bit | 6.8 proxmox kernel

    plugins :: omvextrasorg 7.0 | kvm 7.0.13 | compose 7.2 | k8s 7.1.0-3 | cputemp 7.0.1 | mergerfs 7.0.4 | scripts 7.0.1


    omv-extras.org plugins source code and issue tracker - github - changelogs


    Please try ctrl-shift-R and read this before posting a question.

    Please put your OMV system details in your signature.
    Please don't PM for support... Too many PMs!

  • Hi,


    I shut down my server and reseated the SATA cables, after which all my drives showed up again.
    I ran both commands from above and received the following:


    Code
    mdadm: stopped /dev/md127
    sudo mdadm --assemble --force --verbose /dev/md127 /dev/sd[bcd]
    mdadm: looking for devices for /dev/md127
    mdadm: /dev/sdb is busy - skipping
    mdadm: /dev/sdc is busy - skipping
    mdadm: /dev/sdd is identified as a member of /dev/md127, slot 2.
    mdadm: no uptodate device for slot 0 of /dev/md127
    mdadm: no uptodate device for slot 1 of /dev/md127
    mdadm: added /dev/sdd to /dev/md127 as 2
    mdadm: /dev/md127 assembled from 1 drive - not enough to start the array.


    I looked at the System Info and saw that I suddenly also have md126, which is inactive.


    What action should I take in order to preserve my data?


    Thank you

    • Official post

    mdadm --stop /dev/md126
    mdadm --stop /dev/md127
    mdadm --assemble --force --verbose /dev/md127 /dev/sd[bcd]

  • Thank you very much for the prompt reply.
    It seems that one of the drives is out of date and was not picked up as part of the RAID.


    How do I check the progress?



    updated system info:

    • Official post

    Let the array finish building (look at cat /proc/mdstat) and then zero the out of date drive and recover the drive to the array.
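
    To read the state out of /proc/mdstat without eyeballing it, something like the sketch below may help. It assumes the usual mdstat layout, where the line under each array ends in [expected/present] counts and a rebuild shows up as a "recovery =" or "resync =" line. The function name mdstat_summary is made up for this example.

```shell
#!/bin/sh
# Hypothetical helper: summarise md arrays from /proc/mdstat-style text.
# Prints one line per array: "<array> <present>/<expected> <state>".
# State is "degraded" when fewer members are present than expected,
# "ok" otherwise, "inactive" for inactive arrays, and " (rebuilding)"
# is appended when a recovery or resync progress line is seen.
mdstat_summary() {
    awk '
        function emit() { if (name != "") print name, counts, state rebuild }
        /^md[0-9]+ :/ {
            emit()
            name = $1; counts = "-"; rebuild = ""
            state = ($3 == "inactive") ? "inactive" : "unknown"
        }
        name != "" && match($0, /\[[0-9]+\/[0-9]+\]/) {
            split(substr($0, RSTART + 1, RLENGTH - 2), n, "/")
            counts = n[2] "/" n[1]
            state = (n[2] + 0 < n[1] + 0) ? "degraded" : "ok"
        }
        name != "" && /(recovery|resync) *=/ { rebuild = " (rebuilding)" }
        END { emit() }
    ' "${1:-/proc/mdstat}"
}

# Example: mdstat_summary            (reads /proc/mdstat)
#          mdstat_summary saved.txt  (reads a saved copy)
```

    A degraded array with no "(rebuilding)" suffix is running on two of its three drives and is waiting for the third to be added back.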

  • Does this mean it has finished?


    Code
    sudo cat /proc/mdstat
    Personalities : [raid6] [raid5] [raid4]
    md127 : active raid5 sdc[1] sdd[2]
          5855716352 blocks super 1.2 level 5, 512k chunk, algorithm 2 [3/2] [_UU]
    
    
    unused devices: <none>
    • Official post

    Yes.


    dd if=/dev/zero of=/dev/sdb bs=512 count=10000
    mdadm --zero-superblock /dev/sdb
    Then use recover in raid tab.
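
    For anyone who prefers the shell over the raid tab, the three steps can be sketched as below. This assumes that adding the wiped disk with mdadm --manage --add has roughly the same effect as the WebGUI recover button, which this thread does not confirm. The names run and readd_stale_member are made up, and DRY_RUN defaults to 1 so the commands are only printed.

```shell
#!/bin/sh
# Print the command when DRY_RUN is 1 (the default here); otherwise run it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: wipe the start of an out-of-date member and add it
# back to the array. The dd step is destructive, so check the device name
# with `cat /proc/mdstat` and `lsblk` before setting DRY_RUN=0.
readd_stale_member() {
    array=$1
    disk=$2
    run dd if=/dev/zero of="$disk" bs=512 count=10000   # zero the first ~5 MB
    run mdadm --zero-superblock "$disk"                 # may fail harmlessly after the dd
    run mdadm --manage "$array" --add "$disk"           # starts the rebuild
}

# Example (names from this thread): readd_stale_member /dev/md127 /dev/sdb
```

    Run it once with the default dry run and read the printed commands before running it as root with DRY_RUN=0.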

  • Hi,


    I followed the steps and in step 2 received the output below.
    Is it OK? Can I start the RAID recovery now?


    Code
    sudo dd if=/dev/zero of=/dev/sdb bs=512 count=10000
    [sudo] password for media:
    10000+0 records in
    10000+0 records out
    5120000 bytes (5.1 MB) copied, 4.99599 s, 1.0 MB/s
    $ sudo mdadm --zero-superblock /dev/sdb
    mdadm: Unrecognised md component device - /dev/sdb
    $ sudo mdadm --zero-superblock /dev/sdb
    mdadm: Unrecognised md component device - /dev/sdb
    • Official post

    That is expected in the second step. It is a "just in case" step. Proceed to the next step.

  • Hi,


    For some reason my server shut down during the RAID repair.
    When it booted up again, it gave an error regarding the RAID filesystem and asked for manual intervention.


    I executed:

    Code
    fsck.ext4 /dev/md127

    When it finished, it gave a warning that there were still file system issues.
    I ran it again and it gave the same error; I restarted, ran it again, and still got the same error.
    Now every time I restart the server I need to press Control+D.


    How do I perform a file system check that will solve the issue?
    This is my boot log:


    This is the checkfs log:


    I also get SpareEvent emails:

    Code
    A SparesMissing event had been detected on md device /dev/md127.


    One more thing: how do I recover data from lost+found? I can't even access the folder.


    Thank you

    • Official post

    fsck.ext4 -f /dev/md127
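
    The exit status of fsck.ext4 tells you whether anything was left unfixed. It is a bit mask, documented in the e2fsck man page, and can be decoded with a small sketch like this one. The function name describe_fsck_status is made up for this example.

```shell
#!/bin/sh
# Hypothetical helper: turn an e2fsck / fsck.ext4 exit status (a bit mask)
# into a readable summary. Bit meanings follow the e2fsck man page.
describe_fsck_status() {
    status=$1
    if [ "$status" -eq 0 ]; then
        echo "no errors"
        return 0
    fi
    out=""
    for pair in \
        "1:errors corrected" \
        "2:errors corrected, reboot advised" \
        "4:errors left uncorrected" \
        "8:operational error" \
        "16:usage or syntax error" \
        "32:cancelled by user" \
        "128:shared library error"
    do
        bit=${pair%%:*}     # number before the first colon
        text=${pair#*:}     # description after it
        if [ $((status & bit)) -ne 0 ]; then
            out="${out:+$out; }$text"
        fi
    done
    echo "$out"
}

# Example, with the filesystem unmounted:
#   fsck.ext4 -f /dev/md127; describe_fsck_status $?
```

    A result that includes "errors left uncorrected" means another pass is needed, for example with a newer e2fsprogs.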

    • Official post

    Boot systemrescuecd to try and fix it. It has much newer versions of filesystem utilities.

    • Official post

    systemrescuecd is something I have been using for years and it is great. Write it to a usb stick with linux live usb creator.


    No special tools, just the same fsck.ext4 command. The fsck libraries are much newer and might be able to fix your problem.

    • Official post

    The -y flag just answers yes to every question. Were you answering no when running the command I gave you?

  • Apparently there were 2 folders with size 0. This caused fsck to fail because they were not valid. By adjusting the folders with chown and chmod, I could create and remove a folder inside them, after which their size became valid. I then ran fsck again and the file system was repaired.


    I still have an issue with a message regarding a spare drive.
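
    A possible explanation, offered as an assumption rather than a diagnosis: the ARRAY line in /etc/mdadm/mdadm.conf from the first post contains spares=1. The mdadm monitor reports SparesMissing when the config file says an array should have spares and it finds fewer, and a 3-drive RAID 5 built from 3 drives has none. The sketch below only shows how that token could be filtered out of an ARRAY line; the function name strip_spares is made up, and OMV may regenerate the file on its own, so treat any manual edit as temporary.

```shell
#!/bin/sh
# Hypothetical helper: copy mdadm.conf text from stdin to stdout, dropping
# the "spares=N" token from ARRAY lines and leaving everything else alone.
strip_spares() {
    sed -E '/^ARRAY/ s/[[:space:]]+spares=[0-9]+//'
}

# Example: preview the result without touching the real file.
#   strip_spares < /etc/mdadm/mdadm.conf
```

    Preview the output first; if it looks right, back up the original file before replacing it.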


