[SOLVED] Software RAID1, need some help

  • Hi,
    We have been running OMV for some months. I recycled an old computer, installed two 2 TB disks, and used OMV to create a software RAID1. This server is joined to our old Windows Server 2003 Active Directory domain. It has worked without any trouble from the start.


    A few days ago came the first alert: my colleagues could no longer access the Samba shares with their regular AD credentials. Not a big deal; I rebooted the server and everything went back to normal.


    But a few days ago (last Friday, to be accurate!), something weird and worrying happened: it was as if the RAID had gone back several weeks in time (like reverting a VM to an old snapshot). We lost some data, and old data reappeared that is normally cleaned up automatically after 10 days.


    So here I am. I don't want to leave things like this, and I would like to diagnose the problem to prevent, as far as possible, the same thing from happening again. I need some help because, for instance, the syslog doesn't contain entries older than today. Any help would be greatly appreciated, thanks.

    • Official post

    There should be some older gzip'd syslog logs in the /var/log directory. Are you running any rsync scripts? What is the OS installed on?
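
A minimal sketch of how the rotated logs could be searched, assuming the usual Debian layout (`/var/log/syslog`, `syslog.1`, `syslog.2.gz`, and so on). The function name `search_md_logs` and the search pattern are illustrative choices, not something taken from this thread:

```shell
# Print md/RAID-related lines from the current and rotated syslog files.
# The optional argument is the log directory (assumed default: /var/log).
search_md_logs() {
    log_dir="${1:-/var/log}"
    # zgrep reads both plain and gzip-compressed files.
    # -h drops file names, -i ignores case, -E enables the alternation.
    zgrep -h -i -E 'md[0-9]+|mdadm|raid' "$log_dir"/syslog* 2>/dev/null
}

# Example:
#   search_md_logs /var/log | less
```

If the array dropped a member weeks ago, the kernel or mdadm message about it would most likely be in one of the older `.gz` files rather than in today's syslog.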

    omv 7.1.0-2 sandworm | 64 bit | 6.8 proxmox kernel

    plugins :: omvextrasorg 7.0 | kvm 7.0.13 | compose 7.2 | k8s 7.1.0-3 | cputemp 7.0.1 | mergerfs 7.0.5 | scripts 7.0.7


    omv-extras.org plugins source code and issue tracker - github - changelogs


    Please try ctrl-shift-R and read this before posting a question.

    Please put your OMV system details in your signature.
    Please don't PM for support... Too many PMs!

  • Hi, and thanks for helping.


    I went through almost all the logs and found nothing relevant, but I'm not an expert. I wonder if this could come from a disk failure. But since these disks are mirrored in the software RAID, I see no way to get a SMART assessment.


    No, I don't run any rsync scripts or cron jobs, except the default OMV tasks. The OMV version is the latest available (currently 0.5.48) and the system is updated regularly. Any idea how I can diagnose this issue?
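
On the SMART question raised above: with Linux md software RAID the member disks normally remain visible as ordinary block devices, so `smartctl` (from smartmontools) can usually be pointed at each disk directly. A small sketch, where the helper name `smart_verdict` is hypothetical and `/dev/sdX` stands for each member disk:

```shell
# Reads `smartctl -H` output on stdin and prints only the overall verdict
# (for ATA disks this is typically PASSED or FAILED!).
smart_verdict() {
    awk -F': *' '/overall-health self-assessment test result/ { print $2 }'
}

# Example (needs root; replace /dev/sdX with each member disk in turn):
#   smartctl -H /dev/sdX | smart_verdict
#   smartctl -a /dev/sdX        # full attribute table and error log
```

A passing verdict does not rule out a disk having dropped out of the array, so this complements the log search rather than replacing it.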

    • Official post

    What about the type of device the OS is installed on? USB stick? Hard drive? SSD?


  • Hard drive, seems clean based on SMART data.


    I think I've found something. I originally set up a single software RAID1 made of two WD 2 TB HDDs. But in OMV, I see this:


    Code
    <machine>:raid - /dev/md0 - clean,degraded - mirror - 1,82TiB - /dev/sdc
    <machine>:raid - /dev/md127 - clean,degraded - mirror - 1,82TiB - /dev/sdd


    That's pretty disturbing, don't you think? So I tried digging into the mdadm analysis tools.


    If I run: mdadm --query --detail /dev/md0, the output is:



    If I run: mdadm --query --detail /dev/md127, the output is:


    My newbie first thought would be: "my raid doesn't work at all". What do you think?

    • Official post

    I've seen that before. Your array must have become degraded a couple of weeks ago for your data to be old. Now you need to figure out which of the two arrays has the current data on it. What is the output of:


    mount
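
Besides `mount`, the md superblocks on the two disks can be compared. `mdadm --examine` prints an `Events` counter and an `Update Time` for each member; the member with the higher event count was written to more recently. A hedged sketch, with `examine_events` as a hypothetical helper name and the disk names taken from the OMV listing above:

```shell
# Reads `mdadm --examine <member>` output on stdin and prints the Events value.
examine_events() {
    awk -F' : ' '/^ *Events : / { print $2; exit }'
}

# Example (run as root):
#   mdadm --examine /dev/sdc | examine_events
#   mdadm --examine /dev/sdd | examine_events
```

It is worth also reading the full `--examine` output by eye before acting on the numbers, since the two disks now belong to two separate degraded arrays.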


  • Code
    /dev/md0 on /media/f605a663-0455-4580-9139-fe9350690b51 type ext4 (rw,noexec,_netdev,acl,user_xattr,usrjquota=aquota.user,grpjquota=aquota.group,jqfmt=vfsv0)


    No /dev/md127 in the output. I guess I have to rebuild my RAID, don't you think?
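
The same check can be scripted so that only md devices are listed. This is a sketch under the assumption that `mount` prints lines in the `device on mountpoint type fstype (options)` form shown above; `mounted_md` is a hypothetical name, and mount points containing spaces would be truncated:

```shell
# Reads `mount` output on stdin; prints each mounted md device and its mount point.
mounted_md() {
    awk '$1 ~ /^\/dev\/md/ && $2 == "on" { print $1, $3 }'
}

# Example:
#   mount | mounted_md
```

An array that does not appear here is assembled but not in use as a filesystem, which is the situation of /dev/md127 in this thread.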

    • Official post

    I would back up everything at this point and then try the following, which assumes the current data is on md127:


    Code
    mdadm --stop /dev/md0
    mdadm --remove /dev/md0
    mdadm --zero-superblock /dev/sdc
    mdadm --add /dev/md127 /dev/sdc


  • Hi,
    I finally fixed the issue by removing the RAID completely and re-creating it from scratch. As a bonus, I didn't lose any data: OMV (mdadm, actually) synced everything correctly.


    /dev/md127 must not be used; it's a sign that your RAID is in bad shape. So be aware: if you see /dev/md127, your RAID needs some attention.

    • Official post

    Glad to hear you got it working :)


    Quote from "romu"

    /dev/md127 must not be used; it's a sign that your RAID is in bad shape. So be aware: if you see /dev/md127, your RAID needs some attention.


    Not true. I have three systems running right now with raid at md127. It is the default starting number. The state column is what you should be looking at.
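
To illustrate checking the state rather than the device number: `/proc/mdstat` shows a member map per array, such as `[UU]` for a healthy two-disk mirror or `[U_]` when one member is missing. A minimal sketch that lists arrays with a missing member (`degraded_arrays` is a hypothetical helper name):

```shell
# Reads /proc/mdstat-style text on stdin and prints the name of every array
# whose member map contains "_" (a missing or failed member).
degraded_arrays() {
    awk '
        /^md[^ ]* :/ { name = $1 }        # remember which array this block describes
        match($0, /\[[U_]+\]/) {          # member map such as [UU] or [U_]
            if (index(substr($0, RSTART, RLENGTH), "_")) print name
        }
    '
}

# Example:
#   degraded_arrays < /proc/mdstat
```

`mdadm --detail /dev/mdX` reports the same information in its State line, which is what the OMV web interface displays as clean or clean,degraded.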
