Dear All,
after several days using OMV7 my Apacer nvme module(s) started to "disappear" in GUI (Storage - Disks), my raid1 md0 array shows "clean, degraded" in GUI (Storage - Multiple Device).
First I thought that this was a nvmeHW storage failure, but after I had rebooted the Apacer nvme module(s) reappeared, raid array showed "clean" and all data were OK again. I did not even have to rebuild the md0 raid1 array as it showed "clean" straight away!
But this takes another hour or two, sometimes 5 minutes, sometimes a day and the situation with missing Apacer module(s), degraded raid1 array and unreachable data repeats ......until next reboot Sometimes only one Apacer nvme module disappears, sometimes both.
My modules are not overheating, all 4 blue slot diodes always show as active.
I don't generate any load either. On Apacer modules (md0 RAID) I just run docker, docker data mapped and
qbittorrent in Docker Compose.
What should I do as part of root cause analysis? Could anybody give me hints please?
Details follow:
I have OMV7 on RPi5 using Suptronics - X1011 M.2 NVMe 4 SSD shield and I have the following nvme modules (status BEFORE the Apacer modules disappear):
Status AFTER Apacer module(s) start to disappear:
mdadm -D /dev/md0:
Both Apacer modules missing until next restart:
DMESG
https://pastebin.com/SgpcfnHM (problems seem to start at [ 738.038272])