kernel rrdtool segfault error every 15min

  • it works in the shell with this command


    smartctl -x /dev/nvme0


    === START OF SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED


    SMART/Health Information (NVMe Log 0x02, NSID 0xffffffff)
    Critical Warning: 0x00
    Temperature: 49 Celsius
    Available Spare: 100%
    Available Spare Threshold: 5%
    Percentage Used: 2%
    Data Units Read: 1,599,392 [818 GB]
    Data Units Written: 2,948,543 [1.50 TB]
    Host Read Commands: 17,379,756
    Host Write Commands: 24,023,608
    Controller Busy Time: 119
    Power Cycles: 96
    Power On Hours: 3,522
    Unsafe Shutdowns: 62
    Media and Data Integrity Errors: 0
    Error Information Log Entries: 176
    Warning Comp. Temperature Time: 0
    Critical Comp. Temperature Time: 0


    Error Information (NVMe Log 0x01, max 63 entries)
    Num ErrCount SQId CmdId Status PELoc LBA NSID VS
    0 176 0 0x0015 0x4004 0x004 0 1 -
    1 175 0 0x0015 0x4004 0x004 0 1 -
    2 174 0 0x001d 0x4004 0x004 0 1 -
    3 173 0 0x001d 0x4004 0x004 0 1 -
    4 172 0 0x0004 0x4004 0x004 0 1 -
    5 171 0 0x0004 0x4004 0x004 0 1 -
    6 170 0 0x001d 0x4004 0x004 0 1 -
    7 169 0 0x001d 0x4004 0x004 0 1 -
    8 168 0 0x001c 0x4004 0x004 0 1 -
    9 167 0 0x001d 0x4004 0x004 0 1 -
    10 166 0 0x0015 0x4004 0x004 0 1 -
    11 165 0 0x001d 0x4004 0x004 0 1 -
    12 164 0 0x001c 0x4004 0x004 0 1 -
    13 163 0 0x0015 0x4004 0x004 0 1 -
    14 162 0 0x001c 0x4004 0x004 0 1 -
    15 161 0 0x001c 0x4004 0x004 0 1 -
    ... (47 entries not shown)

  • Maybe the nvme drive is getting hot (especially when the system is under load?) and doing bad things? Otherwise, I would go back to the ram check.

    omv 5.5.17-3 usul | 64 bit | 5.4 proxmox kernel | omvextrasorg 5.4.2
    omv-extras.org plugins source code and issue tracker - github


    Please read this before posting a question.
    Please don't PM for support... Too many PMs!

  • it works in the shell with this command


    smartctl -x /dev/nvme0

    This specific SMART issue is because smartmontools is not updated in OMV5. The bug has been solved by smartmontools team, but the updated version is not showing up. Found it out just a while ago.
    NVMe SMART status reading failure (wrong drive name)

    OMV BUILD - MY NAS KILLER - OMV 5.x + omvextrasorg


    Core i3-8300 - ASRock H370M-ITX/ac - 8GB RAM - Sandisk Ultra Flair 32GB (OMV), 256GB NVME SSD (Docker), 3x4TB HDD (Data) - Fractal Design Node 304 - Be quiet! Pure Power 11 350W

Participate now!

Don’t have an account yet? Register yourself now and be a part of our community!