[Solved] OMV unexpected reboot - softdog unexpected close

  • Hi all,



    on a fresh installation of OMV, (sardaukar 0.5.46 on Pendium d with 3GB RAM) using 2x3TB WD in RAID1 and Plex installed using commandline and other official plug-in, I'm experimenting unesxpeted reboot
    What i've seen on console monitor is a message "softdog unexpected close not stopping watchdog" and after few secondo a system reboot. :roll:


    The system log doens't say anything but minor (usb errors).


    I'm not able to say if it appens most when I stream video or not at moment.
    Any idea what's happening and why ? Anyone has experienced the same or similar issues?


    thnks,



    --


    Frecurring

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

    2 Mal editiert, zuletzt von frecurring ()

  • Hi votdev,


    thanks for your reply.


    So every time CPU goes 100% i should expect a reboot ?
    With my old installation using same sowftware but old HD (for test purposes) I've never experienced the problem.


    How may I debug in detail what's causign the softdog not being touched ?


    --


    Frecurring

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

    • Offizieller Beitrag

    No, the watchdog does not get triggered when CPU goes >= 100%, otherwise it would be a really bad feature or behaviour.
    YOu have to find out what causes the watchdog daemon not to write to the device file which is checked by the kernel every N seconds. If this write does not happen within this time the kernel triggers the reboot. So you have to find out if it is possible that the watchdog daemon was killed unexpected/unregulary..

  • Hi votdev,


    i'm trying to understand better the problem.


    I see during the omv startup process


    ...
    starting watchdog keepalive daemon: wd_keepalive
    stopping watchdog keepalive daemom
    stopping watchdog daemon
    ...


    checking ps -ef and wd_keepalive is not running


    I can confirm softdog message and reboot happens whenduring PLEX Media content streaming to my TV.


    does it is normal the wd_keepalive start and stop during start-up process and is not running ?


    thanks

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

  • Zitat


    I can confirm softdog message and reboot happens whenduring PLEX Media content streaming to my TV.


    Volker said it could be due to high load that the watchdog gets triggered. What has your NAS for specs? How high is the load when streaming content?


    Greetings
    David

    "Well... lately this forum has become support for everything except omv" [...] "And is like someone is banning Google from their browsers"


    Only two things are infinite, the universe and human stupidity, and I'm not sure about the former.

    Upload Logfile via WebGUI/CLI
    #openmediavault on freenode IRC | German & English | GMT+1
    Absolutely no Support via PM!

  • Hi,


    sardaukar 0.5.46 on Pendium D 2.8 GHz with 3GB RAM, WD red RAID 1
    CPU go to 100% during streaming if this is what you mean with "how high ..."


    Receiving also some faithful email
    Date: Thu, 08 May 2014 05:37:32 +0100
    Action: alert
    Host: OMVHome
    Description: cpu user usage of 100.0% matches resource limit [cpu user usage>95.0%



    BTW
    Yesterday i've experimented reboot and Plex was non streaming anything :cry:

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

  • HI all,


    coming back to home I've discovered that the RAID 1 (just setup few days ago with a fresh OMV installation) in resynch state :cry:


    the output of the mdstat if the following


    root@OMVHome:~# cat /proc/mdstat
    Personalities : [raid1]
    md127 : active raid1 sdb[0] sda[1]
    2930265424 blocks super 1.2 [2/2] [UU]
    [>....................] resync = 1.1% (34032128/2930265424) finish=2864.6min speed=16850K/sec

    unused devices: <none>


    and trying to improve speed resynch again


    root@OMVHome:~# mdadm --grow --bitmap=internal /dev/md127
    mdadm: failed to set internal bitmap.


    i'm wondering if there is something wrong with HD or with HW in general


    the resynch state is due to continue reboot in your opinion ?
    any idea / suggestion ?


    thanks

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

  • HI all,



    cause week-end is coming I'm going to enjoy free-time trying to solve the issues encountered.


    I've found one error in one disk as follows :



    My idea is start from scratch and trying to rebuild RAID 1 array again.


    So my question what should be the right procedure/steps to rightly dismantle the RAID 1 array ?
    do I need to erase the superblock information before reuse the two disks on array building ?
    because the HDD is still in the return back time policy do you think I should return the HDD ?



    thanks for your help,

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

  • I finally found that there was a faulty RAM memory banks


    I removed the RAM banks and the problem has disappeared but in the meantime it has corrupted the filesystem


    I re-installed everything again.

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

  • It was really a PAIN and time consuming :o


    the real error I did is introduce 2 different impovements at the same time.


    I introduced the
    1) RAM banks and
    2) reinstall everything using a new Disk


    Fre

    regards
    frecurring


    ---------------------------------
    MOS 7501 & TED 7370
    1,76 MHz with 16kB of RAM
    No HDD but I was excited

Jetzt mitmachen!

Sie haben noch kein Benutzerkonto auf unserer Seite? Registrieren Sie sich kostenlos und nehmen Sie an unserer Community teil!