System crash

  • Hello, I am encountering a problem with my OMV, this one cash completely every Friday around 4:15 a.m. impossible to find logs that explain it. Do you have any clues?


    Release: 6.8.0-1

    Codename: Shaitan


    Thanks in advance.

  • macom

    Approved the thread.
  • I have access to the logs but no information gives me an idea of the cash my system is on an SSD. no flashmemory-pugin.

  • votdev

    Added the Label OMV 6.x
  • I have monitoring it no longer responds to ping and all my docker is not accessible :/



    • New
    • Official Post

    I have monitoring it no longer responds to ping and all my docker is not accessible

    Could it be a network issue? To find out you could connect a display and keyboard to see if you have access locally once it is not available via network.


    I hope somebody else has other ideas how to find out what is going on.

    • New
    • Official Post

    I have monitoring it

    Quote

    ...

    Sep 16 02:36:42 apollon smartd[5047]: Device: /dev/disk/by-id/ata-WDC_WD40EFZX-68AWUN0_WD-WXA2DA1LHX9R [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 115 to 114

    ...


    16 de septiembre 02:36:42 apollon smartd[5047]: Dispositivo: /dev/disk/by-id/ata-ST4000VN006-3CW104_ZW60BQH0 [SAT], Atributo de prefallo SMART: 1 Raw_Read_Error_Rate cambió de 77 a 78

    ...

    That log shows impossible temperatures on some hard drives. I doubt that any hard drive will continue to operate at 115ºC. Even temperatures of 65ºC would be worrying. I assume this is a SMART readings error. I would try to find out what the real temperatures are.

    In addition there is also some recording of ascending prefault values.


    Aside from hard drives, I would investigate container activity. I would stop them all to see if the errors mentioned continue to occur. If the failures stop occurring, then one of the containers would be the culprit. So you would have to try one by one.

  • Thank you for your help


    I have already explored the track of my containers and the crash system without any depending on the hardware problem but I would not have a crash at the same time. no visible planning task on Friday at 4:15 a.m.


    I'm thinking of reinstalling my system.

  • No I use it for my docker with only a basic smb service.

    That's why I'm asking if there isn't a programmed task in the system that could explain my problem.


    Thank you very much for your help, sorry for my English translation

    • New
    • Official Post

    I can't think of anything else, sorry.

    I'm thinking of reinstalling my system.

    If you are going to reinstall this could help you.

    Simply make a backup with omv-regen, disconnect the current system drive and do a clean install of OMV on a different drive. You can keep your current drive in case you want to go back.

    Then regenerate the system with omv-regen and continue working as before. Then see if your system continues to behave in the same way. If he stops doing it you will have the problem solved. If it continues to behave the same, it is because some container or some configuration you have made is causing this. The only thing that omv-regen does is reproduce the current configuration on a new system by copying the existing information in your database, from there everything is reconfigured as it was on the original server. But it is a new system.

Participate now!

Don’t have an account yet? Register yourself now and be a part of our community!