Consistent crashing problems with new setup

    • Consistent crashing problems with new setup

      So I've set up a new openmediavault box with some plugins, but I'm getting consistent crashes roughly every hour while I'm migrating data. My setup is OMV + aufs plugin + transmission; I'm sharing the aufs pool, writing my own files to it and having transmission save to it as well. The odd thing is that everything seems to die (web UI, SMB, transmission), yet if I go to the PC itself I can log in as root:password and it works (using admin:password just brings up the login prompt again). I tried looking through the logs but can't see anything that really repeats.
      EDIT: This may even be a networking problem where the connection is dropped for some reason; I need to look into this further. Restarting the networking service did not help on the first try, though.

      Here is a syslog excerpt from shortly before the crash (which happens at around 10:11 pm, 22:11 in the log), the crash itself, and then me logging in as root and running sudo reboot:

      Nov 30 22:00:01 qube-store /USR/SBIN/CRON[3641]: (root) CMD (/usr/sbin/omv-mkgraph >/dev/null 2>&1)
      Nov 30 22:00:01 qube-store rrdcached[2307]: Received FLUSHALL
      Nov 30 22:09:01 qube-store /USR/SBIN/CRON[3866]: (root) CMD ( [ -x /usr/lib/php5/maxlifetime ] && [ -x /usr/lib/php5/sessionclean ] && [ -d /var/lib/php5 ] && /usr/lib/php5/sessionclean /var/lib/php5 $(/usr/lib/php5/maxlifetime))
      Nov 30 22:11:41 qube-store kernel: [ 4475.108583] r8169 0000:02:00.0: eth0: link up
      Nov 30 22:13:11 qube-store kernel: [ 4565.776721] r8169 0000:02:00.0: eth0: link up
      Nov 30 22:13:13 qube-store shutdown[3929]: shutting down for system reboot
      Nov 30 22:13:13 qube-store init: Switching to runlevel: 6
      Nov 30 22:13:14 qube-store monit[2442]: Shutting down monit HTTP server
      Nov 30 22:13:14 qube-store watchdog[2477]: stopping daemon (5.12)
      Nov 30 22:13:14 qube-store monit[2442]: monit HTTP server stopped
      Nov 30 22:13:14 qube-store monit[2442]: monit daemon with pid [2442] killed
      Nov 30 22:13:14 qube-store monit[2442]: 'localhost' Monit stopped
      Nov 30 22:13:19 qube-store wd_keepalive[3965]: starting watchdog keepalive daemon (5.12):
      Nov 30 22:13:19 qube-store wd_keepalive[3965]: int=10 alive=/dev/watchdog realtime=yes
      Nov 30 22:13:19 qube-store wd_keepalive[3965]: hardware wartchdog identity: Software Watchdog
      Nov 30 22:13:19 qube-store wd_keepalive[3965]: unable to disable oom handling!
      Nov 30 22:13:19 qube-store avahi-daemon[2373]: Got SIGTERM, quitting.
      Nov 30 22:13:19 qube-store collectd[2326]: Exiting normally.
      Nov 30 22:13:19 qube-store collectd[2326]: collectd: Stopping 5 read threads.
      Nov 30 22:13:19 qube-store rrdcached[2307]: caught SIGTERM
      Nov 30 22:13:19 qube-store rrdcached[2307]: starting shutdown
      Nov 30 22:13:19 qube-store avahi-daemon[2373]: Leaving mDNS multicast group on interface eth0.IPv4 with address 192.168.3.113.
      Nov 30 22:13:20 qube-store avahi-daemon[2373]: avahi-daemon 0.6.31 exiting.
      Nov 30 22:13:20 qube-store rrdcached[2307]: clean shutdown; all RRDs flushed
      Nov 30 22:13:20 qube-store rrdcached[2307]: removing journals
      Nov 30 22:13:20 qube-store rrdcached[2307]: removing old journal /var/lib/rrdcached/journal/rrd.journal.1417399044.511545
      Nov 30 22:13:20 qube-store rrdcached[2307]: removing old journal /var/lib/rrdcached/journal/rrd.journal.1417402644.512710
      Nov 30 22:13:20 qube-store rrdcached[2307]: goodbye
      Nov 30 22:13:27 qube-store acpid: exiting
      Nov 30 22:13:27 qube-store wd_keepalive[3965]: stopping watchdog keepalive daemon (5.12)
      Nov 30 22:14:26 qube-store kernel: imklog 5.8.11, log source = /proc/kmsg started.
      Nov 30 22:14:26 qube-store rsyslogd: [origin software="rsyslogd" swVersion="5.8.11" x-pid="2168" x-info="http://www.rsyslog.com"] start
      Nov 30 22:14:26 qube-store kernel: [ 0.000000] Initializing cgroup subsys cpuset
      Nov 30 22:14:26 qube-store kernel: [ 0.000000] Initializing cgroup subsys cpu
      Nov 30 22:14:26 qube-store kernel: [ 0.000000] Linux version 3.2.0-4-amd64 (debian-kernel@lists.debian.org) (gcc version 4.6.3 (Debian 4.6.3-14) ) #1 SMP Debian 3.2.63-2+deb7u1
      Nov 30 22:14:26 qube-store kernel: [ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-4-amd64 root=UUID=af7cd085-289c-4575-80dc-5715c685ada7 ro quiet
      Nov 30 22:14:26 qube-store kernel: [ 0.000000] BIOS-provided physical RAM map:


      It then died again at around 10:30 pm (22:30 in the log) for some reason, at which point I hit the restart button.

      Nov 30 22:14:33 qube-store monit[2421]: State file '/var/lib/monit/state': Unable to read magic
      Nov 30 22:14:38 qube-store watchdog[2456]: starting daemon (5.12):
      Nov 30 22:14:38 qube-store watchdog[2456]: int=1s realtime=yes sync=no soft=no mla=0 mem=0
      Nov 30 22:14:38 qube-store watchdog[2456]: ping: no machine to check
      Nov 30 22:14:38 qube-store watchdog[2456]: file: no file to check
      Nov 30 22:14:38 qube-store watchdog[2456]: pidfile: no server process to check
      Nov 30 22:14:38 qube-store watchdog[2456]: interface: no interface to check
      Nov 30 22:14:38 qube-store watchdog[2456]: test=none(0) repair=none(0) alive=/dev/watchdog heartbeat=none temp=none to=root no_act=no
      Nov 30 22:14:38 qube-store watchdog[2456]: hardware wartchdog identity: Software Watchdog
      Nov 30 22:15:01 qube-store /USR/SBIN/CRON[2557]: (root) CMD (/usr/sbin/omv-mkgraph >/dev/null 2>&1)
      Nov 30 22:15:01 qube-store rrdcached[2218]: Received FLUSHALL
      Nov 30 22:15:03 qube-store monit[2421]: Starting monit HTTP server at [localhost:2812]
      Nov 30 22:15:03 qube-store monit[2421]: monit HTTP server started
      Nov 30 22:15:03 qube-store monit[2421]: 'localhost' Monit started
      Nov 30 22:17:01 qube-store /USR/SBIN/CRON[2685]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
      Nov 30 22:17:01 qube-store postfix/postsuper[2688]: fatal: scan_dir_push: open directory hold: No such file or directory
      Nov 30 22:29:43 qube-store kernel: [ 934.292408] r8169 0000:02:00.0: eth0: link up
      Nov 30 22:30:01 qube-store /USR/SBIN/CRON[3381]: (root) CMD (/usr/sbin/omv-mkgraph >/dev/null 2>&1)
      Nov 30 22:30:01 qube-store rrdcached[2218]: Received FLUSHALL
      Nov 30 22:33:38 qube-store kernel: imklog 5.8.11, log source = /proc/kmsg started.
      Nov 30 22:33:38 qube-store rsyslogd: [origin software="rsyslogd" swVersion="5.8.11" x-pid="2454" x-info="http://www.rsyslog.com"] start
      Nov 30 22:33:38 qube-store kernel: [ 0.000000] Initializing cgroup subsys cpuset
      Nov 30 22:33:38 qube-store kernel: [ 0.000000] Initializing cgroup subsys cpu
      Nov 30 22:33:38 qube-store kernel: [ 0.000000] Linux version 3.2.0-4-amd64 (debian-kernel@lists.debian.org) (gcc version 4.6.3 (Debian 4.6.3-14) ) #1 SMP Debian 3.2.63-2+deb7u1
      Nov 30 22:33:38 qube-store kernel: [ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-4-amd64 root=UUID=af7cd085-289c-4575-80dc-5715c685ada7 ro quiet
      Nov 30 22:33:38 qube-store kernel: [ 0.000000] BIOS-provided physical RAM map:
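
      In a longer syslog it can be tedious to spot which boots followed a clean shutdown and which followed a hang or hard reset, and whether an "eth0: link up" message shows up before each one. A rough Python sketch of that check might look like the following. It is only an illustration: the names summarize_boots and summarize_file are made up here, the marker strings are copied from the excerpts above, and treating an rsyslogd "start" line as a boot is an assumption (restarting the rsyslog service by hand would log the same line).

```python
import re

# Marker strings copied from the syslog excerpts in this post (assumptions:
# other rsyslog or driver versions may word these messages differently).
SHUTDOWN_MARKER = "shutting down for system reboot"
LINK_MARKER = "eth0: link up"
BOOT_PREFIX = "rsyslogd: [origin software"
BOOT_SUFFIX = "] start"

# Classic syslog timestamp, e.g. "Nov 30 22:13:13"
TIMESTAMP = re.compile(r"^(\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})")


def summarize_boots(lines):
    """Return one dict per boot found in an iterable of syslog lines.

    "clean" is True when a shutdown message was logged since the previous
    boot. The first boot in a file is reported as not clean if the file
    starts after the matching shutdown message.
    """
    boots = []
    saw_shutdown = False
    link_events = []
    for raw in lines:
        line = raw.strip()
        match = TIMESTAMP.match(line)
        if not match:
            continue  # skip lines without a syslog timestamp
        stamp = match.group(1)
        if SHUTDOWN_MARKER in line:
            saw_shutdown = True
        elif LINK_MARKER in line:
            link_events.append(stamp)
        elif BOOT_PREFIX in line and line.endswith(BOOT_SUFFIX):
            boots.append(
                {"boot": stamp, "clean": saw_shutdown, "link_up_before": link_events}
            )
            # Reset the state for the next boot cycle.
            saw_shutdown = False
            link_events = []
    return boots


def summarize_file(path="/var/log/syslog"):
    """Convenience wrapper: summarize a syslog file on disk."""
    with open(path, errors="replace") as handle:
        return summarize_boots(handle)
```

      Run over the two excerpts above, the 22:14:26 boot should come out as clean (it follows the sudo reboot) and the 22:33:38 boot as not clean (it follows the restart button), each with the link-up timestamps that preceded it. Calling summarize_file() needs read access to the log, so it would normally be run as root.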

      Here is a more complete log of the most recent crash:
      pastebin.com/hYet99Fh

      And full log here:
      drive.google.com/file/d/0B4p6B…HUGdSU3M/view?usp=sharing

      EDIT1: Going to try disabling transmission to see if that changes anything.
      EDIT1.5: This did nothing; it crashed about 5 minutes later, interrupting the file transfer once more.
      EDIT2: Set a static IP outside the DHCP range, which didn't seem to make a difference. I then upgraded the kernel and left the box powered on overnight, and it did not disconnect during that time. Back to transferring files from the Windows machine; I'll need to see whether it continues to disconnect.

      The post was edited 2 times, last by magixx.

    • Please install kernel 3.16 via omv-extras and try again. It may give you better support for your hardware.
      "Glowing days. Don't cry because they are over. Smile because they happened." - Confucius

      Server: 1x 32GB SSD (system) - 5x 2TB Data - 1x 2TB Snapraid-Parity - latest OMV 1.x
      No Support through PM