hey guys,
I’m having an issue either with OMV(software) or my system(hardware), and I’m hoping i can describe my issues here and get some help if its OMV. I’ve reinstalled OMV about a week ago and everything was running great until about 4 days ago. after a reboot the system wouldn’t finish booting and threw out a lot of errors.
usb 2-2 device descriptor read/64, error -71
xhci_hcd 00:0:0:0 setup error: setup context command for slot 1
usb 2-2 hub failed to enable device
i figured after a few hours that it was the keyboard that i had plugged into the usb3.0 terminal and when i plugged it into the 2.0 terminal system booted no errors no problem. this was using the new kernel so i was surprised that usb 3.0 wasn’t working especially since i enabled usb 3.0 in the mobo bios.
the next morning i woke up and my Nasbox was hard locked up; unreachable by smb/ssh/ftp etc, so i had to hold down the power and start it up. I thought it was a fluke until the next morning I woke up and found the nasbox unresponsive yet again.
at this point i started looking up potential issues. The system logs didn’t show anything out of the ordinary other than the timestamp of a “flushall” was 7 hours before the rebooting logging started. So to me it seemed as though whatever happened wasn’t being logged.
full disclosure, I’m still new to linux and couldn’t find an error log so i was only going off of the syslog in the GUI.
steps I’ve taken 2 nights ago.
I thought the drives might be overheating so i separated my 4 drives from 1 bay into 2 x 2 bays.
I plugged the sata cables into different terminals on my motherboard after checking for faulty cabling.
i uninstalled all plugins except for plex.
after a few reboots, i started watching some vids and during playback the nasbox froze yet again after about 30 min of video watching. I rebooted once and all seemed fine for the night.
then this morning i woke up to a frozen box again so i held down power to reboot again. this time when i rebooted, I received a few errors on the startup screen which lead me to believe a hard drive is failing.
33.888139 ata2.00 failed command: read dma
43.295999 ata2.00 revalidation failed (errno=-5
end_request i/0 error, dev sad sect 8
ect…
I let the machine boot all the way up and one of my raids was offline so i rebooted, and this time i used the old 3.2.0 kernel vs the 3.16 kernel. The machine booted all the way in and the raid was there so i started backing up the media on that drive to another drive and during the transfer the box locked up frozen again. Im not sure if a bad drive would totally freeze the machine especially if its just been sitting for hours. and thats where I’m at right now.
so I think there are a few issues i could possibly be facing.
Maybe my OMV installation got borked.
maybe my HDD is borked causing system lockups
maybe my cpu is overheating, although I’ve got a serious fan on it and around 6 case fans with serious airflow.
maybe ram is borked?
so many variables i don’t know where to start. I can’t find an error log to point me in the right direction. Ill post logs (if i can figure out how) when i get home from work later, i just wanted to get this down before i forgot everything I’ve done.
anybody have any ideas? what other information do you need from me?