Thread (23 messages) 23 messages, 7 authors, 2009-11-17

Re: RAID 6 Failure follow up

From: Ryan Wagoner <hidden>
Date: 2009-11-10 12:45:50

Boot up a CentOS 5 LiveCD. It should detect your arrays and try
running smartctl. From my experience with different distros I have
found that Red Hat spends a good amount of time making sure enterprise
hardware is stable on their system. Ubuntu seems to focus more on
desktops.

Ryan

On Tue, Nov 10, 2009 at 5:55 AM, Andrew Dunn [off-list ref] wrote:
I am able to reproduce this smart error now. I have done it twice, so
maybe other things are causing this also.

When I scanned the devices this morning with smartctl via webmin I lost
8 of the 9 drives. They are howerver still in my /dev folder.

Now I sent out my logs from the first failure last night, smartctl was
on the system... I dont know if ubuntu server's default smartd
configuration makes it do periodic scans because I didnt change anything.

I would hate to move back to 9.10 and see this problem again.

Should I just not install smartmontools? This seems like a bad solution
because now I wont be able to check the drives in advance for failures.

Have you installed LSI's linux drivers? Some people say this solves
their issue.

From the logs sent out last night do you think it could be something else?

Thanks a ton,

Gabor Gombas wrote:
quoted
On Mon, Nov 09, 2009 at 05:08:23AM -0500, Andrew Dunn wrote:

quoted
does it momentarily offline the disks? like they re-appear in /dev
within moments? That would be similar behavior to what I am
experiencing, the disks drop from the array, but they are in /dev by the
time I get a chance to see them.
No, either the disks need to be physically removed and re-inserted, or
the machine needs to be rebooted.

Gabor
--
Andrew Dunn
http://agdunn.net
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help