Thread: 27 messages, 6 authors, 2013-07-04

Re: Mdadm server eating drives

From: Stan Hoeppner <hidden>
Date: 2013-07-02 01:57:57

On 7/1/2013 7:17 PM, Barrett Lewis wrote:
> I am very sorry to keep bugging this list, but I am really lost.

I apologize as I just noticed this thread.  If I'd jumped in sooner you
might already have it fixed.  I pulled your previous posts from my
archive folder and read with interest.

> I noticed one drive was going up and down and determined that
> the drive had actual physical damage to the power connector and
> was losing and regaining power through vibration.

This intermittent contact could have damaged the PSU.  You've continued
to have drive and lockup problems since replacing the drive with the bad
connector.

The elephant in the room is thermal failure due to insufficient
airflow.  The symptoms you describe sound like drives overheating.  What
chassis is this?  Make/model please.  If you've installed individual
drive hot swap cages, etc, it would be helpful if you snapped a photo or
two and made those available.
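
One way to test the overheating theory is to log drive temperatures
under load.  Below is a minimal sketch, assuming smartmontools is
installed and the drives report temperature in SMART attribute 194 or
190 (this varies by vendor).  The function names, the sample text and
the /dev/sda path are illustrative only, not taken from this thread.

```python
import re
import subprocess

# SMART attribute IDs that commonly carry temperature (assumption: vendor-dependent).
TEMP_ATTR_IDS = {190, 194}


def parse_temperatures(smartctl_output):
    """Return {attribute_id: degrees_C} parsed from `smartctl -A` text."""
    temps = {}
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # the raw value begins in the 10th column.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attr_id = int(fields[0])
        if attr_id in TEMP_ATTR_IDS:
            match = re.match(r"\d+", fields[9])
            if match:
                temps[attr_id] = int(match.group())
    return temps


def read_drive_temperatures(device):
    """Run `smartctl -A` on one device (usually needs root) and parse the result."""
    result = subprocess.run(
        ["smartctl", "-A", device],
        capture_output=True, text=True, check=False,
    )
    return parse_temperatures(result.stdout)


# Made-up sample in the usual `smartctl -A` table layout, for illustration only.
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  9 Power_On_Hours          0x0032   095   095   000    Old_age   Always       -       4321
194 Temperature_Celsius     0x0022   112   098   000    Old_age   Always       -       38 (Min/Max 21/52)
"""

# Example call on a real system (hypothetical device path):
#   read_drive_temperatures("/dev/sda")
```

Running this against every array member during a resync or heavy write
load, and comparing the readings with the drive vendor's rated operating
range, should show whether airflow is the problem.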

I've seen many instances of this type of failure over the years.  The
causes, in order of prevalence, are:

1.  Failed cheap backplane
2.  Insufficient airflow
3.  Failed or cheap PSU
4.  Failed HBA (or Southbridge)

-- 
Stan