Thread (13 messages) 13 messages, 7 authors, 2003-07-08

Re: Checking consistency of Linux software RAID

From: Bernd Schubert <hidden>
Date: 2003-07-07 18:29:14

On Monday 30 June 2003 14:58, Martin Bene wrote:
Hi,

Administrationg quite a few systems with HW raid controllers, I've come to
really like a feature that seems to be missing from current SW raid:

Scheduling a (weekly) complete media scan where all surfaces of all drives
get read; in case of read errors a repair is tried: the content for the
failed sector is reconstructed just as if the drive had completely failed
and rewritten to the failed sector; if reading works afterwards, regard the
repair as successfull and continue using the drive.

Is there any way to do this with SW raid? I truly hate situations where
some sectors on a drive fail silently and you don't notice until a 2nd
drive dies and you find you can't recostruct your raid data becaus of
silent "bitrot".
Hi,

/proc/mdstat is to monitor the status of your raid, so when one drive fails it 
becomes dropped out of the raid-array. Using mdadm you can monitor 
/proc/mdstat and it even can send you a mail when one of your disks fails. So 
if you really want to scan your disk once a week, why not running 'dd 
if=/dev/mdX of=/dev/zero' ? So every block of every raid-disk should become 
read and the md-driver should automatically drop a failing disk  out of the 
raid. 
I guess you could even try to repair a disk when it became dropped out of the 
raid by running some scripts, but since I never trusted any disk that had 
failed ones, I never worried about it.

Bernd
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help