Thread (6 messages) 6 messages, 4 authors, 2013-01-26

Re: Recovering from an URE on a RAID5 rebuild/resize

From: Chris Murphy <hidden>
Date: 2013-01-25 20:43:42

On Jan 25, 2013, at 1:28 PM, Chris Murphy [off-list ref] wrote:
In a way you'd be better off just blowing away that whole stripe
http://smartmontools.sourceforge.net/badblockhowto.html

So if you do a smartctl -t long on the affected drive, it will stop the test at the first bad sector and report the LBA. If you write zeros to that LBA only, the drive firmware will "fix" the problem, but now you have zeros for that sector.

Note how this document says part of the data in the file will be overwritten with zeros. Well, this is not a file, it's part of an md chunk, which is part of a stripe, then it's a file system, then it's a file. So by zeroing one sector, you've got ambiguous data in the whole stripe.

But, even if you have a totally wrong reconstruction on a whole stripe, at least the rest of the rebuild has a chance to proceed without further URE. So then it's an open question what was affected. If it's a movie, maybe you lose 5 seconds of video or sound, depending on how it's encoded. Depending the player's tolerances, maybe it crashes. Who knows.

Chris Murphy
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help