Thread (52 messages) 52 messages, 9 authors, 2013-01-31

Re: Huge values of mismatch_cnt on RAID 6 arrays under Fedora 18

From: Chris Murphy <hidden>
Date: 2013-01-28 02:16:46

On Jan 27, 2013, at 6:42 PM, Chris Murphy [off-list ref] wrote:
The mismatch number is not divisible by 16, yet your chunk size is 16KB. It is divisible by 4 and 8, so I'm going to guess that the physical sector size is 4096 bytes. If correct, I'm coming up with a maximum of 346GiB worth of sectors may be adversely affected, assuming every sector in the mismatch count is bad (which is probably not true, but could be).
Since the upgrade to Fedora 18 for this particular RAID 6, can you estimate how much data has been written to the array? Could it be in the 90GiB to 350GiB range?

On Jan 27, 2013, at 7:07 PM, Brad Campbell [off-list ref] wrote:
Massive mismatch counts are indicative of an insidious problem further down the storage stack. Check your drivers, cards, cables and PSU.
Yeah, this may be difficult, but I think the array needs to be remounted ro or unmounted. The writes may be killing it. But there's not enough information yet to know, it's smoke but no fire so far.

man 4 md says the same, for raid 5 and 6, mismatches are not expected to be software problems, but much more likely hardware. But if it's true that the OP's problems started exactly with the upgrade to Fedora 18, that it could be a device driver. What HBA is being used?


Chris Murphy
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help