Thread (7 messages) 7 messages, 2 authors, 2020-08-15

Re: Confusing output of --examine-badblocks1 message

From: Roy Sigurd Karlsbakk <hidden>
Date: 2020-08-13 18:50:28

quoted
However, back to --examine-badblocks. It seems it's reporting the same sector
numbers in the list for several (up to eight) drives. If I understand this
correctly, something strange has hit and damanged all drives on fixed sector
numbers, such as this

Bad-blocks on /dev/sdm:
          436362944 for 128 sectors

It doesn't seem very likely, to be honest, that a lot of drives suddenly damage
the same sector at once. I can see the same occur on a friend's server -
sectors with identical 'bad' sector numbers been listed on individual drives.
It seems very likely the badblocks list is just replicated to new drives. I just
started

# mdadm /dev/md0 --replace /dev/sdb --with /dev/sdk

where sdk is a drive known to be good. It's about halfway through and it's
already copied part of the badblocks list. No I/O errors have been reported in
dmesg or otherwise.

Any idea how to remove this list and start over?
I just tried another approach, mdadm --remove on the spares, mdadm --examine on the removed spares, no superblock. Then madm --fail for one of the drives and mdadm --add for another, now spare for a few milliseconds until recovery started. This runs as it should, slower than --replace, but I don't care. After 12% or so, I checked with --examine-badblocks, and the same sectors are popping up again. This was just a small test to see i --replace was the "bad guy" here or if a full recovery would do the same. It does.

Vennlig hilsen

roy
-- 
Roy Sigurd Karlsbakk
(+47) 98013356
http://blogg.karlsbakk.net/
GPG Public key: http://karlsbakk.net/roysigurdkarlsbakk.pubkey.txt
--
Hið góða skaltu í stein höggva, hið illa í snjó rita.
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help