Thread (16 messages) 16 messages, 8 authors, 2004-09-02

Re: Driver retries disk errors.

From: Alan Cox <hidden>
Date: 2004-08-31 15:17:57
Also in: lkml

On Maw, 2004-08-31 at 14:54, Rogier Wolff wrote:
So, can we agree on: 
	- might be needed for 
		- Floppies?
		- MO drives
		- older drives
Other random stuff it saves our backside on we don't know about.
How about we set the num-retries to 1, and increase to 8 for
"weird devices" (floppy, MO), and older drives. 
Disagree. I want it robust. If you want to set low retry counts then
the user should do so for special cases like forensics.
I do want to make the num_retries thing a configurable parameter,
should the autodetect get it wrong: We get drives that we want to
recover without the kernel-level retries...
Making it configurable is good, but I can't help feeling that this
belongs at the block layer - I wonder what Jens thinks, it might well
have to be done by the driver because only the driver knows enough but
the ioctl/config option ought to be common.
(still: I argue that you need to consider a "retry-works" error as an
early warning that your media is going bad, and you need to get your
data off ASAP! If the kernel silently retries and succeeds, the user
won't notice a thing and continue using the drive (or MO media) until
the error becomes irrecoverable. I recommend we put the retry at the
user level. As in "person behind keyboard".)
M/O media retries generally do the right thing and have the right
effect. If you want to know if your drive is failing use SMART and ask
the drive

Remember: Storage appliance not disk. Treat it like a storage
appliance and you'll get better results.

Alan
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help