Thread (13 messages) 13 messages, 4 authors, 2015-05-22

Re: Recent drive errors

From: Mikael Abrahamsson <hidden>
Date: 2015-05-22 13:38:06

On Thu, 21 May 2015, Thomas Fjellstrom wrote:
quoted
for x in /sys/block/sd[a-z] ; do
         echo 180  > $x/device/timeout
done
I presume it's ok to do that even if the drives do ERC/TLER? Just woke up, but
my brain seems to be telling me it shouldn't break anything since the ERC
drives should always return after 7s no matter what...
Correct, the only downside is that if the drive really dies, it's going to 
take longer to detect this.

I'd rather have longer timeouts to make sure drives never kicked out 
because of lack of ERC or something else, than to have drives kicked for 
some reason (controller reset, lack of ERC, or something else).

I really really want to avoid drives being kicked for any other reason 
than them really being dead. I'd rather have reads stalled for a few 
minutes than this happening. For other use-cases, the requirements may be 
different.

-- 
Mikael Abrahamsson    email: swmike@swm.pp.se
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help