Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry

[RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Allison Henderson <hidden> · 2018-11-28
[PATCH v1 2/7] block: expand write_hint of bio/request to rw_hint · Allison Henderson <hidden> · 2018-11-28
[PATCH v1 1/7] block: add nr_mirrors to request_queue · Allison Henderson <hidden> · 2018-11-28
[PATCH v1 3/7] md: raid1: handle bi_rw_hint accordingly · Allison Henderson <hidden> · 2018-11-28
[PATCH v1 4/7] xfs: Add b_rw_hint to xfs_buf · Allison Henderson <hidden> · 2018-11-28
Re: [PATCH v1 4/7] xfs: Add b_rw_hint to xfs_buf · Dave Chinner <david@fromorbit.com> · 2018-11-28
[PATCH v1 6/7] xfs: Rewrite retried read · Allison Henderson <hidden> · 2018-11-28
Re: [PATCH v1 6/7] xfs: Rewrite retried read · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [PATCH v1 6/7] xfs: Rewrite retried read · Darrick J. Wong <hidden> · 2018-11-28
Re: [PATCH v1 6/7] xfs: Rewrite retried read · Dave Chinner <david@fromorbit.com> · 2018-11-28
[PATCH v1 5/7] xfs: Add device retry · Allison Henderson <hidden> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Darrick J. Wong <hidden> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Christoph Hellwig <hch@infradead.org> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Bob Liu <hidden> · 2018-11-28
Re: [PATCH v1 5/7] xfs: Add device retry · Allison Henderson <hidden> · 2018-11-28
[PATCH v1 7/7] xfs: Add tracepoints and logging to alternate device retry · Allison Henderson <hidden> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Darrick J. Wong <hidden> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Darrick J. Wong <hidden> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Andreas Dilger <hidden> · 2018-11-29
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Christoph Hellwig <hch@infradead.org> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Dave Chinner <david@fromorbit.com> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Christoph Hellwig <hch@infradead.org> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Christoph Hellwig <hch@infradead.org> · 2018-11-28
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Bob Liu <hidden> · 2018-12-08
Re: [RFC PATCH v1 0/7] Block/XFS: Support alternative mirror device retry · Darrick J. Wong <hidden> · 2018-12-10

From: Dave Chinner <david@fromorbit.com>
Date: 2018-11-28 16:33:30
Also in: linux-fsdevel, linux-xfs, lkml

On Tue, Nov 27, 2018 at 08:49:44PM -0700, Allison Henderson wrote:

Motivation:
When fs data/metadata checksum mismatch, lower block devices may have other
correct copies. e.g. If XFS successfully reads a metadata buffer off a raid1 but
decides that the metadata is garbage, today it will shut down the entire
filesystem without trying any of the other mirrors.  This is a severe
loss of service, and we propose these patches to have XFS try harder to
avoid failure.

This patch prototype this mirror retry idea by:
* Adding @nr_mirrors to struct request_queue which is similar as
  blk_queue_nonrot(), filesystem can grab device request queue and check max
  mirrors this block device has.
  Helper functions were also added to get/set the nr_mirrors.

* Expanding bi_write_hint to bi_rw_hint, now @bi_rw_hint has three meanings.
 1.Original write_hint.
 2.end_io() will update @bi_rw_hint to reflect which mirror this i/o really happened.
 3.Fs set @bi_rw_hint to force driver e.g raid1 read from a specific mirror.

* Modify md/raid1 to support this retry feature.

* Add b_rw_hint to xfs_buf
  This patch adds a new field b_rw_hint to xfs_buf.  We will use this to set the
  new bio->bi_rw_hint when submitting the read request, and also to store the
  returned mirror when the read compleates

One thing that is going to make this more complex at the XFS layer
is discontiguous buffers. They require multiple IOs (and therefore
bios) and so we are going to need to ensure that all the bios use
the same bi_rw_hint.

This is another reason I suggest that bi_rw_hint has a magic value
for "block layer selects mirror" and separate the initial read from
the retry iterations. That allows us to let he block layer ot pick
whatever leg it wants for the initial read, but if we get a failure
we directly control the mirror we retry from and all bios in the
buffer go to that same mirror.

We're not planning to take over all 16 bits of the read hint field; just looking for
feedback about the sanity of the overall approach.

It seems conceptually simple enough - the biggest questions I have
are:

	- how does propagation through stacked layers work?
	- is it generic/abstract enough to be able to work with
	  RAID5/6 to trigger verification/recovery from the parity
	  information in the stripe?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help