Re: [PATCH 2/5] xfs: separate CIL commit record IO

[PATCH 0/5] xfs: various log stuff... · Dave Chinner <david@fromorbit.com> · 2021-01-28
[PATCH 2/5] xfs: separate CIL commit record IO · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 2/5] xfs: separate CIL commit record IO · Brian Foster <hidden> · 2021-01-28
Re: [PATCH 2/5] xfs: separate CIL commit record IO · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 2/5] xfs: separate CIL commit record IO · Chandan Babu R <hidden> · 2021-01-30
Re: [PATCH 2/5] xfs: separate CIL commit record IO · Christoph Hellwig <hch@infradead.org> · 2021-02-01
[PATCH 5/5] xfs: reduce buffer log item shadow allocations · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 5/5] xfs: reduce buffer log item shadow allocations · Brian Foster <hidden> · 2021-01-28
Re: [PATCH 5/5] xfs: reduce buffer log item shadow allocations · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 5/5] xfs: reduce buffer log item shadow allocations · Chandan Babu R <hidden> · 2021-02-02
[PATCH 3/5] xfs: journal IO cache flush reductions · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 3/5] xfs: journal IO cache flush reductions · Brian Foster <hidden> · 2021-01-28
Re: [PATCH 3/5] xfs: journal IO cache flush reductions · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 3/5] xfs: journal IO cache flush reductions · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 3/5] xfs: journal IO cache flush reductions · Chandan Babu R <hidden> · 2021-01-30
[PATCH 1/5] xfs: log stripe roundoff is a property of the log · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 1/5] xfs: log stripe roundoff is a property of the log · Brian Foster <hidden> · 2021-01-28
Re: [PATCH 1/5] xfs: log stripe roundoff is a property of the log · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 1/5] xfs: log stripe roundoff is a property of the log · "Darrick J. Wong" <djwong@kernel.org> · 2021-01-28
Re: [PATCH 1/5] xfs: log stripe roundoff is a property of the log · Dave Chinner <david@fromorbit.com> · 2021-01-28
[PATCH 4/5] xfs: Fix CIL throttle hang when CIL space used going backwards · Dave Chinner <david@fromorbit.com> · 2021-01-28
Re: [PATCH 4/5] xfs: Fix CIL throttle hang when CIL space used going backwards · Brian Foster <hidden> · 2021-01-28
Re: [PATCH 4/5] xfs: Fix CIL throttle hang when CIL space used going backwards · Chandan Babu R <hidden> · 2021-02-02
Re: [PATCH 4/5] xfs: Fix CIL throttle hang when CIL space used going backwards · Paul Menzel <hidden> · 2021-02-17
Re: [PATCH 4/5] xfs: Fix CIL throttle hang when CIL space used going backwards · Donald Buczek <hidden> · 2021-02-17
Re: [PATCH 0/5] xfs: various log stuff... · Christoph Hellwig <hch@infradead.org> · 2021-02-01
Re: [PATCH 0/5] xfs: various log stuff... · Dave Chinner <david@fromorbit.com> · 2021-02-03

From: Christoph Hellwig <hch@infradead.org>
Date: 2021-02-01 13:00:18

On Thu, Jan 28, 2021 at 03:41:51PM +1100, Dave Chinner wrote:

> From: Dave Chinner <redacted>
>
> To allow for iclog IO device cache flush behaviour to be optimised,
> we first need to separate out the commit record iclog IO from the
> rest of the checkpoint so we can wait for the checkpoint IO to
> complete before we issue the commit record.
>
> This separate is only necessary if the commit record is being

s/separate/separation/g

> written into a different iclog to the start of the checkpoint. If
> the entire checkpoint and commit is in the one iclog, then they are
> both covered by the one set of cache flush primitives on the iclog
> and hence there is no need to separate them.
>
> Otherwise, we need to wait for all the previous iclogs to complete
> so they are ordered correctly and made stable by the REQ_PREFLUSH
> that the commit record iclog IO issues. This guarantees that if a
> reader sees the commit record in the journal, they will also see the
> entire checkpoint that commit record closes off.
>
> This also provides the guarantee that when the commit record IO
> completes, we can safely unpin all the log items in the checkpoint
> so they can be written back because the entire checkpoint is stable
> in the journal.

I'm a little worried about the direction for devices without a volatile
write cache, like all high-end enterprise SSDs, arrays and hard drives,
where we now introduce another synchronization point without any gains
from the reduction in FUA/flush traffic, which is a no-op there.
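
To make the trade-off concrete, here is a minimal userspace sketch of the
rule described in the commit message quoted above, plus one reading of the
concern. It is not kernel code: struct ckpt_model, commit_must_wait() and
wait_has_flush_benefit() are hypothetical names made up for illustration,
and the second helper is an assumption about what "without any gains"
means, not something the patch implements.

```c
#include <stdbool.h>

/* Hypothetical model of one checkpoint write; not an XFS data structure. */
struct ckpt_model {
	int start_iclog;	/* iclog holding the start of the checkpoint */
	int commit_iclog;	/* iclog holding the commit record */
};

/*
 * Rule from the commit message: the commit record only has to wait for
 * the earlier iclogs when it lands in a different iclog than the start
 * of the checkpoint.
 */
static bool commit_must_wait(const struct ckpt_model *c)
{
	return c->commit_iclog != c->start_iclog;
}

/*
 * Assumed reading of the concern: the wait is paid whenever the
 * checkpoint spans iclogs, but reduced flush/FUA traffic can only help
 * if the device actually has a volatile write cache.
 */
static bool wait_has_flush_benefit(const struct ckpt_model *c,
				   bool volatile_write_cache)
{
	return commit_must_wait(c) && volatile_write_cache;
}
```

In this model a checkpoint that spans iclogs on a device without a volatile
write cache gets commit_must_wait() == true and wait_has_flush_benefit() ==
false, which is the case the question is about.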
