Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it

[PATCH v4 00/27] fs: introduce new writeback error reporting and convert existing API as a wrapper around it · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 01/27] fs: remove unneeded forward definition of mm_struct from fs.h · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 01/27] fs: remove unneeded forward definition of mm_struct from fs.h · Jan Kara <jack@suse.cz> · 2017-05-10
[PATCH v4 02/27] mm: drop "wait" parameter from write_one_page · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 03/27] mm: fix mapping_set_error call in me_pagecache_dirty · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 04/27] buffer: use mapping_set_error instead of setting the flag · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 05/27] btrfs: btrfs_wait_tree_block_writeback can be void return · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 05/27] btrfs: btrfs_wait_tree_block_writeback can be void return · Jan Kara <jack@suse.cz> · 2017-05-10
Re: [PATCH v4 05/27] btrfs: btrfs_wait_tree_block_writeback can be void return · Liu Bo <hidden> · 2017-05-19
[PATCH v4 06/27] fs: check for writeback errors after syncing out buffers in generic_file_fsync · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 06/27] fs: check for writeback errors after syncing out buffers in generic_file_fsync · Matthew Wilcox <willy@infradead.org> · 2017-05-10
[PATCH v4 07/27] orangefs: don't call filemap_write_and_wait from fsync · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 08/27] dax: set errors in mapping when writeback fails · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 09/27] nilfs2: set the mapping error when calling SetPageError on writeback · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 10/27] 9p: set mapping error when writeback fails in launder_page · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 11/27] fuse: set mapping error in writepage_locked when it fails · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 11/27] fuse: set mapping error in writepage_locked when it fails · Jan Kara <jack@suse.cz> · 2017-05-10
[PATCH v4 12/27] cifs: set mapping error when page writeback fails in writepage or launder_pages · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 12/27] cifs: set mapping error when page writeback fails in writepage or launder_pages · Jan Kara <jack@suse.cz> · 2017-05-10
[PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · NeilBrown <neil@brown.name> · 2017-05-09
Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · Jeff Layton <hidden> · 2017-05-10
Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · Jan Kara <jack@suse.cz> · 2017-05-10
Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · Matthew Wilcox <willy@infradead.org> · 2017-05-10
Re: [PATCH v4 13/27] lib: add errseq_t type and infrastructure for handling it · Jeff Layton <hidden> · 2017-05-10
[PATCH v4 14/27] fs: new infrastructure for writeback error handling and reporting · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 14/27] fs: new infrastructure for writeback error handling and reporting · Jan Kara <jack@suse.cz> · 2017-05-10
Re: [PATCH v4 14/27] fs: new infrastructure for writeback error handling and reporting · Jeff Layton <hidden> · 2017-05-10
Re: [PATCH v4 14/27] fs: new infrastructure for writeback error handling and reporting · Jan Kara <jack@suse.cz> · 2017-05-10
[PATCH v4 15/27] fs: retrofit old error reporting API onto new infrastructure · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 15/27] fs: retrofit old error reporting API onto new infrastructure · Jan Kara <jack@suse.cz> · 2017-05-15
Re: [PATCH v4 15/27] fs: retrofit old error reporting API onto new infrastructure · Jeff Layton <hidden> · 2017-05-15
[PATCH v4 16/27] fs: adapt sync_file_range to new reporting infrastructure · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 17/27] mm: remove AS_EIO and AS_ENOSPC flags · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 18/27] mm: don't TestClearPageError in __filemap_fdatawait_range · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 19/27] buffer: set errors in mapping at the time that the error occurs · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 19/27] buffer: set errors in mapping at the time that the error occurs · Jan Kara <jack@suse.cz> · 2017-05-15
[PATCH v4 20/27] cifs: cleanup writeback handling errors and comments · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 21/27] mm: clean up error handling in write_one_page · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 21/27] mm: clean up error handling in write_one_page · Jan Kara <jack@suse.cz> · 2017-05-15
[PATCH v4 22/27] jbd2: don't reset error in journal_finish_inode_data_buffers · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 22/27] jbd2: don't reset error in journal_finish_inode_data_buffers · Jan Kara <jack@suse.cz> · 2017-05-15
[PATCH v4 23/27] gfs2: clean up some filemap_* calls · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 23/27] gfs2: clean up some filemap_* calls · Bob Peterson <hidden> · 2017-05-10
[PATCH v4 24/27][RFC] nfs: convert to new errseq_t based error tracking for writeback errors · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 25/27] Documentation: flesh out the section in vfs.txt on storing and reporting writeback errors · Jeff Layton <hidden> · 2017-05-09
Re: [PATCH v4 25/27] Documentation: flesh out the section in vfs.txt on storing and reporting writeback errors · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 26/27] mm: flesh out comments over mapping_set_error · Jeff Layton <hidden> · 2017-05-09
[PATCH v4 27/27] mm: clean up comments in me_pagecache_dirty · Jeff Layton <hidden> · 2017-05-09

From: Jan Kara <jack@suse.cz>
Date: 2017-05-10 11:34:30
Also in: linux-btrfs, linux-cifs, linux-ext4, linux-f2fs-devel, linux-fsdevel, linux-mm, linux-nfs, linux-xfs, lkml

On Tue 09-05-17 11:49:16, Jeff Layton wrote:

An errseq_t is a way of recording errors in one place, and allowing any
number of "subscribers" to tell whether an error has been set again
since a previous time.

It's implemented as an unsigned 32-bit value that is managed with atomic
operations. The low order bits are designated to hold an error code
(max size of MAX_ERRNO). The upper bits are used as a counter.

The API works with consumers sampling an errseq_t value at a particular
point in time. Later, that value can be used to tell whether new errors
have been set since that time.

Note that there is a 1 in 512k risk of collisions here if new errors
are being recorded frequently, since we have so few bits to use as a
counter. To mitigate this, one bit is used as a flag to tell whether the
value has been sampled since a new value was recorded. That allows
us to avoid bumping the counter if no one has sampled it since it
was last bumped.

Later patches will build on this infrastructure to change how writeback
errors are tracked in the kernel.

Signed-off-by: Jeff Layton <redacted>

The patch looks good to me. Feel free to add:

Reviewed-by: Jan Kara <jack@suse.cz>

Just two nits below:
...

+int errseq_check_and_advance(errseq_t *eseq, errseq_t *since)
+{
+	int err = 0;
+	errseq_t old, new;
+
+	/*
+	 * Most callers will want to use the inline wrapper to check this,
+	 * so that the common case of no error is handled without needing
+	 * to lock.
+	 */

I'm not sure which locking you are speaking about here. Is the comment
stale?

+	old = READ_ONCE(*eseq);
+	if (old != *since) {
+		/*
+		 * Set the flag and try to swap it into place if it has
+		 * changed.
+		 *
+		 * We don't care about the outcome of the swap here. If the
+		 * swap doesn't occur, then it has either been updated by a
+		 * writer who is bumping the seq count anyway, or another

"bumping the seq count anyway" part is not quite true. Writer may see
ERRSEQ_SEEN not set and so just update the error code and leave seq count
as is. But since you compare full errseq_t for equality, this works out as
designed...

+		 * reader who is just setting the "seen" flag. Either outcome
+		 * is OK, and we can advance "since" and return an error based
+		 * on what we have.
+		 */
+		new = old | ERRSEQ_SEEN;
+		if (new != old)
+			cmpxchg(eseq, old, new);
+		*since = new;
+		err = -(new & MAX_ERRNO);
+	}
+	return err;
+}
+EXPORT_SYMBOL(errseq_check_and_advance);

								Honza
-- 
Jan Kara [off-list ref]
SUSE Labs, CR

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help