Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline()

[PATCH v10 00/14] btrfs: add ioctls and send/receive support for reading/writing compressed data · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 01/14] fs: export rw_verify_area() · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 02/14] fs: export variant of generic_write_checks without iov_iter · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 02/14] fs: export variant of generic_write_checks without iov_iter · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 02/14] fs: export variant of generic_write_checks without iov_iter · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 03/14] btrfs: don't advance offset for compressed bios in btrfs_csum_one_bio() · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 03/14] btrfs: don't advance offset for compressed bios in btrfs_csum_one_bio() · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 03/14] btrfs: don't advance offset for compressed bios in btrfs_csum_one_bio() · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 04/14] btrfs: add ram_bytes and offset to btrfs_ordered_extent · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 04/14] btrfs: add ram_bytes and offset to btrfs_ordered_extent · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 04/14] btrfs: add ram_bytes and offset to btrfs_ordered_extent · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 05/14] btrfs: support different disk extent size for delalloc · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Qu Wenruo <hidden> · 2021-08-20
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Omar Sandoval <osandov@osandov.com> · 2021-08-20
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Qu Wenruo <hidden> · 2021-08-21
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Omar Sandoval <osandov@osandov.com> · 2021-08-23
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Qu Wenruo <hidden> · 2021-08-23
Re: [PATCH v10 06/14] btrfs: optionally extend i_size in cow_file_range_inline() · Omar Sandoval <osandov@osandov.com> · 2021-08-23
[PATCH v10 07/14] btrfs: add definitions + documentation for encoded I/O ioctls · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 07/14] btrfs: add definitions + documentation for encoded I/O ioctls · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 07/14] btrfs: add definitions + documentation for encoded I/O ioctls · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 08/14] btrfs: add BTRFS_IOC_ENCODED_READ · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 08/14] btrfs: add BTRFS_IOC_ENCODED_READ · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 08/14] btrfs: add BTRFS_IOC_ENCODED_READ · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 11/14] btrfs: send: write larger chunks when using stream v2 · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 09/14] btrfs: add BTRFS_IOC_ENCODED_WRITE · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 09/14] btrfs: add BTRFS_IOC_ENCODED_WRITE · Nikolay Borisov <hidden> · 2021-08-20
Re: [PATCH v10 09/14] btrfs: add BTRFS_IOC_ENCODED_WRITE · Omar Sandoval <osandov@osandov.com> · 2021-08-20
[PATCH v10 12/14] btrfs: send: allocate send buffer with alloc_page() and vmap() for v2 · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 10/14] btrfs: add send stream v2 definitions · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 14/14] btrfs: send: enable support for stream v2 and compressed writes · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 13/14] btrfs: send: send compressed extents with encoded writes · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 02/10] btrfs-progs: receive: dynamically allocate sctx->read_buf · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 01/10] btrfs-progs: receive: support v2 send stream larger tlv_len · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 03/10] btrfs-progs: receive: support v2 send stream DATA tlv format · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 04/10] btrfs-progs: receive: add send stream v2 cmds and attrs to send.h · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 05/10] btrfs-progs: receive: process encoded_write commands · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 07/10] btrfs-progs: receive: process fallocate commands · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 08/10] btrfs-progs: receive: process setflags ioctl commands · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 06/10] btrfs-progs: receive: encoded_write fallback to explicit decode and write · Omar Sandoval <osandov@osandov.com> · 2021-08-17
Re: [PATCH v10 06/10] btrfs-progs: receive: encoded_write fallback to explicit decode and write · Omar Sandoval <osandov@osandov.com> · 2021-08-18
[PATCH v10 09/10] btrfs-progs: send: stream v2 ioctl flags · Omar Sandoval <osandov@osandov.com> · 2021-08-17
[PATCH v10 10/10] btrfs-progs: receive: add tests for basic encoded_write send/receive · Omar Sandoval <osandov@osandov.com> · 2021-08-17

From: Omar Sandoval <osandov@osandov.com>
Date: 2021-08-23 18:16:54
Also in: linux-btrfs, linux-fsdevel

On Sat, Aug 21, 2021 at 09:11:26AM +0800, Qu Wenruo wrote:


On 2021/8/21 上午2:11, Omar Sandoval wrote:

quoted

On Fri, Aug 20, 2021 at 05:13:34PM +0800, Qu Wenruo wrote:

quoted


On 2021/8/20 下午4:51, Nikolay Borisov wrote:

quoted


On 18.08.21 г. 0:06, Omar Sandoval wrote:

quoted

From: Omar Sandoval <redacted>

Currently, an inline extent is always created after i_size is extended
from btrfs_dirty_pages(). However, for encoded writes, we only want to
update i_size after we successfully created the inline extent.

To me, the idea of write first then update isize is just going to cause
tons of inline extent related prblems.

The current example is falloc, which only update the isize after the
falloc finishes.

This behavior has already bothered me quite a lot, as it can easily
create mixed inline and regular extents.

Do you have an example of how this would happen? I have the inode and
extent bits locked during an encoded write, and I see that fallocate
does the same.

xfs_io -f -c "pwrite 0 1K" -c "sync" -c "falloc 0 4k" -c "pwrite 4k 4k"

The [0, 1K) will be written as inline without doubt.

Then we go to falloc, it will try to zero the range [1K, 4K), but it
doesn't increase the isize.
Thus the page [0, 4k) will still be written back as inline, since isize
is still 1K.

Later [4K, 8K) will be written back as regular, causing mixed extents.

I'll have to read fallocate more closely to follow what's going on here
and figure out if it applies to encoded writes. Please help me out if
you see how this would be an issue with encoded writes.

quoted

Can't we remember the old isize (with proper locking), enlarge isize
(with holes filled), do the write.

If something wrong happened, we truncate the isize back to its old isize.

[...]

quoted

Urgh, just some days ago Qu was talking about how awkward it is to have
mixed extents in a file. And now, AFAIU, you are making them more likely
since now they can be created not just at the beginning of the file but
also after i_size write. While this won't be a problem in and of itself
it goes just the opposite way of us trying to shrink the possible cases
when we can have mixed extents.

Tree-checker should reject such inline extent at non-zero offset.

This change does not allow creating inline extents at a non-zero offset.

quoted

Qu what is your take on that?

My question is, why encoded write needs to bother the inline extents at all?

My intuition of such encoded write is, it should not create inline
extents at all.

Or is there any special use-case involved for encoded write?

We create compressed inline extents with normal writes. We should be
able to send and receive them without converting them into regular
extents.

But my first impression for any encoded write is that, they should work
like DIO, thus everything should be sectorsize aligned.

Then why could they create inline extent? As inline extent can only be
possible when the isize is smaller than sectorsize.

ENCODED_WRITE is not defined as "O_DIRECT, but encoded". It happens to
have some resemblance to O_DIRECT because we have alignment requirements
for new extents and because we bypass the page cache, but there's no
reason to copy arbitrary restrictions from O_DIRECT. If someone is using
ENCODED_WRITE to write compressed data, then they care about space
efficiency, so we should make efficient use of inline extents.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help