Re: [PATCH/RFC 00/11] expose btrfs subvols in mount table correctly

From: NeilBrown <hidden>
Date: 2021-07-30 05:58:18
Also in: linux-fsdevel, linux-nfs

On Fri, 30 Jul 2021, Qu Wenruo wrote:

On 2021/7/30 10:36 AM, NeilBrown wrote:

quoted

I've been pondering all the excellent feedback, and what I have learnt
from examining the code in btrfs, and I have developed a different
perspective.

Great! Some new developers into the btrfs realm!

:-)

quoted

Maybe "subvol" is a poor choice of name because it conjures up
connections with the Volumes in LVM, and btrfs subvols are very different
things.  Btrfs subvols are really just subtrees that can be treated as a
unit for operations like "clone" or "destroy".

As such, they don't really deserve separate st_dev numbers.

Maybe the different st_dev numbers were introduced as a "cheap" way to
extend the size of the inode-number space.  Like many "cheap" things, it
has hidden costs.

Maybe objects in different subvols should still be given different inode
numbers.  This would be problematic on 32bit systems, but much less so on
64bit systems.

The patch below, which is just a proof-of-concept, changes btrfs to
report a uniform st_dev, and different (64bit) st_ino in different subvols.

It has problems:
  - it will break any 32bit readdir and 32bit stat.  I don't know how big
    a problem that is these days (ino_t in the kernel is "unsigned long",
    not "unsigned long long").  That surprised me.
  - It might break some user-space expectations.  One thing I have learnt
    is not to make any assumption about what other people might expect.
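
To make the trade-off in this list concrete, here is a minimal sketch of how a uniform st_dev with distinct 64-bit st_ino values could work. The proof-of-concept's actual mapping is not described in this message; the 24/40 bit split and the names `combined_ino` and `truncate_to_32bit` are assumptions made purely for illustration. The sketch also models the 32-bit problem as plain truncation, whereas a real 32-bit interface may report an error instead.

```python
# Hypothetical layout: high bits carry the subvol id, low bits the per-subvol
# inode number. The 24/40 split is an assumption, not what btrfs does.
SUBVOL_BITS = 24
INO_BITS = 40


def combined_ino(subvol_id: int, ino: int) -> int:
    """Pack a subvol id and a per-subvol inode number into one 64-bit value."""
    if not 0 <= subvol_id < (1 << SUBVOL_BITS):
        raise OverflowError("subvol id does not fit in the assumed field")
    if not 0 <= ino < (1 << INO_BITS):
        raise OverflowError("inode number does not fit in the assumed field")
    return (subvol_id << INO_BITS) | ino


def truncate_to_32bit(ino64: int) -> int:
    """Model what is left if only the low 32 bits of the number survive."""
    return ino64 & 0xFFFFFFFF


if __name__ == "__main__":
    # Same per-subvol inode number in two different (made-up) subvols.
    a = combined_ino(256, 257)
    b = combined_ino(257, 257)
    print("distinct as 64-bit values:", a != b)
    print("collide in 32 bits:", truncate_to_32bit(a) == truncate_to_32bit(b))
```

Any fixed split like this limits both fields, which is one reason the real mapping would need more thought than the sketch suggests.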

Wouldn't any filesystem boundary check fail to stop at subvolume boundary?

You mean like "du -x"?? Yes.  You would lose the misleading illusion
that there are multiple filesystems.  That is one user-expectation that
would need to be addressed before people opt in.
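
For reference, a boundary check of the "du -x" kind can be sketched as follows: remember the st_dev of the starting directory and refuse to descend into anything that reports a different one. This is an illustrative sketch, not the coreutils implementation; `usage_one_filesystem` is a made-up name, and it sums apparent file sizes rather than disk blocks. With a uniform st_dev across subvols, the comparison would never trigger inside a single btrfs filesystem.

```python
import os


def usage_one_filesystem(root: str) -> int:
    """Sum apparent file sizes under root without crossing to another st_dev."""
    root_dev = os.lstat(root).st_dev
    total = 0
    stack = [root]
    while stack:
        directory = stack.pop()
        with os.scandir(directory) as entries:
            for entry in entries:
                info = entry.stat(follow_symlinks=False)
                if info.st_dev != root_dev:
                    # Different device number: treat it as another filesystem
                    # and skip it, which is the boundary check in question.
                    continue
                if entry.is_dir(follow_symlinks=False):
                    stack.append(entry.path)
                else:
                    total += info.st_size
    return total


if __name__ == "__main__":
    print(usage_one_filesystem("."))
```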

Then it will go through the full btrfs subvolumes/snapshots, which can
be super slow.

quoted

However, it would be quite easy to make this opt-in (or opt-out) with a
mount option, so that people who need the current inode numbers and will
accept the current breakage can keep working.

I think this approach would be a net-win for NFS export, whether BTRFS
supports it directly or not.  I might post a patch which modifies NFS to
intuit improved inode numbers for btrfs exports....

Some extra ideas, but not familiar with VFS enough to be sure.

Can we generate "fake" superblock for each subvolume?

I don't see how that would help.  Either subvols are like filesystems
and appear in /proc/mounts, or they aren't like filesystems and don't
get different st_dev.  Either of these outcomes can be achieved without
fake superblocks.  If you really need BTRFS subvols to have some
properties of filesystems but not all, then you are in for a whole world
of pain.

Maybe btrfs subvols should be treated more like XFS "managed trees".  At
least there you have precedent and someone else to share the pain.
Maybe we should train people to use "quota" to check the usage of a
subvol, rather than "du" (which will stop working with my patch if it
contains refs to other subvols) or "df" (which already doesn't work), or
"btrfs df".

Like using the subvolume UUID to replace the FSID of each subvolume.
Could that mitigate the problem?

Which problem, exactly?  My first approach to making subvols work on NFS
took essentially that approach.  It was seen (quite reasonably) as a
hack to work around poor behaviour in btrfs.

Given that NFS has always seen all of a btrfs filesystem as having a
uniform fsid, I'm now of the opinion that we don't want to change that,
but should just fix the duplicate-inode-number problem.
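
The duplicate-inode-number problem can be illustrated with a small sketch. Many tools identify a file by its (st_dev, st_ino) pair; the analogous pair for an NFS client would be (fsid, inode number). The records and the helper name `count_distinct` below are invented for illustration: two different files in two subvols share inode number 257, so under one uniform fsid they become indistinguishable.

```python
from typing import Iterable, NamedTuple


class FileRecord(NamedTuple):
    subvol_id: int  # which subvol the file lives in (made-up values)
    ino: int        # inode number, unique only within that subvol
    path: str


def count_distinct(records: Iterable[FileRecord], per_subvol_dev: bool) -> int:
    """Count how many files look distinct when keyed by (device, inode)."""
    seen = set()
    for rec in records:
        # With per_subvol_dev the subvol id stands in for a separate st_dev;
        # without it every file reports the same device/fsid (0 here).
        dev = rec.subvol_id if per_subvol_dev else 0
        seen.add((dev, rec.ino))
    return len(seen)


if __name__ == "__main__":
    records = [
        FileRecord(256, 257, "/data/a"),
        FileRecord(257, 257, "/data/snap/a"),
        FileRecord(256, 258, "/data/b"),
    ]
    print(count_distinct(records, per_subvol_dev=True))
    print(count_distinct(records, per_subvol_dev=False))
```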

If I could think of some way for NFSD to see different inode numbers
than VFS, I would push hard for fixing nfsd by giving it more sane inode
numbers.

Thanks,
NeilBrown
