Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped... | linux-mm

[PATCH v3 00/27] userfaultfd-wp: Support shmem and hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 01/27] mm/shmem: Unconditionally set pte dirty in mfill_atomic_install_pte · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 02/27] shmem/userfaultfd: Take care of UFFDIO_COPY_MODE_WP · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 03/27] mm: Clear vmf->pte after pte_unmap_same() returns · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-05-28
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Peter Xu <peterx@redhat.com> · 2021-05-28
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-06-03
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Peter Xu <peterx@redhat.com> · 2021-06-03
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-06-04
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Hugh Dickins <hughd@google.com> · 2021-06-04
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-06-04
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Peter Xu <peterx@redhat.com> · 2021-06-04
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-06-08
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Alistair Popple <apopple@nvidia.com> · 2021-06-09
Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem · Peter Xu <peterx@redhat.com> · 2021-06-09
[PATCH v3 06/27] shmem/userfaultfd: Handle uffd-wp special pte in page fault handler · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 06/27] shmem/userfaultfd: Handle uffd-wp special pte in page fault handler · Alistair Popple <apopple@nvidia.com> · 2021-06-17
Re: [PATCH v3 06/27] shmem/userfaultfd: Handle uffd-wp special pte in page fault handler · Peter Xu <peterx@redhat.com> · 2021-06-17
[PATCH v3 05/27] mm/swap: Introduce the idea of special swap ptes · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 07/27] mm: Drop first_index/last_index in zap_details · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 07/27] mm: Drop first_index/last_index in zap_details · Alistair Popple <apopple@nvidia.com> · 2021-06-21
[PATCH v3 08/27] mm: Introduce zap_details.zap_flags · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 08/27] mm: Introduce zap_details.zap_flags · Alistair Popple <apopple@nvidia.com> · 2021-06-21
Re: [PATCH v3 08/27] mm: Introduce zap_details.zap_flags · Peter Xu <peterx@redhat.com> · 2021-06-21
Re: [PATCH v3 08/27] mm: Introduce zap_details.zap_flags · Alistair Popple <apopple@nvidia.com> · 2021-06-22
[PATCH v3 09/27] mm: Introduce ZAP_FLAG_SKIP_SWAP · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 09/27] mm: Introduce ZAP_FLAG_SKIP_SWAP · Alistair Popple <apopple@nvidia.com> · 2021-06-21
Re: [PATCH v3 09/27] mm: Introduce ZAP_FLAG_SKIP_SWAP · Peter Xu <peterx@redhat.com> · 2021-06-21
Re: [PATCH v3 09/27] mm: Introduce ZAP_FLAG_SKIP_SWAP · Alistair Popple <apopple@nvidia.com> · 2021-06-22
[PATCH v3 10/27] mm: Pass zap_flags into unmap_mapping_pages() · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Alistair Popple <apopple@nvidia.com> · 2021-06-21
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Peter Xu <peterx@redhat.com> · 2021-06-22
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Alistair Popple <apopple@nvidia.com> · 2021-06-22
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Peter Xu <peterx@redhat.com> · 2021-06-22
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Alistair Popple <apopple@nvidia.com> · 2021-06-23
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Peter Xu <peterx@redhat.com> · 2021-06-23
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Alistair Popple <apopple@nvidia.com> · 2021-07-06
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Peter Xu <peterx@redhat.com> · 2021-07-06
Re: [PATCH v3 11/27] shmem/userfaultfd: Persist uffd-wp bit across zapping for file-backed · Alistair Popple <apopple@nvidia.com> · 2021-07-08
[PATCH v3 12/27] shmem/userfaultfd: Allow wr-protect none pte for file-backed mem · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 13/27] shmem/userfaultfd: Allows file-back mem to be uffd wr-protected on thps · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 14/27] shmem/userfaultfd: Handle the left-overed special swap ptes · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 15/27] shmem/userfaultfd: Pass over uffd-wp special swap pte when fork() · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 16/27] mm/hugetlb: Drop __unmap_hugepage_range definition from hugetlb.h · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 17/27] mm/hugetlb: Introduce huge pte version of uffd-wp helpers · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 18/27] hugetlb/userfaultfd: Hook page faults for uffd write protection · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 19/27] hugetlb/userfaultfd: Take care of UFFDIO_COPY_MODE_WP · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 20/27] hugetlb/userfaultfd: Handle UFFDIO_WRITEPROTECT · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 21/27] mm/hugetlb: Introduce huge version of special swap pte helpers · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 22/27] hugetlb/userfaultfd: Handle uffd-wp special pte in hugetlb pf handler · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 23/27] hugetlb/userfaultfd: Allow wr-protect none ptes · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 24/27] hugetlb/userfaultfd: Only drop uffd-wp special pte if required · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 25/27] mm/pagemap: Recognize uffd-wp bit for shmem/hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 26/27] mm/userfaultfd: Enable write protection for shmem & hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-05-27
[PATCH v3 27/27] userfaultfd/selftests: Enable uffd-wp for shmem/hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-05-27
Re: [PATCH v3 00/27] userfaultfd-wp: Support shmem and hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-06-02
Re: [PATCH v3 00/27] userfaultfd-wp: Support shmem and hugetlbfs · Andrew Morton <akpm@linux-foundation.org> · 2021-06-02
Re: [PATCH v3 00/27] userfaultfd-wp: Support shmem and hugetlbfs · Peter Xu <peterx@redhat.com> · 2021-06-03

Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem

From: Alistair Popple <apopple@nvidia.com>
Date: 2021-06-04 00:55:35
Also in: lkml

On Friday, 4 June 2021 12:51:19 AM AEST Peter Xu wrote:

External email: Use caution opening links or attachments

On Thu, Jun 03, 2021 at 09:53:45PM +1000, Alistair Popple wrote:

quoted

On Friday, 28 May 2021 10:56:02 PM AEST Peter Xu wrote:

quoted

On Fri, May 28, 2021 at 06:32:52PM +1000, Alistair Popple wrote:

quoted

On Friday, 28 May 2021 6:19:04 AM AEST Peter Xu wrote:

quoted

This patch introduces a very special swap-like pte for file-backed
memories.

Currently it's only defined for x86_64 only, but as long as any arch
that
can properly define the UFFD_WP_SWP_PTE_SPECIAL value as requested,
it
should conceptually work too.

We will use this special pte to arm the ptes that got either
unmapped or
swapped out for a file-backed region that was previously
wr-protected.
This special pte could trigger a page fault just like swap entries,
and
as long as the page fault will satisfy pte_none()==false &&
pte_present()==false.

Then we can revive the special pte into a normal pte backed by the
page
cache.

This idea is greatly inspired by Hugh and Andrea in the discussion,
which is referenced in the links below.

The other idea (from Hugh) is that we use swp_type==1 and
swp_offset=0
as
the special pte.  The current solution (as pointed out by Andrea) is
slightly preferred in that we don't even need swp_entry_t knowledge
at
all
in trapping these accesses.  Meanwhile, we also reuse
_PAGE_SWP_UFFD_WP
from the anonymous swp entries.

So to confirm my understanding the reason you use this special swap
pte
instead of a new swp_type is that you only need the fault and have no
extra
information that needs storing in the pte?

Yes.

quoted

Personally I think it might be better to define a new swp_type for
this
rather than introducing a new arch-specific concept.

The concept should not be arch-specific, it's the pte that's
arch-specific.

Right, agree this is a minor detail.

I can't say it's a minor detail, as that's still indeed one of the major
ideas that I'd like to get comment for within the whole series.  It's
currently an outcome from previous discussion with Andrea and Hugh, but of
course if there's better idea with reasoning I can always consider to
rework the series.

Sorry, I wasn't very clear there. What I meant is the high level arch-
independent concept of using a special swap pte for this is the most important 
aspect of the design and looks good to me.

The detail which is perhaps less important is whether to implement this using 
a new swap entry type or arch-specific swap bit. The argument for using a swap 
type is it will work across architectures due to the use of pte_to_swp_entry() 
and swp_entry_to_pte() to convert to and from the arch-dependent and 
independent representations.

The argument against seems to have been that it is wasting a swap type. 
However if I'm understanding correctly that's not true for all architectures, 
and needing to reserve a bit is more wasteful than using a swap type. For 
example ARM encodes swap entries like so:

 * Encode and decode a swap entry.  Swap entries are stored in the Linux
 * page tables as follows:
 *
 *   3 3 2 2 2 2 2 2 2 2 2 2 1 1 1 1 1 1 1 1 1 1
 *   1 0 9 8 7 6 5 4 3 2 1 0 9 8 7 6 5 4 3 2 1 0 9 8 7 6 5 4 3 2 1 0
 *   <--------------- offset ------------------------> < type -> 0 0

So the only way to get a spare bit is to reduce the width of type (or offset) 
which would halve the number of swap types. And if I understand correctly the 
same argument might apply to x86 - the spare bit being used here could instead 
be used to expand the width of type if a lack of available swap types is a 
concern.

quoted

swp_type entries are portable so wouldn't need extra arch-specific
bits
defined. And as I understand things not all architectures (eg. ARM)
have
spare bits in their swap entry encoding anyway so would have to
reserve a
bit specifically for this which would be less efficient than using a
swp_type.

It looks a trade-off to me: I think it's fine to use swap type in my
series, as you said it's portable, but it will also waste the swap
address space for the arch when the arch enables it.

The format of the special pte to trigger the fault in this series should
be
only a small portion of the code change.  The main logic should still be
the same - we just replace this pte with that one.  IMHO it also means
the format can be changed in the future, it's just that I don't know
whether it's wise to take over a new swap type from start.

quoted

Anyway it seems I missed the initial discussion so don't have a strong
opinion here, mainly just wanted to check my understanding of what's
required and how these special entries work.

Thanks for mentioning this and join the discussion. I don't know ARM
enough
so good to know we may have issue on finding the bits.  Actually before
finding this bit for file-backed uffd-wp specifically, we need to
firstly
find a bit in the normal pte for ARM too anyways (see _PAGE_UFFD_WP). 
If
there's no strong reason to switch to a new swap type, I'd tend to leave
all these to the future when we enable them on ARM.

Yeah, makes sense to me. As you say it should be easy to change and other
architectures need to find another bit anyway. Not sure how useful it will
be but I'll try and take a look over the rest of the series as well.

I'll highly appreciate that.  Thanks Alistair!

--
Peter Xu

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help