Thread (60 messages) 60 messages, 4 authors, 2021-07-08

Re: [PATCH v3 04/27] mm/userfaultfd: Introduce special pte for unmapped file-backed mem

From: Peter Xu <peterx@redhat.com>
Date: 2021-05-28 12:56:13
Also in: lkml

On Fri, May 28, 2021 at 06:32:52PM +1000, Alistair Popple wrote:
On Friday, 28 May 2021 6:19:04 AM AEST Peter Xu wrote:
quoted
This patch introduces a very special swap-like pte for file-backed memories.

Currently it's only defined for x86_64 only, but as long as any arch that
can properly define the UFFD_WP_SWP_PTE_SPECIAL value as requested, it
should conceptually work too.

We will use this special pte to arm the ptes that got either unmapped or
swapped out for a file-backed region that was previously wr-protected.  This
special pte could trigger a page fault just like swap entries, and as long
as the page fault will satisfy pte_none()==false && pte_present()==false.

Then we can revive the special pte into a normal pte backed by the page
cache.

This idea is greatly inspired by Hugh and Andrea in the discussion, which is
referenced in the links below.

The other idea (from Hugh) is that we use swp_type==1 and swp_offset=0 as
the special pte.  The current solution (as pointed out by Andrea) is
slightly preferred in that we don't even need swp_entry_t knowledge at all
in trapping these accesses.  Meanwhile, we also reuse _PAGE_SWP_UFFD_WP
from the anonymous swp entries.
So to confirm my understanding the reason you use this special swap pte 
instead of a new swp_type is that you only need the fault and have no extra 
information that needs storing in the pte?
Yes.
Personally I think it might be better to define a new swp_type for this rather 
than introducing a new arch-specific concept.
The concept should not be arch-specific, it's the pte that's arch-specific.
swp_type entries are portable so wouldn't need extra arch-specific bits
defined. And as I understand things not all architectures (eg. ARM) have
spare bits in their swap entry encoding anyway so would have to reserve a bit
specifically for this which would be less efficient than using a swp_type.
It looks a trade-off to me: I think it's fine to use swap type in my series, as
you said it's portable, but it will also waste the swap address space for the
arch when the arch enables it.

The format of the special pte to trigger the fault in this series should be
only a small portion of the code change.  The main logic should still be the
same - we just replace this pte with that one.  IMHO it also means the format
can be changed in the future, it's just that I don't know whether it's wise to
take over a new swap type from start.
Anyway it seems I missed the initial discussion so don't have a strong opinion 
here, mainly just wanted to check my understanding of what's required and how 
these special entries work.
Thanks for mentioning this and join the discussion. I don't know ARM enough so
good to know we may have issue on finding the bits.  Actually before finding
this bit for file-backed uffd-wp specifically, we need to firstly find a bit in
the normal pte for ARM too anyways (see _PAGE_UFFD_WP).  If there's no strong
reason to switch to a new swap type, I'd tend to leave all these to the future
when we enable them on ARM.

-- 
Peter Xu

Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help