Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal

[RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 02/28] mm: rcu safe VMA freeing · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 01/28] mm: introduce CONFIG_PER_VMA_LOCK · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 03/28] mm: introduce __find_vma to be used without mmap_lock protection · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 03/28] mm: introduce __find_vma to be used without mmap_lock protection · Kent Overstreet <kent.overstreet@linux.dev> · 2022-09-01
Re: [RFC PATCH RESEND 03/28] mm: introduce __find_vma to be used without mmap_lock protection · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Kent Overstreet <kent.overstreet@linux.dev> · 2022-09-01
Re: [RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Liam Howlett <hidden> · 2022-09-01
Re: [RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2022-09-02
Re: [RFC PATCH RESEND 04/28] mm: move mmap_lock assert function definitions · Suren Baghdasaryan <surenb@google.com> · 2022-09-02
[RFC PATCH RESEND 05/28] mm: add per-VMA lock and helper functions to control it · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 05/28] mm: add per-VMA lock and helper functions to control it · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 05/28] mm: add per-VMA lock and helper functions to control it · Suren Baghdasaryan <surenb@google.com> · 2022-09-06
[RFC PATCH RESEND 06/28] mm: mark VMA as locked whenever vma->vm_flags are modified · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 06/28] mm: mark VMA as locked whenever vma->vm_flags are modified · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 06/28] mm: mark VMA as locked whenever vma->vm_flags are modified · Suren Baghdasaryan <surenb@google.com> · 2022-09-06
Re: [RFC PATCH RESEND 06/28] mm: mark VMA as locked whenever vma->vm_flags are modified · Liam Howlett <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 06/28] mm: mark VMA as locked whenever vma->vm_flags are modified · Suren Baghdasaryan <surenb@google.com> · 2022-09-06
[RFC PATCH RESEND 07/28] kernel/fork: mark VMAs as locked before copying pages during fork · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 07/28] kernel/fork: mark VMAs as locked before copying pages during fork · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 07/28] kernel/fork: mark VMAs as locked before copying pages during fork · Suren Baghdasaryan <surenb@google.com> · 2022-09-08
Re: [RFC PATCH RESEND 07/28] kernel/fork: mark VMAs as locked before copying pages during fork · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 07/28] kernel/fork: mark VMAs as locked before copying pages during fork · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 08/28] mm/khugepaged: mark VMA as locked while collapsing a hugepage · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 08/28] mm/khugepaged: mark VMA as locked while collapsing a hugepage · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 08/28] mm/khugepaged: mark VMA as locked while collapsing a hugepage · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 09/28] mm/mempolicy: mark VMA as locked when changing protection policy · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 09/28] mm/mempolicy: mark VMA as locked when changing protection policy · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 09/28] mm/mempolicy: mark VMA as locked when changing protection policy · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 10/28] mm/mmap: mark VMAs as locked in vma_adjust · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 10/28] mm/mmap: mark VMAs as locked in vma_adjust · Laurent Dufour <hidden> · 2022-09-06
Re: [RFC PATCH RESEND 10/28] mm/mmap: mark VMAs as locked in vma_adjust · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
Re: [RFC PATCH RESEND 10/28] mm/mmap: mark VMAs as locked in vma_adjust · Laurent Dufour <hidden> · 2022-09-09
[RFC PATCH RESEND 11/28] mm/mmap: mark VMAs as locked before merging or splitting them · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 11/28] mm/mmap: mark VMAs as locked before merging or splitting them · Laurent Dufour <hidden> · 2022-09-06
[RFC PATCH RESEND 12/28] mm/mremap: mark VMA as locked while remapping it to a new address range · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 12/28] mm/mremap: mark VMA as locked while remapping it to a new address range · Laurent Dufour <hidden> · 2022-09-06
[RFC PATCH RESEND 13/28] mm: conditionally mark VMA as locked in free_pgtables and unmap_page_range · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 13/28] mm: conditionally mark VMA as locked in free_pgtables and unmap_page_range · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 13/28] mm: conditionally mark VMA as locked in free_pgtables and unmap_page_range · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 14/28] mm: mark VMAs as locked before isolating them · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 14/28] mm: mark VMAs as locked before isolating them · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 14/28] mm: mark VMAs as locked before isolating them · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 15/28] mm/mmap: mark adjacent VMAs as locked if they can grow into unmapped area · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 15/28] mm/mmap: mark adjacent VMAs as locked if they can grow into unmapped area · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 15/28] mm/mmap: mark adjacent VMAs as locked if they can grow into unmapped area · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 16/28] kernel/fork: assert no VMA readers during its destruction · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 16/28] kernel/fork: assert no VMA readers during its destruction · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 16/28] kernel/fork: assert no VMA readers during its destruction · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 17/28] mm/mmap: prevent pagefault handler from racing with mmu_notifier registration · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 17/28] mm/mmap: prevent pagefault handler from racing with mmu_notifier registration · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 17/28] mm/mmap: prevent pagefault handler from racing with mmu_notifier registration · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 18/28] mm: add FAULT_FLAG_VMA_LOCK flag · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 18/28] mm: add FAULT_FLAG_VMA_LOCK flag · Laurent Dufour <hidden> · 2022-09-09
[RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Peter Xu <peterx@redhat.com> · 2022-09-06
Re: [RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Suren Baghdasaryan <surenb@google.com> · 2022-09-06
Re: [RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Peter Xu <peterx@redhat.com> · 2022-09-06
Re: [RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Suren Baghdasaryan <surenb@google.com> · 2022-09-07
Re: [RFC PATCH RESEND 19/28] mm: disallow do_swap_page to handle page faults under VMA lock · Laurent Dufour <hidden> · 2022-09-09
[RFC PATCH RESEND 20/28] mm: introduce per-VMA lock statistics · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 20/28] mm: introduce per-VMA lock statistics · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 20/28] mm: introduce per-VMA lock statistics · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 21/28] mm: introduce find_and_lock_anon_vma to be used from arch-specific code · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 21/28] mm: introduce find_and_lock_anon_vma to be used from arch-specific code · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 21/28] mm: introduce find_and_lock_anon_vma to be used from arch-specific code · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
[RFC PATCH RESEND 22/28] x86/mm: try VMA lock-based page fault handling first · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 23/28] x86/mm: define ARCH_SUPPORTS_PER_VMA_LOCK · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 23/28] x86/mm: define ARCH_SUPPORTS_PER_VMA_LOCK · Kent Overstreet <kent.overstreet@linux.dev> · 2022-09-01
Re: [RFC PATCH RESEND 23/28] x86/mm: define ARCH_SUPPORTS_PER_VMA_LOCK · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 24/28] arm64/mm: try VMA lock-based page fault handling first · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 25/28] arm64/mm: define ARCH_SUPPORTS_PER_VMA_LOCK · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 26/28] powerc/mm: try VMA lock-based page fault handling first · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
[RFC PATCH RESEND 28/28] kernel/fork: throttle call_rcu() calls in vm_area_free · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 28/28] kernel/fork: throttle call_rcu() calls in vm_area_free · Laurent Dufour <hidden> · 2022-09-09
Re: [RFC PATCH RESEND 28/28] kernel/fork: throttle call_rcu() calls in vm_area_free · Suren Baghdasaryan <surenb@google.com> · 2022-09-09
Re: [RFC PATCH RESEND 28/28] kernel/fork: throttle call_rcu() calls in vm_area_free · Laurent Dufour <hidden> · 2022-09-09
[RFC PATCH RESEND 27/28] powerpc/mm: define ARCH_SUPPORTS_PER_VMA_LOCK · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Kent Overstreet <kent.overstreet@linux.dev> · 2022-09-01
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-01
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Vlastimil Babka <hidden> · 2022-09-11
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-28
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Vlastimil Babka <hidden> · 2022-09-29
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Peter Zijlstra <peterz@infradead.org> · 2022-09-02
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-02
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Michal Hocko <mhocko@suse.com> · 2022-09-05
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-05
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Kent Overstreet <kent.overstreet@linux.dev> · 2022-09-05
Re: [RFC PATCH RESEND 00/28] per-VMA locks proposal · Suren Baghdasaryan <surenb@google.com> · 2022-09-06

From: Kent Overstreet <kent.overstreet@linux.dev>
Date: 2022-09-01 20:59:28
Also in: linux-arm-kernel, linux-mm, lkml

On Thu, Sep 01, 2022 at 10:34:48AM -0700, Suren Baghdasaryan wrote:

Resending to fix the issue with the In-Reply-To tag in the original
submission at [4].

This is a proof of concept for per-vma locks idea that was discussed
during SPF [1] discussion at LSF/MM this year [2], which concluded with
suggestion that “a reader/writer semaphore could be put into the VMA
itself; that would have the effect of using the VMA as a sort of range
lock. There would still be contention at the VMA level, but it would be an
improvement.” This patchset implements this suggested approach.

When handling page faults we lookup the VMA that contains the faulting
page under RCU protection and try to acquire its lock. If that fails we
fall back to using mmap_lock, similar to how SPF handled this situation.

One notable way the implementation deviates from the proposal is the way
VMAs are marked as locked. Because during some of mm updates multiple
VMAs need to be locked until the end of the update (e.g. vma_merge,
split_vma, etc). Tracking all the locked VMAs, avoiding recursive locks
and other complications would make the code more complex. Therefore we
provide a way to "mark" VMAs as locked and then unmark all locked VMAs
all at once. This is done using two sequence numbers - one in the
vm_area_struct and one in the mm_struct. VMA is considered locked when
these sequence numbers are equal. To mark a VMA as locked we set the
sequence number in vm_area_struct to be equal to the sequence number
in mm_struct. To unlock all VMAs we increment mm_struct's seq number.
This allows for an efficient way to track locked VMAs and to drop the
locks on all VMAs at the end of the update.

I like it - the sequence numbers are a stroke of genuius. For what it's doing
the patchset seems almost small.

Two complaints so far:
 - I don't like the vma_mark_locked() name. To me it says that the caller
   already took or is taking the lock and this function is just marking that
   we're holding the lock, but it's really taking a different type of lock. But
   this function can block, it really is taking a lock, so it should say that.
   
   This is AFAIK a new concept, not sure I'm going to have anything good either,
   but perhaps vma_lock_multiple()?

 - I don't like the #ifdef and the separate fallback path in the fault handlers.

   Can we make find_and_lock_anon_vma() do the right thing, and not fail unless
   e.g. there isn't a vma at that address? Just have it wait for vm_lock_seq to
   change and then retry if needed.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help