[PATCH v2 05/20] mm: Protect VMA modifications using VMA sequence count

[PATCH v2 00/20] Speculative page faults · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 01/20] mm: Dont assume page-table invariance during faults · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 03/20] mm: Introduce pte_spinlock for FAULT_FLAG_SPECULATIVE · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 04/20] mm: VMA sequence count · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 05/20] mm: Protect VMA modifications using VMA sequence count · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 07/20] mm: Cache some VMA fields in the vm_fault structure · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 11/20] mm: Introduce __maybe_mkwrite() · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 13/20] mm: Introduce __page_add_new_anon_rmap() · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 15/20] mm: Try spin lock in speculative path · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 16/20] mm: Adding speculative page fault failure trace events · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 18/20] perf tools: Add support for the SPF perf event · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 18/20] perf tools: Add support for the SPF perf event · Anshuman Khandual <hidden> · 2017-08-21
Re: [PATCH v2 18/20] perf tools: Add support for the SPF perf event · Laurent Dufour <hidden> · 2017-08-25
[PATCH v2 20/20] powerpc/mm: Add speculative page fault · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 20/20] powerpc/mm: Add speculative page fault · Anshuman Khandual <hidden> · 2017-08-21
Re: [PATCH v2 20/20] powerpc/mm: Add speculative page fault · Laurent Dufour <hidden> · 2017-08-29
[PATCH v2 19/20] x86/mm: Add speculative pagefault handling · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 19/20] x86/mm: Add speculative pagefault handling · Anshuman Khandual <hidden> · 2017-08-21
Re: [PATCH v2 19/20] x86/mm: Add speculative pagefault handling · Laurent Dufour <hidden> · 2017-08-29
Re: [PATCH v2 19/20] x86/mm: Add speculative pagefault handling · Laurent Dufour <hidden> · 2017-08-29
[PATCH v2 17/20] perf: Add a speculative page fault sw event · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 17/20] perf: Add a speculative page fault sw event · Anshuman Khandual <hidden> · 2017-08-21
Re: [PATCH v2 17/20] perf: Add a speculative page fault sw event · Michael Ellerman <mpe@ellerman.id.au> · 2017-08-22
[PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Sergey Senozhatsky <hidden> · 2017-08-20
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-25
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Kirill A. Shutemov <hidden> · 2017-08-27
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-28
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Benjamin Herrenschmidt <benh@kernel.crashing.org> · 2017-08-28
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Andi Kleen <hidden> · 2017-08-28
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Benjamin Herrenschmidt <benh@kernel.crashing.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-29
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Anshuman Khandual <hidden> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Anshuman Khandual <hidden> · 2017-08-31
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Peter Zijlstra <peterz@infradead.org> · 2017-08-31
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Anshuman Khandual <hidden> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Anshuman Khandual <hidden> · 2017-08-30
Re: [PATCH v2 14/20] mm: Provide speculative fault infrastructure · Laurent Dufour <hidden> · 2017-08-30
[PATCH v2 12/20] mm: Introduce __vm_normal_page() · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 10/20] mm: Introduce __lru_cache_add_active_or_unevictable · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 09/20] mm/migrate: Pass vm_fault pointer to migrate_misplaced_page() · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 08/20] mm: Protect SPF handler against anon_vma changes · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 06/20] mm: RCU free VMAs · Laurent Dufour <hidden> · 2017-08-17
[PATCH v2 02/20] mm: Prepare for FAULT_FLAG_SPECULATIVE · Laurent Dufour <hidden> · 2017-08-17
Re: [PATCH v2 00/20] Speculative page faults · Sergey Senozhatsky <hidden> · 2017-08-21
Re: [PATCH v2 00/20] Speculative page faults · Laurent Dufour <hidden> · 2017-09-08
Re: [PATCH v2 00/20] Speculative page faults · Sergey Senozhatsky <hidden> · 2017-09-11
Re: [PATCH v2 00/20] Speculative page faults · Laurent Dufour <hidden> · 2017-09-11
Re: [PATCH v2 00/20] Speculative page faults · Anshuman Khandual <hidden> · 2017-08-21
Re: [PATCH v2 00/20] Speculative page faults · Paul E. McKenney <hidden> · 2017-08-22
Re: [PATCH v2 00/20] Speculative page faults · Laurent Dufour <hidden> · 2017-08-25

From: Laurent Dufour <hidden>
Date: 2017-08-17 22:05:58
Also in: linux-mm, linuxppc-dev
Subsystem: filesystems (vfs and infrastructure), memory management, memory management - memory policy and migration, memory management - thp (transparent huge page), memory management - userfaultfd, memory mapping, memory mapping - madvise (memory advice), proc filesystem, the rest · Maintainers: Alexander Viro, Christian Brauner, Andrew Morton, David Hildenbrand, Lorenzo Stoakes, Mike Rapoport, Liam R. Howlett, Linus Torvalds

The VMA sequence count has been introduced to allow fast detection of
VMA modification when running a page fault handler without holding
the mmap_sem.

This patch provides protection against the VMA modification done in :
	- madvise()
	- mremap()
	- mpol_rebind_policy()
	- vma_replace_policy()
	- change_prot_numa()
	- mlock(), munlock()
	- mprotect()
	- mmap_region()
	- collapse_huge_page()
	- userfaultd registering services

In addition, VMA fields which will be read during the speculative fault
path needs to be written using WRITE_ONCE to prevent write to be split
and intermediate values to be pushed to other CPUs.

Signed-off-by: Laurent Dufour <redacted>
---
 fs/proc/task_mmu.c |  5 ++++-
 fs/userfaultfd.c   | 17 +++++++++++++----
 mm/khugepaged.c    |  3 +++
 mm/madvise.c       |  6 +++++-
 mm/mempolicy.c     | 51 ++++++++++++++++++++++++++++++++++-----------------
 mm/mlock.c         | 13 ++++++++-----
 mm/mmap.c          | 17 ++++++++++-------
 mm/mprotect.c      |  4 +++-
 mm/mremap.c        |  7 +++++++
 9 files changed, 87 insertions(+), 36 deletions(-)

diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
index fe8f3265e877..e682179edaae 100644
--- a/fs/proc/task_mmu.c
+++ b/fs/proc/task_mmu.c

@@ -1067,8 +1067,11 @@ static ssize_t clear_refs_write(struct file *file, const char __user *buf,
 					goto out_mm;
 				}
 				for (vma = mm->mmap; vma; vma = vma->vm_next) {
-					vma->vm_flags &= ~VM_SOFTDIRTY;
+					write_seqcount_begin(&vma->vm_sequence);
+					WRITE_ONCE(vma->vm_flags,
+						   vma->vm_flags & ~VM_SOFTDIRTY);
 					vma_set_page_prot(vma);
+					write_seqcount_end(&vma->vm_sequence);
 				}
 				downgrade_write(&mm->mmap_sem);
 				break;

diff --git a/fs/userfaultfd.c b/fs/userfaultfd.c
index b0d5897bc4e6..77b1e025c88e 100644
--- a/fs/userfaultfd.c
+++ b/fs/userfaultfd.c

@@ -612,8 +612,11 @@ int dup_userfaultfd(struct vm_area_struct *vma, struct list_head *fcs)
 
 	octx = vma->vm_userfaultfd_ctx.ctx;
 	if (!octx || !(octx->features & UFFD_FEATURE_EVENT_FORK)) {
+		write_seqcount_begin(&vma->vm_sequence);
 		vma->vm_userfaultfd_ctx = NULL_VM_UFFD_CTX;
-		vma->vm_flags &= ~(VM_UFFD_WP | VM_UFFD_MISSING);
+		WRITE_ONCE(vma->vm_flags,
+			   vma->vm_flags & ~(VM_UFFD_WP | VM_UFFD_MISSING));
+		write_seqcount_end(&vma->vm_sequence);
 		return 0;
 	}

@@ -838,8 +841,10 @@ static int userfaultfd_release(struct inode *inode, struct file *file)
 			vma = prev;
 		else
 			prev = vma;
-		vma->vm_flags = new_flags;
+		write_seqcount_begin(&vma->vm_sequence);
+		WRITE_ONCE(vma->vm_flags, new_flags);
 		vma->vm_userfaultfd_ctx = NULL_VM_UFFD_CTX;
+		write_seqcount_end(&vma->vm_sequence);
 	}
 	up_write(&mm->mmap_sem);
 	mmput(mm);

@@ -1357,8 +1362,10 @@ static int userfaultfd_register(struct userfaultfd_ctx *ctx,
 		 * the next vma was merged into the current one and
 		 * the current one has not been updated yet.
 		 */
-		vma->vm_flags = new_flags;
+		write_seqcount_begin(&vma->vm_sequence);
+		WRITE_ONCE(vma->vm_flags, new_flags);
 		vma->vm_userfaultfd_ctx.ctx = ctx;
+		write_seqcount_end(&vma->vm_sequence);
 
 	skip:
 		prev = vma;

@@ -1515,8 +1522,10 @@ static int userfaultfd_unregister(struct userfaultfd_ctx *ctx,
 		 * the next vma was merged into the current one and
 		 * the current one has not been updated yet.
 		 */
-		vma->vm_flags = new_flags;
+		write_seqcount_begin(&vma->vm_sequence);
+		WRITE_ONCE(vma->vm_flags, new_flags);
 		vma->vm_userfaultfd_ctx = NULL_VM_UFFD_CTX;
+		write_seqcount_end(&vma->vm_sequence);
 
 	skip:
 		prev = vma;

diff --git a/mm/khugepaged.c b/mm/khugepaged.c
index c01f177a1120..56dd994c05d0 100644
--- a/mm/khugepaged.c
+++ b/mm/khugepaged.c

@@ -1005,6 +1005,7 @@ static void collapse_huge_page(struct mm_struct *mm,
 	if (mm_find_pmd(mm, address) != pmd)
 		goto out;
 
+	write_seqcount_begin(&vma->vm_sequence);
 	anon_vma_lock_write(vma->anon_vma);
 
 	pte = pte_offset_map(pmd, address);

@@ -1040,6 +1041,7 @@ static void collapse_huge_page(struct mm_struct *mm,
 		pmd_populate(mm, pmd, pmd_pgtable(_pmd));
 		spin_unlock(pmd_ptl);
 		anon_vma_unlock_write(vma->anon_vma);
+		write_seqcount_end(&vma->vm_sequence);
 		result = SCAN_FAIL;
 		goto out;
 	}

@@ -1074,6 +1076,7 @@ static void collapse_huge_page(struct mm_struct *mm,
 	set_pmd_at(mm, address, pmd, _pmd);
 	update_mmu_cache_pmd(vma, address, pmd);
 	spin_unlock(pmd_ptl);
+	write_seqcount_end(&vma->vm_sequence);
 
 	*hpage = NULL;

diff --git a/mm/madvise.c b/mm/madvise.c
index 47d8d8a25eae..8fc4f73c8ac5 100644
--- a/mm/madvise.c
+++ b/mm/madvise.c

@@ -172,7 +172,9 @@ static long madvise_behavior(struct vm_area_struct *vma,
 	/*
 	 * vm_flags is protected by the mmap_sem held in write mode.
 	 */
-	vma->vm_flags = new_flags;
+	write_seqcount_begin(&vma->vm_sequence);
+	WRITE_ONCE(vma->vm_flags, new_flags);
+	write_seqcount_end(&vma->vm_sequence);
 out:
 	return error;
 }

@@ -440,9 +442,11 @@ static void madvise_free_page_range(struct mmu_gather *tlb,
 		.private = tlb,
 	};
 
+	write_seqcount_begin(&vma->vm_sequence);
 	tlb_start_vma(tlb, vma);
 	walk_page_range(addr, end, &free_walk);
 	tlb_end_vma(tlb, vma);
+	write_seqcount_end(&vma->vm_sequence);
 }
 
 static int madvise_free_single_vma(struct vm_area_struct *vma,

diff --git a/mm/mempolicy.c b/mm/mempolicy.c
index d911fa5cb2a7..8e2f67af8e05 100644
--- a/mm/mempolicy.c
+++ b/mm/mempolicy.c

@@ -378,8 +378,11 @@ void mpol_rebind_mm(struct mm_struct *mm, nodemask_t *new)
 	struct vm_area_struct *vma;
 
 	down_write(&mm->mmap_sem);
-	for (vma = mm->mmap; vma; vma = vma->vm_next)
+	for (vma = mm->mmap; vma; vma = vma->vm_next) {
+		write_seqcount_begin(&vma->vm_sequence);
 		mpol_rebind_policy(vma->vm_policy, new);
+		write_seqcount_end(&vma->vm_sequence);
+	}
 	up_write(&mm->mmap_sem);
 }

@@ -537,9 +540,11 @@ unsigned long change_prot_numa(struct vm_area_struct *vma,
 {
 	int nr_updated;
 
+	write_seqcount_begin(&vma->vm_sequence);
 	nr_updated = change_protection(vma, addr, end, PAGE_NONE, 0, 1);
 	if (nr_updated)
 		count_vm_numa_events(NUMA_PTE_UPDATES, nr_updated);
+	write_seqcount_end(&vma->vm_sequence);
 
 	return nr_updated;
 }

@@ -640,6 +645,7 @@ static int vma_replace_policy(struct vm_area_struct *vma,
 	if (IS_ERR(new))
 		return PTR_ERR(new);
 
+	write_seqcount_begin(&vma->vm_sequence);
 	if (vma->vm_ops && vma->vm_ops->set_policy) {
 		err = vma->vm_ops->set_policy(vma, new);
 		if (err)

@@ -647,11 +653,17 @@ static int vma_replace_policy(struct vm_area_struct *vma,
 	}
 
 	old = vma->vm_policy;
-	vma->vm_policy = new; /* protected by mmap_sem */
+	/*
+	 * The speculative page fault handler access this field without
+	 * hodling the mmap_sem.
+	 */
+	WRITE_ONCE(vma->vm_policy,  new);
+	write_seqcount_end(&vma->vm_sequence);
 	mpol_put(old);
 
 	return 0;
  err_out:
+	write_seqcount_end(&vma->vm_sequence);
 	mpol_put(new);
 	return err;
 }

@@ -1505,23 +1517,28 @@ COMPAT_SYSCALL_DEFINE6(mbind, compat_ulong_t, start, compat_ulong_t, len,
 struct mempolicy *__get_vma_policy(struct vm_area_struct *vma,
 						unsigned long addr)
 {
-	struct mempolicy *pol = NULL;
+	struct mempolicy *pol;
 
-	if (vma) {
-		if (vma->vm_ops && vma->vm_ops->get_policy) {
-			pol = vma->vm_ops->get_policy(vma, addr);
-		} else if (vma->vm_policy) {
-			pol = vma->vm_policy;
+	if (!vma)
+		return NULL;
 
-			/*
-			 * shmem_alloc_page() passes MPOL_F_SHARED policy with
-			 * a pseudo vma whose vma->vm_ops=NULL. Take a reference
-			 * count on these policies which will be dropped by
-			 * mpol_cond_put() later
-			 */
-			if (mpol_needs_cond_ref(pol))
-				mpol_get(pol);
-		}
+	if (vma->vm_ops && vma->vm_ops->get_policy)
+		return vma->vm_ops->get_policy(vma, addr);
+
+	/*
+	 * This could be called without holding the mmap_sem in the
+	 * speculative page fault handler's path.
+	 */
+	pol = READ_ONCE(vma->vm_policy);
+	if (pol) {
+		/*
+		 * shmem_alloc_page() passes MPOL_F_SHARED policy with
+		 * a pseudo vma whose vma->vm_ops=NULL. Take a reference
+		 * count on these policies which will be dropped by
+		 * mpol_cond_put() later
+		 */
+		if (mpol_needs_cond_ref(pol))
+			mpol_get(pol);
 	}
 
 	return pol;

diff --git a/mm/mlock.c b/mm/mlock.c
index b562b5523a65..23d16dbff7fb 100644
--- a/mm/mlock.c
+++ b/mm/mlock.c

@@ -438,7 +438,9 @@ static unsigned long __munlock_pagevec_fill(struct pagevec *pvec,
 void munlock_vma_pages_range(struct vm_area_struct *vma,
 			     unsigned long start, unsigned long end)
 {
-	vma->vm_flags &= VM_LOCKED_CLEAR_MASK;
+	write_seqcount_begin(&vma->vm_sequence);
+	WRITE_ONCE(vma->vm_flags, vma->vm_flags & VM_LOCKED_CLEAR_MASK);
+	write_seqcount_end(&vma->vm_sequence);
 
 	while (start < end) {
 		struct page *page;

@@ -563,10 +565,11 @@ static int mlock_fixup(struct vm_area_struct *vma, struct vm_area_struct **prev,
 	 * It's okay if try_to_unmap_one unmaps a page just after we
 	 * set VM_LOCKED, populate_vma_page_range will bring it back.
 	 */
-
-	if (lock)
-		vma->vm_flags = newflags;
-	else
+	if (lock) {
+		write_seqcount_begin(&vma->vm_sequence);
+		WRITE_ONCE(vma->vm_flags, newflags);
+		write_seqcount_end(&vma->vm_sequence);
+	} else
 		munlock_vma_pages_range(vma, start, end);
 
 out:

diff --git a/mm/mmap.c b/mm/mmap.c
index 140b22136cb7..b480043e38fb 100644
--- a/mm/mmap.c
+++ b/mm/mmap.c

@@ -825,17 +825,18 @@ int __vma_adjust(struct vm_area_struct *vma, unsigned long start,
 	}
 
 	if (start != vma->vm_start) {
-		vma->vm_start = start;
+		WRITE_ONCE(vma->vm_start, start);
 		start_changed = true;
 	}
 	if (end != vma->vm_end) {
-		vma->vm_end = end;
+		WRITE_ONCE(vma->vm_end, end);
 		end_changed = true;
 	}
-	vma->vm_pgoff = pgoff;
+	WRITE_ONCE(vma->vm_pgoff, pgoff);
 	if (adjust_next) {
-		next->vm_start += adjust_next << PAGE_SHIFT;
-		next->vm_pgoff += adjust_next;
+		WRITE_ONCE(next->vm_start,
+			   next->vm_start + (adjust_next << PAGE_SHIFT));
+		WRITE_ONCE(next->vm_pgoff, next->vm_pgoff + adjust_next);
 	}
 
 	if (root) {

@@ -1734,6 +1735,7 @@ unsigned long mmap_region(struct file *file, unsigned long addr,
 out:
 	perf_event_mmap(vma);
 
+	write_seqcount_begin(&vma->vm_sequence);
 	vm_stat_account(mm, vm_flags, len >> PAGE_SHIFT);
 	if (vm_flags & VM_LOCKED) {
 		if (!((vm_flags & VM_SPECIAL) || is_vm_hugetlb_page(vma) ||

@@ -1756,6 +1758,7 @@ unsigned long mmap_region(struct file *file, unsigned long addr,
 	vma->vm_flags |= VM_SOFTDIRTY;
 
 	vma_set_page_prot(vma);
+	write_seqcount_end(&vma->vm_sequence);
 
 	return addr;

@@ -2384,8 +2387,8 @@ int expand_downwards(struct vm_area_struct *vma,
 					mm->locked_vm += grow;
 				vm_stat_account(mm, vma->vm_flags, grow);
 				anon_vma_interval_tree_pre_update_vma(vma);
-				vma->vm_start = address;
-				vma->vm_pgoff -= grow;
+				WRITE_ONCE(vma->vm_start, address);
+				WRITE_ONCE(vma->vm_pgoff, vma->vm_pgoff - grow);
 				anon_vma_interval_tree_post_update_vma(vma);
 				vma_gap_update(vma);
 				spin_unlock(&mm->page_table_lock);

diff --git a/mm/mprotect.c b/mm/mprotect.c
index bd0f409922cb..0def85982d6c 100644
--- a/mm/mprotect.c
+++ b/mm/mprotect.c

@@ -344,7 +344,8 @@ mprotect_fixup(struct vm_area_struct *vma, struct vm_area_struct **pprev,
 	 * vm_flags and vm_page_prot are protected by the mmap_sem
 	 * held in write mode.
 	 */
-	vma->vm_flags = newflags;
+	write_seqcount_begin(&vma->vm_sequence);
+	WRITE_ONCE(vma->vm_flags, newflags);
 	dirty_accountable = vma_wants_writenotify(vma, vma->vm_page_prot);
 	vma_set_page_prot(vma);

@@ -359,6 +360,7 @@ mprotect_fixup(struct vm_area_struct *vma, struct vm_area_struct **pprev,
 			(newflags & VM_WRITE)) {
 		populate_vma_page_range(vma, start, end, NULL);
 	}
+	write_seqcount_end(&vma->vm_sequence);
 
 	vm_stat_account(mm, oldflags, -nrpages);
 	vm_stat_account(mm, newflags, nrpages);

diff --git a/mm/mremap.c b/mm/mremap.c
index 3f23715d3c69..1abadea8ab84 100644
--- a/mm/mremap.c
+++ b/mm/mremap.c

@@ -301,6 +301,10 @@ static unsigned long move_vma(struct vm_area_struct *vma,
 	if (!new_vma)
 		return -ENOMEM;
 
+	write_seqcount_begin(&vma->vm_sequence);
+	write_seqcount_begin_nested(&new_vma->vm_sequence,
+				    SINGLE_DEPTH_NESTING);
+
 	moved_len = move_page_tables(vma, old_addr, new_vma, new_addr, old_len,
 				     need_rmap_locks);
 	if (moved_len < old_len) {

@@ -317,6 +321,7 @@ static unsigned long move_vma(struct vm_area_struct *vma,
 		 */
 		move_page_tables(new_vma, new_addr, vma, old_addr, moved_len,
 				 true);
+		write_seqcount_end(&vma->vm_sequence);
 		vma = new_vma;
 		old_len = new_len;
 		old_addr = new_addr;

@@ -325,7 +330,9 @@ static unsigned long move_vma(struct vm_area_struct *vma,
 		mremap_userfaultfd_prep(new_vma, uf);
 		arch_remap(mm, old_addr, old_addr + old_len,
 			   new_addr, new_addr + new_len);
+		write_seqcount_end(&vma->vm_sequence);
 	}
+	write_seqcount_end(&new_vma->vm_sequence);
 
 	/* Conceal VM_ACCOUNT so old reservation is not undone */
 	if (vm_flags & VM_ACCOUNT) {

-- 
2.7.4

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help