Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no... | linux-mm

[PATCH 0/9] Reduce compaction scanning and lock contention · Mel Gorman <mgorman@suse.de> · 2012-09-21
[PATCH 1/9] Revert "mm: compaction: check lock contention first before taking lock" · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 1/9] Revert "mm: compaction: check lock contention first before taking lock" · Rafael Aquini <hidden> · 2012-09-21
[PATCH 3/9] Revert "mm: compaction: abort compaction loop if lock is contended or run too long" · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 3/9] Revert "mm: compaction: abort compaction loop if lock is contended or run too long" · Rafael Aquini <hidden> · 2012-09-21
[PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Minchan Kim <minchan@kernel.org> · 2012-09-25
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Mel Gorman <mgorman@suse.de> · 2012-09-25
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Minchan Kim <minchan@kernel.org> · 2012-09-25
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Andrew Morton <akpm@linux-foundation.org> · 2012-09-25
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Minchan Kim <minchan@kernel.org> · 2012-09-26
Re: [PATCH 5/9] mm: compaction: Acquire the zone->lru_lock as late as possible · Mel Gorman <mgorman@suse.de> · 2012-09-26
[PATCH 4/9] mm: compaction: Abort compaction loop if lock is contended or run too long · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 4/9] mm: compaction: Abort compaction loop if lock is contended or run too long · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 4/9] mm: compaction: Abort compaction loop if lock is contended or run too long · Andrew Morton <akpm@linux-foundation.org> · 2012-09-21
Re: [PATCH 4/9] mm: compaction: Abort compaction loop if lock is contended or run too long · Minchan Kim <minchan@kernel.org> · 2012-09-25
[PATCH 7/9] Revert "mm: have order > 0 compaction start off where it left" · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 7/9] Revert "mm: have order > 0 compaction start off where it left" · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 7/9] Revert "mm: have order > 0 compaction start off where it left" · Minchan Kim <minchan@kernel.org> · 2012-09-25
[PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Andrew Morton <akpm@linux-foundation.org> · 2012-09-21
Re: [PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Mel Gorman <mgorman@suse.de> · 2012-09-24
Re: [PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Minchan Kim <minchan@kernel.org> · 2012-09-25
Re: [PATCH 6/9] mm: compaction: Acquire the zone->lock as late as possible · Minchan Kim <minchan@kernel.org> · 2012-09-25
[PATCH 9/9] mm: compaction: Restart compaction from near where it left off · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 9/9] mm: compaction: Restart compaction from near where it left off · Rafael Aquini <hidden> · 2012-09-21
[PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Andrew Morton <akpm@linux-foundation.org> · 2012-09-21
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Mel Gorman <mgorman@suse.de> · 2012-09-24
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Andrew Morton <akpm@linux-foundation.org> · 2012-09-24
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Mel Gorman <mgorman@suse.de> · 2012-09-25
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Andrew Morton <akpm@linux-foundation.org> · 2012-09-25
[PATCH] mm: compaction: cache if a pageblock was scanned and no pages were isolated -fix2 · Mel Gorman <mgorman@suse.de> · 2012-09-27
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Mel Gorman <mgorman@suse.de> · 2012-09-27
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Minchan Kim <minchan@kernel.org> · 2012-09-26
Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated · Mel Gorman <mgorman@suse.de> · 2012-09-27
[PATCH 2/9] Revert "mm-compaction-abort-compaction-loop-if-lock-is-contended-or-run-too-long-fix" · Mel Gorman <mgorman@suse.de> · 2012-09-21
Re: [PATCH 2/9] Revert "mm-compaction-abort-compaction-loop-if-lock-is-contended-or-run-too-long-fix" · Rafael Aquini <hidden> · 2012-09-21
Re: [PATCH 0/9] Reduce compaction scanning and lock contention · Rik van Riel <hidden> · 2012-09-21

Re: [PATCH 8/9] mm: compaction: Cache if a pageblock was scanned and no pages were isolated

From: Andrew Morton <akpm@linux-foundation.org>
Date: 2012-09-25 20:03:56
Also in: kvm, lkml, qemu-devel

On Tue, 25 Sep 2012 10:12:07 +0100
Mel Gorman [off-list ref] wrote:

First, we'd introduce a variant of get_pageblock_migratetype() that returns
all the bits for the pageblock flags and then helpers to extract either the
migratetype or the PG_migrate_skip. We already are incurring the cost of
get_pageblock_migratetype() so it will not be much more expensive than what
is already there. If there is an allocation or free within a pageblock that
as the PG_migrate_skip bit set then we increment a counter. When the counter
reaches some to-be-decided "threshold" then compaction may clear all the
bits. This would match the criteria of the clearing being based on activity.

There are four potential problems with this

1. The logic to retrieve all the bits and split them up will be a little
   convulated but maybe it would not be that bad.

2. The counter is a shared-writable cache line but obviously it could
   be moved to vmstat and incremented with inc_zone_page_state to offset
   the cost a little.

3. The biggested weakness is that there is not way to know if the
   counter is incremented based on activity in a small subset of blocks.

4. What should the threshold be?

The first problem is minor but the other three are potentially a mess.
Adding another vmstat counter is bad enough in itself but if the counter
is incremented based on a small subsets of pageblocks, the hint becomes
is potentially useless.

However, does this match what you have in mind or am I over-complicating
things?

Sounds complicated.

Using wall time really does suck.  Are you sure you can't think of
something more logical?

How would we demonstrate the suckage?  What would be the observeable downside of
switching that 5 seconds to 5 hours?

quoted

+	for (pfn = start_pfn; pfn < end_pfn; pfn += pageblock_nr_pages) {
+		struct page *page;
+		if (!pfn_valid(pfn))
+			continue;
+
+		page = pfn_to_page(pfn);
+		if (zone != page_zone(page))
+			continue;
+
+		clear_pageblock_skip(page);
+	}

What's the worst-case loop count here?

zone->spanned_pages >> pageblock_order

What's the worst-case value of (zone->spanned_pages >> pageblock_order) :)

Lets take an unlikely case - 128G single-node machine. That loop count
on x86-64 would be 65536. It'll be fast enough, particularly in this
path.

That could easily exceed a millisecond.  Can/should we stick a
cond_resched() in there?

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help