Re: [patch 5/5] mm: refault distance-based file cache sizing

[patch 0/5] refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
[patch 1/5] mm: readahead: move radix tree hole searching here · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 1/5] mm: readahead: move radix tree hole searching here · Rik van Riel <hidden> · 2012-05-01
[patch 2/5] mm + fs: prepare for non-page entries in page cache · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 2/5] mm + fs: prepare for non-page entries in page cache · Andrew Morton <akpm@linux-foundation.org> · 2012-05-01
Re: [patch 2/5] mm + fs: prepare for non-page entries in page cache · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 2/5] mm + fs: prepare for non-page entries in page cache · Andrew Morton <akpm@linux-foundation.org> · 2012-05-01
Re: [patch 2/5] mm + fs: prepare for non-page entries in page cache · Rik van Riel <hidden> · 2012-05-01
Re: [patch 2/5] mm + fs: prepare for non-page entries in page cache · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
[patch 3/5] mm + fs: store shadow pages in page cache · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
[patch 4/5] mm + fs: provide refault distance to page cache instantiations · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 4/5] mm + fs: provide refault distance to page cache instantiations · Peter Zijlstra <peterz@infradead.org> · 2012-05-01
Re: [patch 4/5] mm + fs: provide refault distance to page cache instantiations · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 4/5] mm + fs: provide refault distance to page cache instantiations · Peter Zijlstra <peterz@infradead.org> · 2012-05-01
[patch 5/5] mm: refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 5/5] mm: refault distance-based file cache sizing · Minchan Kim <minchan@kernel.org> · 2012-05-01
Re: [patch 5/5] mm: refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-01
Re: [patch 5/5] mm: refault distance-based file cache sizing · Minchan Kim <minchan@kernel.org> · 2012-05-02
Re: [patch 5/5] mm: refault distance-based file cache sizing · Andrea Arcangeli <hidden> · 2012-05-02
Re: [patch 5/5] mm: refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-02
Re: [patch 5/5] mm: refault distance-based file cache sizing · Andrea Arcangeli <hidden> · 2012-05-02
Re: [patch 0/5] refault distance-based file cache sizing · Andrew Morton <akpm@linux-foundation.org> · 2012-05-01
Re: [patch 0/5] refault distance-based file cache sizing · Rik van Riel <hidden> · 2012-05-01
Re: [patch 0/5] refault distance-based file cache sizing · Andrew Morton <akpm@linux-foundation.org> · 2012-05-01
Re: [patch 0/5] refault distance-based file cache sizing · Andrea Arcangeli <hidden> · 2012-05-02
Re: [patch 0/5] refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-03
Re: [patch 0/5] refault distance-based file cache sizing · nai.xia <hidden> · 2012-05-16
Re: [patch 0/5] refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-16
Re: [patch 0/5] refault distance-based file cache sizing · nai.xia <hidden> · 2012-05-16
Re: [patch 0/5] refault distance-based file cache sizing · Johannes Weiner <hannes@cmpxchg.org> · 2012-05-17
Re: [patch 0/5] refault distance-based file cache sizing · Nai Xia <hidden> · 2012-05-18
Re: [patch 0/5] refault distance-based file cache sizing · Rik van Riel <hidden> · 2012-05-18
Re: [patch 0/5] refault distance-based file cache sizing · Nai Xia <hidden> · 2012-05-18
Re: [patch 0/5] refault distance-based file cache sizing · Rik van Riel <hidden> · 2012-05-17
Re: [patch 0/5] refault distance-based file cache sizing · Nai Xia <hidden> · 2012-05-18

From: Minchan Kim <minchan@kernel.org>
Date: 2012-05-01 14:13:30
Also in: linux-fsdevel, lkml

Hi Hannes,

On Tue, May 01, 2012 at 10:41:53AM +0200, Johannes Weiner wrote:

To protect frequently used page cache (workingset) from bursts of less
frequently used or one-shot cache, page cache pages are managed on two
linked lists.  The inactive list is where all cache starts out on
fault and ends on reclaim.  Pages that get accessed another time while
on the inactive list get promoted to the active list to protect them
from reclaim.

Right now we have two main problems.

One stems from numa allocation decisions and how the page allocator
and kswapd interact.  The both of them can enter into a perfect loop
where kswapd reclaims from the preferred zone of a task, allowing the
task to continuously allocate from that zone.  Or, the node distance
can lead to the allocator to do direct zone reclaim to stay in the
preferred zone.  This may be good for locality, but the task has only

Understood.

the inactive space of that one zone to get its memory activated.
Forcing the allocator to spread out to lower zones in the right
situation makes the difference between continuous IO to serve the
workingset, or taking the numa cost but serving fully from memory.

It's hard to parse your word due to my dumb brain.
Could you elaborate on it?
It would be a good if you say with example.

The other issue is that with the two lists alone, we can never detect
when a new set of data with equal access frequency should be cached if
the size of it is bigger than total/allowed memory minus the active
set.  Currently we have the perfect compromise given those
constraints: the active list is not allowed to grow bigger than the
inactive list.  This means that we can protect cache from reclaim only

Okay.

up to half of memory, and don't recognize workingset changes that are
bigger than half of memory.

Workingset change?
You mean if new workingset is bigger than half of memory and it's like
stream before retouch, we could cache only part of working set because 
head pages on working set would be discared by tail pages of working set
in inactive list?

I'm sure I totally coudln't parse your point.
Could you explain in detail? Before reading your approach and diving into code,
I would like to see the problem clearly.

Thanks.
 

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help