Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page

[PATCH v4 00/10, REBASED] Introduce huge zero page · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 02/10] thp: zap_huge_pmd(): zap huge zero pmd · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 01/10] thp: huge zero page: basic preparation · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-15
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-18
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-18
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-23
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-23
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-23
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-23
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-23
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-24
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-24
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-24
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-24
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andi Kleen <hidden> · 2012-10-24
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-25
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-25
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-25
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Andrew Morton <akpm@linux-foundation.org> · 2012-10-25
Re: [PATCH v4 10/10] thp: implement refcounting for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-25
[PATCH v4 09/10] thp: lazy huge zero page allocation · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 07/10] thp: implement splitting pmd for huge zero page · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 08/10] thp: setup huge zero page on non-write page fault · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 05/10] thp: change_huge_pmd(): keep huge zero page write-protected · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 06/10] thp: change split_huge_page_pmd() interface · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 04/10] thp: do_huge_pmd_wp_page(): handle huge zero page · Kirill A. Shutemov <hidden> · 2012-10-15
[PATCH v4 03/10] thp: copy_huge_pmd(): copy huge zero page · Kirill A. Shutemov <hidden> · 2012-10-15
Re: [PATCH v4 00/10, REBASED] Introduce huge zero page · Ni zhan Chen <hidden> · 2012-10-16
Re: [PATCH v4 00/10, REBASED] Introduce huge zero page · Kirill A. Shutemov <hidden> · 2012-10-16
Re: [PATCH v4 00/10, REBASED] Introduce huge zero page · Ni zhan Chen <hidden> · 2012-10-16
Re: [PATCH v4 00/10, REBASED] Introduce huge zero page · Kirill A. Shutemov <hidden> · 2012-10-16
Re: [PATCH v4 00/10, REBASED] Introduce huge zero page · Ni zhan Chen <hidden> · 2012-10-16
[PATCH] thp, vmstat: implement HZP_ALLOC and HZP_ALLOC_FAILED events · Kirill A. Shutemov <hidden> · 2012-10-26

From: Kirill A. Shutemov <hidden>
Date: 2012-10-25 21:21:36
Also in: lkml

On Thu, Oct 25, 2012 at 02:05:24PM -0700, Andrew Morton wrote:

On Thu, 25 Oct 2012 23:49:59 +0300
"Kirill A. Shutemov" [off-list ref] wrote:

quoted

On Wed, Oct 24, 2012 at 01:25:52PM -0700, Andrew Morton wrote:

quoted

On Wed, 24 Oct 2012 22:45:52 +0300
"Kirill A. Shutemov" [off-list ref] wrote:

quoted

On Wed, Oct 24, 2012 at 12:22:53PM -0700, Andrew Morton wrote:

quoted

I'm thinking that such a workload would be the above dd in parallel
with a small app which touches the huge page and then exits, then gets
executed again.  That "small app" sounds realistic to me.  Obviously
one could exercise the zero page's refcount at higher frequency with a
tight map/touch/unmap loop, but that sounds less realistic.  It's worth
trying that exercise as well though.

Or do something else.  But we should try to probe this code's
worst-case behaviour, get an understanding of its effects and then
decide whether any such workload is realisic enough to worry about.

Okay, I'll try few memory pressure scenarios.

A test program:

        while (1) {
                posix_memalign((void **)&p, 2 * MB, 2 * MB);
                assert(*p == 0);
                free(p);
        }

With this code in background we have pretty good chance to have huge zero
page freeable (refcount == 1) when shrinker callback called - roughly one
of two.

Pagecache hog (dd if=hugefile of=/dev/null bs=1M) creates enough pressure
to get shrinker callback called, but it was only asked about cache size
(nr_to_scan == 0).
I was not able to get it called with nr_to_scan > 0 on this scenario, so
hzp never freed.

hm.  It's odd that the kernel didn't try to shrink slabs in this case. 
Why didn't it??

nr_to_scan == 0 asks for the fast path. shrinker callback can shink, if
it thinks it's good idea.

quoted

I also tried another scenario: usemem -n16 100M -r 1000. It creates real
memory pressure - no easy reclaimable memory. This time callback called
with nr_to_scan > 0 and we freed hzp. Under pressure we fails to allocate
hzp and code goes to fallback path as it supposed to.

Do I need to check any other scenario?

I'm thinking that if we do hit problems in this area, we could avoid
freeing the hugepage unless the scan_control.priority is high enough. 
That would involve adding a magic number or a tunable to set the
threshold.

What about ratelimit on alloc path to force fallback if we allocate
to often? Is it good idea?

Also, it would be beneficial if we can monitor this easily.  Perhaps
add a counter to /proc/vmstat which tells us how many times that page
has been reallocated?  And perhaps how many times we tried to allocate
it but failed?

Okay, I'll prepare patch.

-- 
 Kirill A. Shutemov

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help