Re: [PATCH RESEND 0/8] hugetlb: add demote/split page functionality

From: Mike Kravetz <hidden>
Date: 2021-09-09 21:32:16
Also in: lkml

On 9/9/21 6:45 AM, Vlastimil Babka wrote:

On 9/9/21 13:54, Michal Hocko wrote:

On Wed 08-09-21 14:00:19, Mike Kravetz wrote:

On 9/7/21 1:50 AM, Hillf Danton wrote:

On Mon, 6 Sep 2021 16:40:28 +0200 Vlastimil Babka wrote:

And/or clamp reclaim retries for costly orders

	reclaim retries = MAX_RECLAIM_RETRIES - order;

to keep the chance of a stall as low as possible.
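
As a rough illustration of the clamp suggested above, here is a minimal standalone sketch. It is not a patch from this thread: the helper name reclaim_retry_budget() and the floor of one retry are assumptions made for the example, and the two constants simply mirror the kernel's values (MAX_RECLAIM_RETRIES is 16, PAGE_ALLOC_COSTLY_ORDER is 3).

```c
/*
 * Standalone sketch, not kernel code. The constants mirror the kernel's
 * definitions; the helper and its floor of 1 are hypothetical.
 */
#define MAX_RECLAIM_RETRIES      16
#define PAGE_ALLOC_COSTLY_ORDER  3

/* Retry budget for an allocation of the given order. */
static int reclaim_retry_budget(unsigned int order)
{
	int budget = MAX_RECLAIM_RETRIES;

	/* Only costly orders give up part of their retry budget. */
	if (order > PAGE_ALLOC_COSTLY_ORDER)
		budget -= (int)order;

	/* Assumption: always allow at least one attempt. */
	return budget > 1 ? budget : 1;
}
```

With these values an order-9 request (a 2MB huge page with 4KB base pages) would get 7 retries instead of 16, while orders 0 through 3 keep the full budget.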

Thanks, and sorry for not replying quickly.  I only get back to this as
time allows.

We could clamp the number of compaction and reclaim retries in
__alloc_pages_slowpath as suggested.  However, I noticed that a single
reclaim call could take a bunch of time.  As a result, I instrumented
shrink_node to see what might be happening.  Here is some information
from a long stall.  Note that I only dump stats when jiffies > 100000.

[ 8136.874706] shrink_node: 507654 total jiffies,  3557110 tries
[ 8136.881130]              130596341 reclaimed, 32 nr_to_reclaim
[ 8136.887643]              compaction_suitable results:
[ 8136.893276]     idx COMPACT_SKIPPED, 3557109

Can you get a more detailed breakdown of where the time is spent? Also,
how come the number of reclaimed pages is so excessive compared to the
reclaim target? There is something fishy going on here.

I would say it's simply should_continue_reclaim() behaving similarly to
should_compact_retry(). We'll get compaction_suitable() returning
COMPACT_SKIPPED because the reclaimed pages have been immediately stolen,
and compaction indicates there are not enough base pages to begin with to
form a high-order page. Since the stolen pages will appear on the inactive
LRU, it seems to be worth continuing reclaim to make enough free base pages
for compaction to no longer be skipped, because "inactive_lru_pages >
pages_for_compaction" is true.

So, both should_continue_reclaim() and should_compact_retry() are unable to
recognize that reclaimed pages are being stolen and limit the retries in
that case. The scenario seems to be uncommon, otherwise we'd be getting more
reports of that.
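
The feedback loop described above can be sketched with a small userspace model. This is a deliberate simplification and not the code in mm/vmscan.c: every model_* name is hypothetical, the compaction check is reduced to a plain free-page threshold of 2 << order (the real compaction_suitable() works on zone watermarks), and "stolen" simply means a competing allocator takes each reclaimed batch and the pages land back on the inactive LRU.

```c
#include <stdbool.h>

/* Hypothetical, simplified view of one node's page counts. */
struct model_node {
	unsigned long free_pages;
	unsigned long inactive_lru_pages;
};

/* Assumption: free pages needed before compaction is worth trying. */
static unsigned long model_compact_gap(unsigned int order)
{
	return 2UL << order;
}

/* True when compaction would no longer be skipped. */
static bool model_compaction_suitable(const struct model_node *n,
				      unsigned int order)
{
	return n->free_pages >= model_compact_gap(order);
}

/* Simplified continue-reclaim decision, following the description above. */
static bool model_should_continue_reclaim(const struct model_node *n,
					  unsigned long nr_reclaimed,
					  unsigned int order)
{
	if (!nr_reclaimed)
		return false;
	if (model_compaction_suitable(n, order))
		return false;
	return n->inactive_lru_pages > model_compact_gap(order);
}

/* Run reclaim rounds until the decision says stop or max_rounds is hit. */
static unsigned long model_rounds(struct model_node *n, unsigned int order,
				  unsigned long batch, bool stolen,
				  unsigned long max_rounds)
{
	unsigned long rounds = 0;
	unsigned long reclaimed;

	do {
		reclaimed = batch < n->inactive_lru_pages ?
			    batch : n->inactive_lru_pages;
		n->inactive_lru_pages -= reclaimed;
		n->free_pages += reclaimed;
		if (stolen) {
			/* A competing allocator takes the freed pages,
			 * which then reappear on the inactive LRU. */
			n->free_pages -= reclaimed;
			n->inactive_lru_pages += reclaimed;
		}
		rounds++;
	} while (rounds < max_rounds &&
		 model_should_continue_reclaim(n, reclaimed, order));

	return rounds;
}
```

In this model, starting from 0 free and 100000 inactive pages with order 9 and a batch of 32, reclaim stops after 32 rounds when nothing is stolen. When every batch is stolen, the loop runs until the artificial max_rounds cap, because nothing in the decision itself notices that progress is being undone.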

Yes, I believe this is what is happening.

I honestly do not know if my test/recreation scenario is realistic.

I do know that our DB team has had issues with allocating a number of
hugetlb pages (after much uptime) taking forever or a REALLY long time.
These are all 2MB huge page allocations, so they go through the normal
page allocation code path.  I have no idea what else is running on the
system at the time of the allocation stalls.  Unfortunately, this cannot
be reproduced at will in their environment.  As a result, I have no data,
just this brief description of the issue.  When I stumbled on an easy way
to recreate it, I thought it would be worth investigating/fixing.

It certainly does not seem to be a common scenario.
-- 
Mike Kravetz
