Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches

[PATCH v3 00/16] slab accounting for memcg · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 02/16] slub: use free_page instead of put_page for freeing kmalloc allocation · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 01/16] slab/slub: struct memcg_params · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 14/16] slub: slub-specific propagation changes. · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 03/16] slab: Ignore the cflgs bit in cache creation · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 03/16] slab: Ignore the cflgs bit in cache creation · Christoph Lameter <hidden> · 2012-09-18
Re: [PATCH v3 03/16] slab: Ignore the cflgs bit in cache creation · Tejun Heo <tj@kernel.org> · 2012-09-21
[PATCH v3 04/16] provide a common place for initcall processing in kmem_cache · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 04/16] provide a common place for initcall processing in kmem_cache · Christoph Lameter <hidden> · 2012-09-18
[PATCH v3 07/16] memcg: skip memcg kmem allocations in specified code regions · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 07/16] memcg: skip memcg kmem allocations in specified code regions · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 07/16] memcg: skip memcg kmem allocations in specified code regions · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 07/16] memcg: skip memcg kmem allocations in specified code regions · Tejun Heo <tj@kernel.org> · 2012-09-24
[PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Christoph <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Christoph Lameter <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Christoph Lameter <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Pekka Enberg <penberg@kernel.org> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Christoph Lameter <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Christoph Lameter <hidden> · 2012-09-24
Re: [PATCH v3 05/16] consider a memcg parameter in kmem_create_cache · Michal Hocko <hidden> · 2012-10-02
[PATCH v3 11/16] memcg: destroy memcg caches · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 11/16] memcg: destroy memcg caches · Tejun Heo <tj@kernel.org> · 2012-09-21
[PATCH v3 10/16] sl[au]b: Allocate objects from memcg cache · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 13/16] slab: slab-specific propagation changes. · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 13/16] slab: slab-specific propagation changes. · Christoph Lameter <hidden> · 2012-09-18
Re: [PATCH v3 13/16] slab: slab-specific propagation changes. · Glauber Costa <hidden> · 2012-09-19
[PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Christoph Lameter <hidden> · 2012-09-18
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Glauber Costa <hidden> · 2012-09-19
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · JoonSoo Kim <hidden> · 2012-09-21
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Glauber Costa <hidden> · 2012-09-21
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · JoonSoo Kim <hidden> · 2012-09-21
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Glauber Costa <hidden> · 2012-09-21
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 15/16] memcg/sl[au]b: shrink dead caches · Tejun Heo <tj@kernel.org> · 2012-09-24
[PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Tejun Heo <tj@kernel.org> · 2012-09-24
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Glauber Costa <hidden> · 2012-09-25
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Christoph Lameter <hidden> · 2012-09-25
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Glauber Costa <hidden> · 2012-09-24
Re: [PATCH v3 06/16] memcg: infrastructure to match an allocation to the right cache · Tejun Heo <tj@kernel.org> · 2012-09-24
[PATCH v3 12/16] memcg/sl[au]b Track all the memcg children of a kmem_cache. · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 12/16] memcg/sl[au]b Track all the memcg children of a kmem_cache. · Tejun Heo <tj@kernel.org> · 2012-09-21
[PATCH v3 08/16] slab: allow enable_cpu_cache to use preset values for its tunables · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 08/16] slab: allow enable_cpu_cache to use preset values for its tunables · Christoph Lameter <hidden> · 2012-09-18
Re: [PATCH v3 08/16] slab: allow enable_cpu_cache to use preset values for its tunables · Glauber Costa <hidden> · 2012-09-19
Re: [PATCH v3 08/16] slab: allow enable_cpu_cache to use preset values for its tunables · Pekka Enberg <penberg@kernel.org> · 2012-09-21
[PATCH v3 16/16] Add documentation about the kmem controller · Glauber Costa <hidden> · 2012-09-18
[PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Glauber Costa <hidden> · 2012-09-18
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Christoph Lameter <hidden> · 2012-09-18
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Glauber Costa <hidden> · 2012-09-19
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Christoph Lameter <hidden> · 2012-09-19
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Pekka Enberg <penberg@kernel.org> · 2012-09-21
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Glauber Costa <hidden> · 2012-09-21
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Pekka Enberg <penberg@kernel.org> · 2012-09-21
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Pekka Enberg <penberg@kernel.org> · 2012-09-21
Re: [PATCH v3 09/16] sl[au]b: always get the cache from its page in kfree · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 00/16] slab accounting for memcg · Pekka Enberg <penberg@kernel.org> · 2012-09-21
Re: [PATCH v3 00/16] slab accounting for memcg · Glauber Costa <hidden> · 2012-09-21
Re: [PATCH v3 00/16] slab accounting for memcg · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 00/16] slab accounting for memcg · Tejun Heo <tj@kernel.org> · 2012-09-21
Re: [PATCH v3 00/16] slab accounting for memcg · Glauber Costa <hidden> · 2012-09-24

From: Glauber Costa <hidden>
Date: 2012-09-21 09:35:25
Also in: cgroups, lkml

On 09/21/2012 01:28 PM, JoonSoo Kim wrote:

Hi, Glauber.

quoted

2012/9/18 Glauber Costa [off-list ref]:

quoted

diff --git a/mm/slub.c b/mm/slub.c
index 0b68d15..9d79216 100644
--- a/mm/slub.c
+++ b/mm/slub.c

@@ -2602,6 +2602,7 @@ redo:
        } else
                __slab_free(s, page, x, addr);

+       kmem_cache_verify_dead(s);
 }

As far as u know, I am not a expert and don't know anything about memcg.
IMHO, this implementation may hurt system performance in some case.

In case of memcg is destoried, remained kmem_cache is marked "dead".
After it is marked,
every free operation to this "dead" kmem_cache call
kmem_cache_verify_dead() and finally call kmem_cache_shrink().

As long as it is restricted to that cache, this is a non issue.
dead caches are exactly what they name imply: dead.

Means that we actively want them to go away, and just don't kill them
right away because they have some inflight objects - which we expect not
to be too much.

Hmm.. I don't think so.
We can destroy memcg whenever we want, is it right?

wrong.

it is impossible to track objects to a task, so when tasks are moved,
objects stay. Which means that some objects are still referenced by the
cache.

If it is right, there is many inflight objects when we destory memcg.
If there is so many inflight objects, performance of these processes
can be hurt too much.

There are in-flight objects. It is not expected to have "many inflight
objects", which is a different statement.

We are assuming all the time that the workloads that goes into a cgroup
are mostly nature. When it goes away, most of the objects that were
referenced are expected to go away. Because we can't guarantee all of
them will, the references stay.

When we are able to call shrink_slab() directly on a memcg slab, which
is WIP, we'll be able make those objects even rarer.

quoted

And, I found one case that destroying memcg's kmem_cache don't works properly.
If we destroy memcg after all object is freed, current implementation
doesn't destroy kmem_cache.
kmem_cache_destroy_work_func() check "cachep->memcg_params.nr_pages == 0",
but in this case, it return false, because kmem_cache may have
cpu_slab, and cpu_partials_slabs.
As we already free all objects, kmem_cache_verify_dead() is not invoked forever.
I think that we need another kmem_cache_shrink() in
kmem_cache_destroy_work_func().

I'll take a look here. What you describe makes sense, and can
potentially happen. I tried to handle this case with care in
destroy_all_caches, but I may have always made a mistake...

Did you see this actively happening, or are you just assuming this can
happen from your read of the code?

Just read of the code.

I will go through it again, just in case. This is indeed subtle and it
is in my best interest to sort out those issues early.


--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help