Re: [PATCH v3 00/28] kmem limitation for memcg

[PATCH v3 00/28] kmem limitation for memcg · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 04/28] memcg: Make it possible to use the stock for more than one page. · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 05/28] memcg: Reclaim when more than one page needed. · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 05/28] memcg: Reclaim when more than one page needed. · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 05/28] memcg: Reclaim when more than one page needed. · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 05/28] memcg: Reclaim when more than one page needed. · Glauber Costa <hidden> · 2012-05-29
[PATCH v3 07/28] memcg: change defines to an enum · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 02/28] memcg: Always free struct memcg through schedule_work() · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 09/28] kmem slab accounting basic infrastructure · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 08/28] res_counter: don't force return value checking in res_counter_charge_nofail · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 01/28] slab: move FULL state transition to an initcall · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 03/28] slab: rename gfpflags to allocflags · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 24/28] memcg: Per-memcg memory.kmem.slabinfo file. · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 06/28] slab: use obj_size field of struct kmem_cache when not debugging · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 12/28] slab: pass memcg parameter to kmem_cache_create · Frederic Weisbecker <hidden> · 2012-05-30
[PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 13/28] slub: create duplicate cache · Tejun Heo <tj@kernel.org> · 2012-05-30
Re: [Devel] Re: [PATCH v3 13/28] slub: create duplicate cache · James Bottomley <hidden> · 2012-05-30
Re: [PATCH v3 13/28] slub: create duplicate cache · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 13/28] slub: create duplicate cache · Tejun Heo <tj@kernel.org> · 2012-05-30
Re: [PATCH v3 13/28] slub: create duplicate cache · Christoph Lameter <hidden> · 2012-05-30
Re: [PATCH v3 13/28] slub: create duplicate cache · Suleiman Souhlal <hidden> · 2012-05-29
[PATCH v3 15/28] slub: always get the cache from its page in kfree · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 15/28] slub: always get the cache from its page in kfree · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 15/28] slub: always get the cache from its page in kfree · Glauber Costa <hidden> · 2012-05-29
[PATCH v3 11/28] slub: consider a memcg parameter in kmem_create_cache · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 17/28] skip memcg kmem allocations in specified code regions · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 20/28] memcg: disable kmem code when not in use. · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 21/28] memcg: destroy memcg caches · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 22/28] memcg/slub: shrink dead caches · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 25/28] slub: create slabinfo file for memcg · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 26/28] slub: track all children of a kmem cache · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 27/28] memcg: propagate kmem limiting information to children · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 28/28] Documentation: add documentation for slab tracker for memcg · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 23/28] slab: Track all the memcg children of a kmem_cache. · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 19/28] slab: per-memcg accounting of slab caches · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 19/28] slab: per-memcg accounting of slab caches · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 19/28] slab: per-memcg accounting of slab caches · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 19/28] slab: per-memcg accounting of slab caches · Glauber Costa <hidden> · 2012-05-29
[PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Frederic Weisbecker <hidden> · 2012-05-30
Re: [PATCH v3 16/28] memcg: kmem controller charge/uncharge infrastructure · Glauber Costa <hidden> · 2012-05-30
[PATCH v3 18/28] slub: charge allocation to a memcg · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 18/28] slub: charge allocation to a memcg · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 18/28] slub: charge allocation to a memcg · Glauber Costa <hidden> · 2012-05-29
[PATCH v3 14/28] slab: create duplicate cache · Glauber Costa <hidden> · 2012-05-25
[PATCH v3 10/28] slab/slub: struct memcg_params · Glauber Costa <hidden> · 2012-05-25
Re: [PATCH v3 00/28] kmem limitation for memcg · Michal Hocko <hidden> · 2012-05-25
Re: [PATCH v3 00/28] kmem limitation for memcg · Christoph Lameter <hidden> · 2012-05-25
Re: [PATCH v3 00/28] kmem limitation for memcg · Glauber Costa <hidden> · 2012-05-28
Re: [PATCH v3 00/28] kmem limitation for memcg · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 00/28] kmem limitation for memcg · Glauber Costa <hidden> · 2012-05-29
Re: [PATCH v3 00/28] kmem limitation for memcg · Christoph Lameter <hidden> · 2012-05-29
Re: [PATCH v3 00/28] kmem limitation for memcg · Frederic Weisbecker <hidden> · 2012-06-07
Re: [PATCH v3 00/28] kmem limitation for memcg · Glauber Costa <hidden> · 2012-06-07
Re: [PATCH v3 00/28] kmem limitation for memcg · Frederic Weisbecker <hidden> · 2012-06-07
Re: [PATCH v3 00/28] kmem limitation for memcg · Kamezawa Hiroyuki <hidden> · 2012-06-14

From: Glauber Costa <hidden>
Date: 2012-05-29 15:47:15
Also in: linux-mm, lkml

On 05/29/2012 07:07 PM, Christoph Lameter wrote:

On Mon, 28 May 2012, Glauber Costa wrote:

quoted

It would be best to merge these with my patchset to extract common code
from the allocators. The modifications of individual slab allocators would
then be not necessary anymore and it would save us a lot of work.

Some of them would not, some of them would still be. But also please note that
the patches here that deal with differences between allocators are usually the
low hanging fruits compared to the rest.

I agree that long term it not only better, but inevitable, if we are going to
merge both.

But right now, I think we should agree with the implementation itself - so if
you have any comments on how I am handling these, I'd be happy to hear. Then
we can probably set up a tree that does both, or get your patches merged and
I'll rebase, etc.

Just looked over the patchset and its quite intrusive.

Thank you very much, Christoph, appreciate it.

I have never been
fond of cgroups (IMHO hardware needs to be partitioned at physical
boundaries) so I have not too much insight into what is going on in that
area.

There is certainly a big market for that, and certainly a big market for 
what we're doing as well. So there are users interested in Containers 
technology, and I don't really see it as "partitioning it here" vs 
"partitioning there". It's just different.

Moreover, not everyone doing cgroups are doing containers. Some people 
are isolating a service, or a paticular job.

I agree it is an intrusive change, but it used to be even more. I did my 
best to diminish its large spread.

The idea to just duplicate the caches leads to some weird stuff like the
refcounting and the recovery of the arguments used during slab creation.

The refcounting is only needed so we are sure the parent cache won't go 
away without the child caches going away. I can try to find a better way 
to do that, specifically.

I think it may be simplest to only account for the pages used by a slab in
a memcg. That code could be added to the functions in the slab allocators
that interface with the page allocators. Those are not that performance
critical and would do not much harm.

No, I don't think so. Well, accounting the page is easy, but when we do 
a new allocation, we need to match a process to its correspondent page. 
This will likely lead to flushing the internal cpu caches of the slub, 
for instance, hurting performance. That is because once we allocate a 
page, all objects on that page need to belong to the same cgroup.

Also, you talk about intrusiveness, accounting pages is a lot more 
intrusive, since then you need to know a lot about the internal 
structure of each cache. Having the cache replicated has exactly the 
effect of isolating it better.

I of course agree this is no walk in the park, but accounting something 
that is internal to the cache, and that each cache will use and organize 
in its own private way, doesn't make it any better.

If you need per object accounting then the cleanest solution would be to
duplicate the per node arrays per memcg (or only the statistics) and have
the kmem_cache structure only once in memory.

No, it's all per-page. Nothing here is per-object, maybe you 
misunderstood something?

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help