Thread (59 messages) 59 messages, 5 authors, 2012-07-25

Re: [PATCH v4 00/25] kmem limitation for memcg

From: Glauber Costa <hidden>
Date: 2012-06-18 12:17:09
Also in: linux-mm, lkml

On 06/18/2012 04:10 PM, Kamezawa Hiroyuki wrote:
(2012/06/18 19:27), Glauber Costa wrote:
quoted
Hello All,

This is my new take for the memcg kmem accounting. This should merge
all of the previous comments from you guys, specially concerning the big churn
inside the allocators themselves.

My focus in this new round was to keep the changes in the cache internals to
a minimum. To do that, I relied upon two main pillars:

   * Cristoph's unification series, that allowed me to put must of the changes
     in a common file. Even then, the changes are not too many, since the overal
     level of invasiveness was decreased.
   * Accounting is done directly from the page allocator. This means some pages
     can fail to be accounted, but that can only happen when the task calling
     kmem_cache_alloc or kmalloc is not the same task allocating a new page.
     This never happens in steady state operation if the tasks are kept in the
     same memcg. Naturally, if the page ends up being accounted to a memcg that
     is not limited (such as root memcg), that particular page will simply not
     be accounted.

The dispatcher code stays (mem_cgroup_get_kmem_cache), being the mechanism who
guarantees that, during steady state operation, all objects allocated in a page
will belong to the same memcg. I consider this a good compromise point between
strict and loose accounting here.
2 questions.

   - Do you have performance numbers ?
Not extensive. I've run some microbenchmarks trying to determine the
effect of my code on kmem_cache_alloc, and found it to be in the order
of 2 to 3 %. I would expect that to vanish in a workload benchmark.
   - Do you think user-memory memcg should be switched to page-allocator level accounting ?
     (it will require some study for modifying current bached-freeing and per-cpu-stock
      logics...)
I don't see a reason for that. My main goal by doing that was to reduce
the churn in the cache internal structures, but specially because there
is at least two of them, obeying a stable interface. The way I
understand it, memcg for user pages is already pretty well integrated to
the page allocator, so the benefit of it is questionable.
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help