Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure

[PATCH 00/14] mm: memcontrol: account socket memory in unified hierarchy · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
[PATCH 01/14] mm: memcontrol: export root_mem_cgroup · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 01/14] mm: memcontrol: export root_mem_cgroup · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 01/14] mm: memcontrol: export root_mem_cgroup · Vladimir Davydov <hidden> · 2015-11-14
[PATCH 03/14] net: tcp_memcontrol: properly detect ancestor socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 03/14] net: tcp_memcontrol: properly detect ancestor socket pressure · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 03/14] net: tcp_memcontrol: properly detect ancestor socket pressure · Vladimir Davydov <hidden> · 2015-11-14
Re: [PATCH 03/14] net: tcp_memcontrol: properly detect ancestor socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-14
[PATCH 05/14] net: tcp_memcontrol: protect all tcp_memcontrol calls by jump-label · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 05/14] net: tcp_memcontrol: protect all tcp_memcontrol calls by jump-label · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 05/14] net: tcp_memcontrol: protect all tcp_memcontrol calls by jump-label · Vladimir Davydov <hidden> · 2015-11-14
Re: [PATCH 05/14] net: tcp_memcontrol: protect all tcp_memcontrol calls by jump-label · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-16
[PATCH 07/14] net: tcp_memcontrol: simplify the per-memcg limit access · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 07/14] net: tcp_memcontrol: simplify the per-memcg limit access · Vladimir Davydov <hidden> · 2015-11-20
[PATCH 10/14] mm: memcontrol: generalize the socket accounting jump label · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 10/14] mm: memcontrol: generalize the socket accounting jump label · Michal Hocko <mhocko@kernel.org> · 2015-11-13
Re: [PATCH 10/14] mm: memcontrol: generalize the socket accounting jump label · Vladimir Davydov <hidden> · 2015-11-14
[PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Vladimir Davydov <hidden> · 2015-11-20
Re: [PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-20
Re: [PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Vladimir Davydov <hidden> · 2015-11-23
Re: [PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-23
Re: [PATCH 09/14] net: tcp_memcontrol: simplify linkage between socket and page counter · Vladimir Davydov <hidden> · 2015-11-24
[PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Michal Hocko <mhocko@kernel.org> · 2015-11-16
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-16
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Michal Hocko <mhocko@kernel.org> · 2015-11-18
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-18
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Michal Hocko <mhocko@kernel.org> · 2015-11-19
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-19
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Vladimir Davydov <hidden> · 2015-11-20
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-20
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Vladimir Davydov <hidden> · 2015-11-23
Re: [PATCH 13/14] mm: memcontrol: account socket memory in unified hierarchy memory controller · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-23
[PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Vladimir Davydov <hidden> · 2015-11-15
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-16
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Vladimir Davydov <hidden> · 2015-11-17
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-17
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Vladimir Davydov <hidden> · 2015-11-18
Re: [PATCH 14/14] mm: memcontrol: hook up vmpressure to socket pressure · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-18
[PATCH 12/14] mm: memcontrol: move socket code for unified hierarchy accounting · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 12/14] mm: memcontrol: move socket code for unified hierarchy accounting · Vladimir Davydov <hidden> · 2015-11-20
[PATCH 11/14] mm: memcontrol: do not account memory+swap on unified hierarchy · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 11/14] mm: memcontrol: do not account memory+swap on unified hierarchy · Michal Hocko <mhocko@kernel.org> · 2015-11-13
Re: [PATCH 11/14] mm: memcontrol: do not account memory+swap on unified hierarchy · Vladimir Davydov <hidden> · 2015-11-14
[PATCH 08/14] net: tcp_memcontrol: sanitize tcp memory accounting callbacks · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 08/14] net: tcp_memcontrol: sanitize tcp memory accounting callbacks · Eric Dumazet <hidden> · 2015-11-13
Re: [PATCH 08/14] net: tcp_memcontrol: sanitize tcp memory accounting callbacks · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-13
Re: [PATCH 08/14] net: tcp_memcontrol: sanitize tcp memory accounting callbacks · Vladimir Davydov <hidden> · 2015-11-20
Re: [PATCH 08/14] net: tcp_memcontrol: sanitize tcp memory accounting callbacks · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-20
[PATCH 06/14] net: tcp_memcontrol: remove dead per-memcg count of allocated sockets · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 06/14] net: tcp_memcontrol: remove dead per-memcg count of allocated sockets · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 06/14] net: tcp_memcontrol: remove dead per-memcg count of allocated sockets · Vladimir Davydov <hidden> · 2015-11-20
[PATCH 04/14] net: tcp_memcontrol: remove bogus hierarchy pressure propagation · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 04/14] net: tcp_memcontrol: remove bogus hierarchy pressure propagation · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 04/14] net: tcp_memcontrol: remove bogus hierarchy pressure propagation · Vladimir Davydov <hidden> · 2015-11-20
[PATCH 02/14] mm: vmscan: simplify memcg vs. global shrinker invocation · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-12
Re: [PATCH 02/14] mm: vmscan: simplify memcg vs. global shrinker invocation · David Miller <davem@davemloft.net> · 2015-11-13
Re: [PATCH 02/14] mm: vmscan: simplify memcg vs. global shrinker invocation · Vladimir Davydov <hidden> · 2015-11-14
Re: [PATCH 02/14] mm: vmscan: simplify memcg vs. global shrinker invocation · Johannes Weiner <hannes@cmpxchg.org> · 2015-11-14

From: Vladimir Davydov <hidden>
Date: 2015-11-17 20:19:12
Also in: cgroups, linux-mm, lkml

On Mon, Nov 16, 2015 at 01:53:16PM -0500, Johannes Weiner wrote:

On Sun, Nov 15, 2015 at 04:54:57PM +0300, Vladimir Davydov wrote:

quoted

On Thu, Nov 12, 2015 at 06:41:33PM -0500, Johannes Weiner wrote:

quoted

Let the networking stack know when a memcg is under reclaim pressure
so that it can clamp its transmit windows accordingly.

Whenever the reclaim efficiency of a cgroup's LRU lists drops low
enough for a MEDIUM or HIGH vmpressure event to occur, assert a
pressure state in the socket and tcp memory code that tells it to curb
consumption growth from sockets associated with said control group.

vmpressure events are naturally edge triggered, so for hysteresis
assert socket pressure for a second to allow for subsequent vmpressure
events to occur before letting the socket code return to normal.

AFAICS, in contrast to v1, now you don't modify vmpressure behavior,
which means socket_pressure will only be set when cgroup hits its
high/hard limit. On tightly packed system, this is unlikely IMO -
cgroups will mostly experience pressure due to memory shortage at the
global level and/or their low limit configuration, in which case no
vmpressure events will be triggered and therefore tcp window won't be
clamped accordingly.

Yeah, this is an inherent problem in the vmpressure design and it
makes the feature significantly less useful than it could be IMO.

AFAIK vmpressure was designed to allow userspace to tune hard limits of
cgroups in accordance with their demands, in which case the way how
vmpressure notifications work makes sense.

But you guys were wary about the patch that changed it, and this

Changing vmpressure semantics as you proposed in v1 would result in
userspace getting notifications even if cgroup does not hit its limit.
May be it could be useful to someone (e.g. it could help tuning
memory.low), but I am pretty sure this would also result in breakages
for others.

series has kicked up enough dust already, so I backed it out.

But this will still be useful. Yes, it won't help in rebalancing an
regularly working system, which would be cool, but it'll still help
contain a worklad that is growing beyond expectations, which is the
scenario that kickstarted this work.

I haven't looked through all the previous patches in the series, but
AFAIU they should do the trick, no? Notifying sockets about vmpressure
is rather needed to protect a workload from itself. And with this patch
it will work this way, but only if sum limits < total ram, which is
rather rare in practice. On tightly packed systems it does nothing.

That said, I don't think we should commit this particular patch. Neither
do I think socket accounting should be enabled by default in the unified
hierarchy for now, since the implementation is still incomplete. IMHO.

Thanks,
Vladimir

quoted

May be, we could use a per memcg slab shrinker to detect memory
pressure? This looks like abusing shrinkers API though.

Actually, I thought about doing this long-term.

Shrinkers are a nice way to export VM pressure to auxiliary allocators
and caches. But currently, the only metric we export is LRU scan rate,
whose application is limited to ageable caches: it doesn't make sense
to cause auxiliary workingsets to shrink when the VM is merely picking
up the drop-behind pages of a one-off page cache stream. I think it
would make sense for shrinkers to include reclaim efficiency so that
they can be used by caches that don't have 'accessed' bits and object
rotation, but are able to shrink based on the cost they're imposing.

But a change like this is beyond the scope of this series, IMO.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help