Thread (22 messages) 22 messages, 4 authors, 2016-01-14

Re: [PATCH v2] mm,oom: Exclude TIF_MEMDIE processes from candidates.

From: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Date: 2016-01-14 22:21:24
Also in: lkml

David Rientjes wrote:
On Thu, 14 Jan 2016, Tetsuo Handa wrote:
quoted
quoted
quoted
quoted
quoted
@@ -171,7 +195,7 @@ unsigned long oom_badness(struct task_struct *p, struct mem_cgroup *memcg,
 	if (oom_unkillable_task(p, memcg, nodemask))
 		return 0;

-	p = find_lock_task_mm(p);
+	p = find_lock_non_victim_task_mm(p);
 	if (!p)
 		return 0;
I understand how this may make your test case pass, but I simply don't 
understand how this could possibly be the correct thing to do.  This would 
cause oom_badness() to return 0 for any process where a thread has 
TIF_MEMDIE set.  If the oom killer is called from the page allocator, 
kills a thread, and it is recalled before that thread may exit, then this 
will panic the system if there are no other eligible processes to kill.
Why? oom_badness() is called after oom_scan_process_thread() returned OOM_SCAN_OK.
oom_scan_process_thread() returns OOM_SCAN_ABORT if a thread has TIF_MEMDIE set.
oom_scan_process_thread() checks for TIF_MEMDIE on p, not on p's threads.
If one of p's threads has TIF_MEMDIE set and p does not, we actually want 
to set TIF_MEMDIE for p.  That's the current behavior since it will lead 
to p->mm memory freeing.  Your patch is excluding such processes entirely 
and selecting another process to kill unnecessarily.
I think p's threads are checked by oom_scan_process_thread() for TIF_MEMDIE
even if p does not have TIF_MEMDIE. What am I misunderstanding about what
for_each_process_thread(g, p) is doing?

  #define for_each_process_thread(p, t) for_each_process(p) for_each_thread(p, t)

  select_bad_process() {
    for_each_process_thread(g, p) {
      oom_scan_process_thread(oc, p, totalpages));
      oom_badness(p);
    }
  }
Yes, select_bad_process() iterates over threads, that is obvious.  The 
point is that today it can select a thread independent of whether any of 
its other threads have TIF_MEMDIE set, which is the desired behavior per 
the above.  With your change, that is no longer possible because we 
disregard _all_ threads if one of them has TIF_MEMDIE set.
I still cannot understand. Today select_bad_process() can select a thread
independent of whether any of its other threads have TIF_MEMDIE set. But
select_bad_process() after all ignores that thread selected by oom_badness()
logic and aborts the iterate loop as soon as oom_scan_process_thread() finds
a TIF_MEMDIE thread from all threads. Changing oom_badness() logic to skip
processes with TIF_MEMDIE threads does not change the task select_bad_process()
logic will finally return.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help