[PATCH v7 3/9] seccomp: introduce writer locking

[PATCH v7 0/9] seccomp: add thread sync ability · Kees Cook <hidden> · 2014-06-23
[PATCH v7 1/9] seccomp: create internal mode-setting function · Kees Cook <hidden> · 2014-06-23
[PATCH v7 8/9] ARM: add seccomp syscall · Kees Cook <hidden> · 2014-06-23
[PATCH v7 5/9] seccomp: split mode set routines · Kees Cook <hidden> · 2014-06-23
[PATCH v7 6/9] seccomp: add "seccomp" syscall · Kees Cook <hidden> · 2014-06-23
[PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Kees Cook <hidden> · 2014-06-23
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Andy Lutomirski <luto@amacapital.net> · 2014-06-24
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Andy Lutomirski <luto@amacapital.net> · 2014-06-24
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 4/9] seccomp: move no_new_privs into seccomp · Andy Lutomirski <luto@amacapital.net> · 2014-06-24
[PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Kees Cook <hidden> · 2014-06-23
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 7/9] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC · Kees Cook <hidden> · 2014-06-24
[PATCH v7 2/9] seccomp: split filter prep from check and apply · Kees Cook <hidden> · 2014-06-23
Re: [PATCH v7 2/9] seccomp: split filter prep from check and apply · David Drysdale <hidden> · 2014-06-26
Re: [PATCH v7 2/9] seccomp: split filter prep from check and apply · Kees Cook <hidden> · 2014-06-27
[PATCH v7 9/9] MIPS: add seccomp syscall · Kees Cook <hidden> · 2014-06-23
[PATCH v7 3/9] seccomp: introduce writer locking · Kees Cook <hidden> · 2014-06-23
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Oleg Nesterov <oleg@redhat.com> · 2014-06-24
Re: [PATCH v7 3/9] seccomp: introduce writer locking · Kees Cook <hidden> · 2014-06-24
[PATCH v7 1/1] man-pages: seccomp.2: document syscall · Kees Cook <hidden> · 2014-06-23
Re: [PATCH v7 1/1] man-pages: seccomp.2: document syscall · Michael Kerrisk (man-pages) <hidden> · 2014-06-24
Re: [PATCH v7 1/1] man-pages: seccomp.2: document syscall · Kees Cook <hidden> · 2014-06-24
[PATCH v7.1 1/1] man-pages: seccomp.2: document syscall · Kees Cook <hidden> · 2014-06-24
Re: [PATCH v7 1/1] man-pages: seccomp.2: document syscall · Andy Lutomirski <luto@amacapital.net> · 2014-06-24
Re: [PATCH v7 1/1] man-pages: seccomp.2: document syscall · Kees Cook <hidden> · 2014-06-24

From: Kees Cook <hidden>
Date: 2014-06-24 19:46:06
Also in: linux-api, linux-arch, linux-mips, lkml

On Tue, Jun 24, 2014 at 11:30 AM, Oleg Nesterov [off-list ref] wrote:

I am puzzled by the usage of smp_load_acquire(),

It was recommended by Andy Lutomirski in preference to ACCESS_ONCE().

On 06/23, Kees Cook wrote:

quoted

 static u32 seccomp_run_filters(int syscall)
 {
-     struct seccomp_filter *f;
+     struct seccomp_filter *f = smp_load_acquire(&current->seccomp.filter);
      struct seccomp_data sd;
      u32 ret = SECCOMP_RET_ALLOW;

      /* Ensure unexpected behavior doesn't result in failing open. */
-     if (WARN_ON(current->seccomp.filter == NULL))
+     if (WARN_ON(f == NULL))
              return SECCOMP_RET_KILL;

      populate_seccomp_data(&sd);

@@ -186,9 +186,8 @@ static u32 seccomp_run_filters(int syscall)
       * All filters in the list are evaluated and the lowest BPF return
       * value always takes priority (ignoring the DATA).
       */
-     for (f = current->seccomp.filter; f; f = f->prev) {
+     for (; f; f = smp_load_acquire(&f->prev)) {
              u32 cur_ret = SK_RUN_FILTER(f->prog, (void *)&sd);
-
              if ((cur_ret & SECCOMP_RET_ACTION) < (ret & SECCOMP_RET_ACTION))
                      ret = cur_ret;

OK, in this case the 1st one is probably fine, altgough it is not
clear to me why it is better than read_barrier_depends().

But why do we need a 2nd one inside the loop? And if we actually need
it (I don't think so) then why it is safe to use f->prog without
load_acquire ?

You're right -- it should not be possible for for any of the ->prev
pointers to change.

quoted

 void get_seccomp_filter(struct task_struct *tsk)
 {
-     struct seccomp_filter *orig = tsk->seccomp.filter;
+     struct seccomp_filter *orig = smp_load_acquire(&tsk->seccomp.filter);
      if (!orig)
              return;

This one looks unneeded.

First of all, afaics atomic_inc() should work correctly without any barriers,
otherwise it is buggy. But even this doesn't matter.

With this changes get_seccomp_filter() must be called under ->siglock, it can't
race with add-filter and thus tsk->seccomp.filter should be stable.

Excellent point, yes. I'll remove that.

quoted

      /* Reference count is bounded by the number of total processes. */

@@ -361,7 +364,7 @@ void put_seccomp_filter(struct task_struct *tsk)
      /* Clean up single-reference branches iteratively. */
      while (orig && atomic_dec_and_test(&orig->usage)) {
              struct seccomp_filter *freeme = orig;
-             orig = orig->prev;
+             orig = smp_load_acquire(&orig->prev);
              seccomp_filter_free(freeme);
      }

This one looks unneeded too. And note that this patch does not add
smp_load_acquire() to read tsk->seccomp.filter.

Hrm, yes, that should get added.

atomic_dec_and_test() adds mb(), we do not need more barriers to access
->prev ?

Right, same situation as the run_filters loop. Thanks!

-Kees

-- 
Kees Cook
Chrome OS Security

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help