Re: [PATCH v2 0/9] Remove spin_unlock_wait()

[PATCH RFC 0/26] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-06-29
[PATCH RFC 17/26] metag: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 04/26] completion: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 06/26] ipc: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 06/26] ipc: Replace spin_unlock_wait() with lock/unlock pair · Manfred Spraul <hidden> · 2017-07-01
Re: [PATCH RFC 06/26] ipc: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-02
[PATCH RFC 11/26] arm: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Linus Torvalds <torvalds@linux-foundation.org> · 2017-06-30
Re: [PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Linus Torvalds <torvalds@linux-foundation.org> · 2017-06-30
Re: [PATCH RFC 25/26] tile: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Will Deacon <hidden> · 2017-06-30
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Will Deacon <hidden> · 2017-06-30
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Will Deacon <hidden> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Linus Torvalds <torvalds@linux-foundation.org> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Will Deacon <hidden> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Linus Torvalds <torvalds@linux-foundation.org> · 2017-07-03
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-04
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-04
Re: [PATCH RFC 08/26] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-03
[PATCH RFC 09/26] alpha: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 26/26] xtensa: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 13/26] blackfin: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 20/26] parisc: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 01/26] netfilter: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 22/26] s390: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 14/26] hexagon: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 24/26] sparc: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 23/26] sh: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 18/26] mips: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 21/26] powerpc: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 21/26] powerpc: Remove spin_unlock_wait() arch-specific definitions · Boqun Feng <hidden> · 2017-07-02
Re: [PATCH RFC 21/26] powerpc: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-07-05
[PATCH RFC 19/26] mn10300: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 07/26] drivers/ata: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 16/26] m32r: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 15/26] ia64: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 12/26] arm64: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 12/26] arm64: Remove spin_unlock_wait() arch-specific definitions · Will Deacon <hidden> · 2017-06-30
Re: [PATCH RFC 12/26] arm64: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Oleg Nesterov <oleg@redhat.com> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Oleg Nesterov <oleg@redhat.com> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Oleg Nesterov <oleg@redhat.com> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Alan Stern <stern@rowland.harvard.edu> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 02/26] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 03/26] sched: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
Re: [PATCH RFC 03/26] sched: Replace spin_unlock_wait() with lock/unlock pair · Arnd Bergmann <arnd@arndb.de> · 2017-06-30
Re: [PATCH RFC 03/26] sched: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 10/26] arc: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-06-30
[PATCH RFC 05/26] exit: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-06-30
[PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 2/9] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 7/9] drivers/ata: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 5/9] exit: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 4/9] completion: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 3/9] sched: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 1/9] net/netfilter/nf_conntrack_core: Fix net_conntrack_lock() · Paul E. McKenney <hidden> · 2017-07-05
Re: [PATCH v2 1/9] net/netfilter/nf_conntrack_core: Fix net_conntrack_lock() · Manfred Spraul <hidden> · 2017-07-06
Re: [PATCH v2 1/9] net/netfilter/nf_conntrack_core: Fix net_conntrack_lock() · Paul E. McKenney <hidden> · 2017-07-06
[PATCH v2 8/9] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 6/9] ipc: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-05
[PATCH v2 9/9] arch: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-07-05
RE: [PATCH v2 0/9] Remove spin_unlock_wait() · David Laight <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Alan Stern <stern@rowland.harvard.edu> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Alan Stern <stern@rowland.harvard.edu> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Will Deacon <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-06
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Peter Zijlstra <peterz@infradead.org> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Manfred Spraul <hidden> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Alan Stern <stern@rowland.harvard.edu> · 2017-07-08
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Manfred Spraul <hidden> · 2017-07-10
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-07
Re: [PATCH v2 0/9] Remove spin_unlock_wait() · Ingo Molnar <mingo@kernel.org> · 2017-07-07
[PATCH v3 0/9] Remove spin_unlock_wait() · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 3/9] sched: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 1/9] net/netfilter/nf_conntrack_core: Fix net_conntrack_lock() · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 4/9] completion: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 9/9] arch: Remove spin_unlock_wait() arch-specific definitions · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 6/9] ipc: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 7/9] drivers/ata: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 2/9] task_work: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 8/9] locking: Remove spin_unlock_wait() generic definitions · Paul E. McKenney <hidden> · 2017-07-07
[PATCH v3 5/9] exit: Replace spin_unlock_wait() with lock/unlock pair · Paul E. McKenney <hidden> · 2017-07-07

From: Manfred Spraul <hidden>
Date: 2017-07-10 17:22:28
Also in: linux-arch, lkml, netfilter-devel

Hi Alan,

On 07/08/2017 06:21 PM, Alan Stern wrote:

Pardon me for barging in, but I found this whole interchange extremely
confusing...

On Sat, 8 Jul 2017, Ingo Molnar wrote:

quoted

* Paul E. McKenney [off-list ref] wrote:

quoted

On Sat, Jul 08, 2017 at 10:35:43AM +0200, Ingo Molnar wrote:

quoted

* Manfred Spraul [off-list ref] wrote:

quoted

Hi Ingo,

On 07/07/2017 10:31 AM, Ingo Molnar wrote:

quoted

There's another, probably just as significant advantage: queued_spin_unlock_wait()
is 'read-only', while spin_lock()+spin_unlock() dirties the lock cache line. On
any bigger system this should make a very measurable difference - if
spin_unlock_wait() is ever used in a performance critical code path.

At least for ipc/sem:
Dirtying the cacheline (in the slow path) allows to remove a smp_mb() in the
hot path.
So for sem_lock(), I either need a primitive that dirties the cacheline or
sem_lock() must continue to use spin_lock()/spin_unlock().

This statement doesn't seem to make sense.  Did Manfred mean to write
"smp_mb()" instead of "spin_lock()/spin_unlock()"?

Option 1:
     fastpath:
         spin_lock(local_lock)
         smp_mb(); [[1]]
         smp_load_acquire(global_flag);
     slow path:
         global_flag = 1;
         smp_mb();
         <spin_unlock_wait_without_cacheline_dirtying>

Option 2:
     fastpath:
         spin_lock(local_lock);
         smp_load_acquire(global_flag)
     slow path:
         global_flag = 1;
         spin_lock(local_lock);spin_unlock(local_lock).

Rational:
The ACQUIRE from spin_lock is at the read of local_lock, not at the write.
i.e.: Without the smp_mb() at [[1]], the CPU can do:
         read local_lock;
         read global_flag;
         write local_lock;
For Option 2, the smp_mb() is not required, because fast path and slow 
path acquire the same lock.

quoted

Technically you could use spin_trylock()+spin_unlock() and avoid the lock acquire
spinning on spin_unlock() and get very close to the slow path performance of a
pure cacheline-dirtying behavior.

This is even more confusing.  Did Ingo mean to suggest using
"spin_trylock()+spin_unlock()" in place of "spin_lock()+spin_unlock()"
could provide the desired ordering guarantee without delaying other
CPUs that may try to acquire the lock?  That seems highly questionable.

I agree :-)

--
     Manfred

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help