Re: [PATCH v2 0/9] Remove spin_unlock_wait()

From: Ingo Molnar <mingo@kernel.org>
Date: 2017-07-08 12:30:26
Also in: linux-arch, lkml, netfilter-devel

* Paul E. McKenney [off-list ref] wrote:

> On Sat, Jul 08, 2017 at 10:35:43AM +0200, Ingo Molnar wrote:
> >
> > * Manfred Spraul [off-list ref] wrote:
> >
> > > Hi Ingo,
> > >
> > > On 07/07/2017 10:31 AM, Ingo Molnar wrote:
> > > >
> > > > There's another, probably just as significant advantage: queued_spin_unlock_wait()
> > > > is 'read-only', while spin_lock()+spin_unlock() dirties the lock cache line. On
> > > > any bigger system this should make a very measurable difference - if
> > > > spin_unlock_wait() is ever used in a performance critical code path.
> > >
> > > At least for ipc/sem:
> > > Dirtying the cacheline (in the slow path) allows to remove a smp_mb() in the
> > > hot path.
> > > So for sem_lock(), I either need a primitive that dirties the cacheline or
> > > sem_lock() must continue to use spin_lock()/spin_unlock().
> >
> > Technically you could use spin_trylock()+spin_unlock() and avoid the lock acquire
> > spinning on spin_unlock() and get very close to the slow path performance of a
> > pure cacheline-dirtying behavior.
> >
> > But adding something like spin_barrier(), which purely dirties the lock cacheline,
> > would be even faster, right?
>
> Interestingly enough, the arm64 and powerpc implementations of
> spin_unlock_wait() were very close to what it sounds like you are
> describing.

So could we perhaps solve all our problems by defining the generic version thusly:

void spin_unlock_wait(spinlock_t *lock)
{
	if (spin_trylock(lock))
		spin_unlock(lock);
}

... and perhaps rename it to spin_barrier() [or whatever proper name there would 
be]?

Architectures can still optimize it, to remove the small window where the lock is 
held locally - as long as the ordering is at least as strong as the generic 
version.
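For readers who want to try the shape of this proposal outside the kernel, here is a minimal userspace sketch using C11 atomics. Everything prefixed with toy_ is a hypothetical stand-in invented for illustration, not a kernel API, and the acquire/release orderings are assumptions of the sketch rather than a statement of what the kernel primitives guarantee.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for spinlock_t; not the kernel type. */
typedef struct {
	atomic_flag locked;
} toy_spinlock_t;

#define TOY_SPINLOCK_INIT { ATOMIC_FLAG_INIT }

/* Try once to take the lock; returns true on success, never spins. */
static bool toy_spin_trylock(toy_spinlock_t *lock)
{
	/* test_and_set returns the previous value: false means we took it. */
	return !atomic_flag_test_and_set_explicit(&lock->locked,
						  memory_order_acquire);
}

/* Plain spinning acquire, shown only for contrast with the barrier. */
static void toy_spin_lock(toy_spinlock_t *lock)
{
	while (!toy_spin_trylock(lock))
		;
}

static void toy_spin_unlock(toy_spinlock_t *lock)
{
	atomic_flag_clear_explicit(&lock->locked, memory_order_release);
}

/*
 * Same shape as the generic version proposed above: one acquire attempt,
 * released immediately if it succeeds. It never spins.
 */
static void toy_spin_barrier(toy_spinlock_t *lock)
{
	if (toy_spin_trylock(lock))
		toy_spin_unlock(lock);
}
```

In this sketch the barrier writes to the lock word whether or not the attempt succeeds, so it dirties the cacheline either way. If another thread holds the lock, toy_spin_barrier() returns at once with the lock still held, which is unlike a toy_spin_lock()+toy_spin_unlock() pair that would wait for the holder to release.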

This would have various advantages:

 - semantics are well-defined

 - the generic implementation is already pretty well optimized (no spinning)

 - it would make it usable for the IPC performance optimization

 - architectures could still optimize it to eliminate the window where the lock is
   held locally - if such instructions are available.

Was this proposed before, or am I missing something?

Thanks,

	Ingo
