Re: Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL

[PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Huacai Chen <hidden> · 2021-07-24
[PATCH RFC 2/2] qspinlock: Use ARCH_HAS_HW_XCHG_SMALL to select _Q_PENDING_BITS definition · Huacai Chen <hidden> · 2021-07-24
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Arnd Bergmann <arnd@arndb.de> · 2021-07-24
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Jiaxun Yang <jiaxun.yang@flygoat.com> · 2021-07-25
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Arnd Bergmann <arnd@arndb.de> · 2021-07-25
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Geert Uytterhoeven <geert@linux-m68k.org> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Huacai Chen <hidden> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Boqun Feng <hidden> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Guo Ren <guoren@kernel.org> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Boqun Feng <hidden> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Waiman Long <hidden> · 2021-07-26
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Guo Ren <guoren@kernel.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Boqun Feng <hidden> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Waiman Long <hidden> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Huacai Chen <hidden> · 2021-07-28
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-28
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Huacai Chen <hidden> · 2021-07-29
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Guo Ren <guoren@kernel.org> · 2021-07-27
Re: Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Wang Rui <hidden> · 2021-07-27
Re: Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Boqun Feng <hidden> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Peter Zijlstra <peterz@infradead.org> · 2021-07-27
Re: [PATCH RFC 1/2] arch: Introduce ARCH_HAS_HW_XCHG_SMALL · Arnd Bergmann <arnd@arndb.de> · 2021-07-26

From: Peter Zijlstra <peterz@infradead.org>
Date: 2021-07-27 11:03:24

On Tue, Jul 27, 2021 at 09:52:26AM +0800, Wang Rui wrote:

I think the forward progress are guaranteed while all operations are
atomic(ll/sc or amo). If ll/sc runs on a fast cpu, there will be
random delays, is that okay? Else, for such hardware, we can't even
implement generic spinlock with ll/sc.

And I also think that the hardware supports normal store for
unlocking. (e.g. arch_spin_unlock)

In qspinlock, when _Q_PENDING_BITS == 1, it's available for all
hardware, because the clear_pending/clear_pending_set_locked are all
atomic operations. Isn't it?

Q: Why live lock happens while _Q_PENDING_BITS == 8?
A: I found a case is:

* CPU A updates sub-word of qpsinlock at high frequency with normal store.
* CPU B do xchg_tail with load + cmpxchg, and the value of load is always not equal to the value of ll(cmpxchg).

qspinlock:
  0: locked
  1: pending
  2: tail

CPU A                    CPU B
1:                       1: &lt;--------------------+
  sh $newval, &amp;locked      lw  $v1, &amp;qspinlock   |
  add $newval, 1           and $t1, $v1, ~mask   |
  b 1b                     or  $t1, $t1, newval  | (live lock path)
                           ll  $v2, &amp;qspinlock   |
                           bne $v1, $v2, 1b -----+
                           sc  $t1, &amp;qspinlock
                           beq $t1, 0, 1b

If xchg_tail like this, at least there is no live lock on Loongson

xchg_tail:

1:
  ll  $v1, &amp;qspinlock
  and $t1, $v1, ~mask
  or  $t1, $t1, newval
  sc  $t1, &amp;qspinlock
  beq $t1, 0, 1b

For hardware that ll/sc is based on cache coherency, I think sc is
easy to succeed. The ll makes cache-line is exclusive by CPU B, and
the store of CPU A needs to acquire exclusive again, the sc may be
completed before this.

This! I've been saying this for ages. All those xchg16() implementations
are broken for using cmpxchg() on LL/SC. Not because xchg16() is
fundamentally flawed.

Perhaps we should introduce:

	atomic_nand_or() and atomic_fetch_nand_or()

and implement short xchg() using those, then we can have the whole masks
setup shared. It just means you get to implement those primitives for
*all* archs :-)

Also, the _Q_PENDING_BITS==1 case can use that primitive.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help