Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS

[PATCH 00/11] qspinlock with paravirt support · Peter Zijlstra <hidden> · 2014-06-15
[PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Waiman Long <hidden> · 2014-06-18
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
Re: [PATCH 04/11] qspinlock: Extract out the exchange of tail code word · Peter Zijlstra <peterz@infradead.org> · 2014-06-24
[PATCH 03/11] qspinlock: Add pending bit · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 03/11] qspinlock: Add pending bit · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 03/11] qspinlock: Add pending bit · Waiman Long <hidden> · 2014-06-17
Re: [PATCH 03/11] qspinlock: Add pending bit · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 03/11] qspinlock: Add pending bit · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 03/11] qspinlock: Add pending bit · Waiman Long <hidden> · 2014-06-17
Re: [PATCH 03/11] qspinlock: Add pending bit · Peter Zijlstra <peterz@infradead.org> · 2014-06-24
Re: [PATCH 03/11] qspinlock: Add pending bit · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 03/11] qspinlock: Add pending bit · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
Re: [PATCH 03/11] qspinlock: Add pending bit · Peter Zijlstra <peterz@infradead.org> · 2014-06-23
[PATCH 06/11] qspinlock: Optimize pending bit · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 06/11] qspinlock: Optimize pending bit · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
[PATCH 02/11] qspinlock, x86: Enable x86-64 to use queue spinlock · Peter Zijlstra <hidden> · 2014-06-15
[PATCH 09/11] pvqspinlock, x86: Rename paravirt_ticketlocks_enabled · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 09/11] pvqspinlock, x86: Rename paravirt_ticketlocks_enabled · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
[PATCH 11/11] qspinlock, kvm: Add paravirt support · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 11/11] qspinlock, kvm: Add paravirt support · Raghavendra K T <hidden> · 2014-06-22
Re: [PATCH 11/11] qspinlock, kvm: Add paravirt support · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
[PATCH 10/11] qspinlock: Paravirt support · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 10/11] qspinlock: Paravirt support · Waiman Long <hidden> · 2014-06-16
Re: [PATCH 10/11] qspinlock: Paravirt support · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 10/11] qspinlock: Paravirt support · Waiman Long <hidden> · 2014-06-18
Re: [PATCH 10/11] qspinlock: Paravirt support · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 10/11] qspinlock: Paravirt support · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 10/11] qspinlock: Paravirt support · Waiman Long <hidden> · 2014-06-17
Re: [PATCH 10/11] qspinlock: Paravirt support · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 10/11] qspinlock: Paravirt support · Konrad Rzeszutek Wilk <hidden> · 2014-06-20
Re: [PATCH 10/11] qspinlock: Paravirt support · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 10/11] qspinlock: Paravirt support · Konrad Rzeszutek Wilk <hidden> · 2014-07-15
[PATCH 08/11] qspinlock: Revert to test-and-set on hypervisors · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 08/11] qspinlock: Revert to test-and-set on hypervisors · Waiman Long <hidden> · 2014-06-16
Re: [PATCH 08/11] qspinlock: Revert to test-and-set on hypervisors · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
[PATCH 07/11] qspinlock: Use a simple write to grab the lock, if applicable · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 07/11] qspinlock: Use a simple write to grab the lock, if applicable · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
Re: [PATCH 07/11] qspinlock: Use a simple write to grab the lock, if applicable · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
[PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-16
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Peter Zijlstra <peterz@infradead.org> · 2014-06-23
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-27
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Peter Zijlstra <peterz@infradead.org> · 2014-06-23
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-27
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-17
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Peter Zijlstra <peterz@infradead.org> · 2014-06-23
Re: [PATCH 01/11] qspinlock: A simple generic 4-byte queue spinlock · Konrad Rzeszutek Wilk <hidden> · 2014-06-27
[PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Peter Zijlstra <hidden> · 2014-06-15
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Paolo Bonzini <pbonzini@redhat.com> · 2014-06-18
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Paolo Bonzini <pbonzini@redhat.com> · 2014-07-07
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Paolo Bonzini <pbonzini@redhat.com> · 2014-07-07
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Konrad Rzeszutek Wilk <hidden> · 2014-06-18
Re: [PATCH 05/11] qspinlock: Optimize for smaller NR_CPUS · Peter Zijlstra <peterz@infradead.org> · 2014-07-07
Re: [PATCH 00/11] qspinlock with paravirt support · Konrad Rzeszutek Wilk <hidden> · 2014-06-16

From: Peter Zijlstra <peterz@infradead.org>
Date: 2014-07-07 14:34:28
Also in: kvm, lkml, virtualization

On Wed, Jun 18, 2014 at 11:57:30AM -0400, Konrad Rzeszutek Wilk wrote:

On Sun, Jun 15, 2014 at 02:47:02PM +0200, Peter Zijlstra wrote:

quoted

From: Peter Zijlstra <peterz@infradead.org>

When we allow for a max NR_CPUS < 2^14 we can optimize the pending
wait-acquire and the xchg_tail() operations.

By growing the pending bit to a byte, we reduce the tail to 16bit.
This means we can use xchg16 for the tail part and do away with all
the repeated compxchg() operations.

This in turn allows us to unconditionally acquire; the locked state
as observed by the wait loops cannot change. And because both locked
and pending are now a full byte we can use simple stores for the
state transition, obviating one atomic operation entirely.

I have to ask - how much more performance do you get from this?

Is this extra atomic operation hurting that much?

Its not extra, its a cmpxchg loop vs an unconditional xchg.

And yes, its somewhat tedious to show, but on 4 socket systems you can
really see it make a difference. I'll try and run some numbers, I need
to reinstall the box.

(there were numbers in the previous threads, but you're right, I
should've put some in the Changelog).

quoted

 /**
  * queue_spin_lock_slowpath - acquire the queue spinlock

@@ -173,8 +259,13 @@ void queue_spin_lock_slowpath(struct qsp
 	 * we're pending, wait for the owner to go away.
 	 *
 	 * *,1,1 -> *,1,0
+	 *
+	 * this wait loop must be a load-acquire such that we match the
+	 * store-release that clears the locked bit and create lock
+	 * sequentiality; this because not all clear_pending_set_locked()
+	 * implementations imply full barriers.
 	 */
-	while ((val = atomic_read(&lock->val)) & _Q_LOCKED_MASK)
+	while ((val = smp_load_acquire(&lock->val.counter)) & _Q_LOCKED_MASK)

lock->val.counter? Ugh, all to deal with the 'int' -> 'u32' (or 'u64')

No, to do atomic_t -> int.

Could you introduce a macro in atomic.h called 'atomic_read_raw' which
would do the this? Like this:

That would be worse I think. It looks like a function returning an
rvalue whereas we really want an lvalue.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help