Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call

[RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 02/24] Restartable sequences: ARM 32 architecture support · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 06/24] Restartable sequences: powerpc architecture support · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 03/24] Restartable sequences: wire up ARM 32 system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 23/24] membarrier: selftest: Test private expedited sync core cmd · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH for 4.15 23/24] membarrier: selftest: Test private expedited sync core cmd · Shuah Khan <hidden> · 2017-11-17
Re: [RFC PATCH for 4.15 23/24] membarrier: selftest: Test private expedited sync core cmd · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-17
[RFC PATCH v2 for 4.15 21/24] x86: Introduce sync_core_before_usermode · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 18/24] membarrier: provide SHARED_EXPEDITED command · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH for 4.15 18/24] membarrier: provide SHARED_EXPEDITED command · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-15
[RFC PATCH for 4.15 19/24] membarrier: selftest: Test shared expedited cmd · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH for 4.15 19/24] membarrier: selftest: Test shared expedited cmd · Shuah Khan <hidden> · 2017-11-17
[RFC PATCH for 4.15 20/24] membarrier: Provide core serializing command · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v7 for 4.15 16/24] membarrier: powerpc: Skip memory barrier in switch_mm() · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v5 for 4.15 17/24] membarrier: Document scheduler barrier requirements · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v2 for 4.15 22/24] membarrier: x86: Provide core serializing command · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v2 for 4.15 13/24] Restartable sequences: Provide self-tests · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v2 for 4.15 15/24] membarrier: selftest: Test private expedited cmd · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v2 for 4.15 12/24] cpu_opv: Implement selftests · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 14/24] Restartable sequences selftests: arm: workaround gcc asm size guess · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 24/24] membarrier: arm64: Provide core serializing command · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 09/24] cpu_opv: Wire up x86 32/64 system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 11/24] cpu_opv: Wire up ARM32 system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-15
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Michael Kerrisk (man-pages) <hidden> · 2017-11-15
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-15
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-16
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Andi Kleen <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Andi Kleen <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Andi Kleen <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Andy Lutomirski <luto@amacapital.net> · 2017-11-18
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-17
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Andi Kleen <hidden> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Thomas Gleixner <hidden> · 2017-11-20
Re: [RFC PATCH v3 for 4.15 08/24] Provide cpu_opv system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-21
[RFC PATCH for 4.15 10/24] cpu_opv: Wire up powerpc system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 07/24] Restartable sequences: Wire up powerpc system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH for 4.15 04/24] Restartable sequences: x86 32/64 architecture support · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH for 4.15 04/24] Restartable sequences: x86 32/64 architecture support · Thomas Gleixner <hidden> · 2017-11-16
Re: [RFC PATCH for 4.15 04/24] Restartable sequences: x86 32/64 architecture support · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-19
Re: [RFC PATCH for 4.15 04/24] Restartable sequences: x86 32/64 architecture support · Thomas Gleixner <hidden> · 2017-11-20
[RFC PATCH for 4.15 05/24] Restartable sequences: wire up x86 32/64 system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
[RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Ben Maurer <hidden> · 2017-11-14
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Ben Maurer <hidden> · 2017-11-14
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Peter Zijlstra <peterz@infradead.org> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Peter Zijlstra <peterz@infradead.org> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Peter Zijlstra <peterz@infradead.org> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Thomas Gleixner <hidden> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Peter Zijlstra <peterz@infradead.org> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Peter Zijlstra <peterz@infradead.org> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Thomas Gleixner <hidden> · 2017-11-16
Re: [RFC PATCH v11 for 4.15 01/24] Restartable sequences system call · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-19
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Linus Torvalds <torvalds@linux-foundation.org> · 2017-11-14
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Andy Lutomirski <luto@amacapital.net> · 2017-11-14
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-14
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Andy Lutomirski <luto@amacapital.net> · 2017-11-15
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2017-11-15
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Paul Turner <hidden> · 2017-11-14
Re: [RFC PATCH for 4.15 00/24] Restartable sequences and CPU op vector v11 · Mathieu Desnoyers <mathieu.desnoyers@efficios.com> · 2018-03-27

From: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
Date: 2017-11-20 18:38:13
Also in: lkml

----- On Nov 20, 2017, at 12:48 PM, Thomas Gleixner tglx@linutronix.de wrote:

On Mon, 20 Nov 2017, Mathieu Desnoyers wrote:

quoted

----- On Nov 16, 2017, at 6:26 PM, Thomas Gleixner tglx@linutronix.de wrote:

quoted

+#define NR_PINNED_PAGES_ON_STACK	8

8 pinned pages on stack? Which stack?

The common cases need to touch few pages, and we can keep the
pointers in an array on the kernel stack within the cpu_opv system
call.

Updating to:

/*
 * Typical invocation of cpu_opv need few pages. Keep struct page
 * pointers in an array on the stack of the cpu_opv system call up to
 * this limit, beyond which the array is dynamically allocated.
 */
#define NR_PIN_PAGES_ON_STACK        8

That name still sucks. NR_PAGE_PTRS_ON_STACK would be immediately obvious.

fixed.

quoted

+ * The operations available are: comparison, memcpy, add, or, and, xor,
+ * left shift, and right shift. The system call receives a CPU number
+ * from user-space as argument, which is the CPU on which those
+ * operations need to be performed. All preparation steps such as
+ * loading pointers, and applying offsets to arrays, need to be
+ * performed by user-space before invoking the system call. The

loading pointers and applying offsets? That makes no sense.

Updating to:

 * All preparation steps such as
 * loading base pointers, and adding offsets derived from the current
 * CPU number, need to be performed by user-space before invoking the
 * system call.

This still does not explain anything, really.

Which base pointer is loaded?  I nowhere see a reference to a base
pointer.

And what are the offsets about?

derived from current cpu number? What is current CPU number? The one on
which the task executes now or the one which it should execute on?

I assume what you want to say is:

 All pointers in the ops must have been set up to point to the per CPU
 memory of the CPU on which the operations should be executed.

At least that's what I oracle in to that.

Exactly that. Will update to use this description instead.

quoted

+ * "comparison" operation can be used to check that the data used in the
+ * preparation step did not change between preparation of system call
+ * inputs and operation execution within the preempt-off critical
+ * section.
+ *
+ * The reason why we require all pointer offsets to be calculated by
+ * user-space beforehand is because we need to use get_user_pages_fast()
+ * to first pin all pages touched by each operation. This takes care of

That doesnt explain it either.

What kind of explication are you looking for here ? Perhaps being too close
to the implementation prevents me from understanding what is unclear from
your perspective.

What the heck are pointer offsets?

The ops have one or two pointer(s) to a lump of memory. So if a pointer
points to the wrong lump of memory then you're screwed, but that's true for
all pointers handed to the kernel.

I think the sentence you suggested above is clear enough. I'll simply use
it.

quoted

Sorry, that paragraph was unclear. Updated:

 * An overall maximum of 4216 bytes in enforced on the sum of operation
 * length within an operation vector, so user-space cannot generate a
 * too long preempt-off critical section (cache cold critical section
 * duration measured as 4.7µs on x86-64). Each operation is also limited
 * a length of PAGE_SIZE bytes,

Again PAGE_SIZE is the wrong unit here. PAGE_SIZE can vary. What you want
is a hard limit of 4K. And because there is no alignment requiremnt the
rest of the sentence is stating the obvious.

I can make that a 4K limit if you prefer. This presumes that no architecture
has pages smaller than 4K, which is true on Linux.

quoted

 * meaning that an operation can touch a
 * maximum of 4 pages (memcpy: 2 pages for source, 2 pages for
 * destination if addresses are not aligned on page boundaries).

I still have to understand why the 4K copy is necessary in the first place.

quoted

What's the critical section duration for operations which go to the limits
of this on a average x86 64 machine?

When cache-cold, I measure 4.7 µs per critical section doing a
4k memcpy and 15 * 8 bytes memcpy on a E5-2630 v3 @2.4GHz. Is it an
acceptable preempt-off latency for RT ?

Depends on the use case as always ....

The use-case for 4k memcpy operation is a per-cpu ring buffer where
the rseq fast-path does the following:

- ring buffer push: in the rseq asm instruction sequence, a memcpy of a
  given structure (limited to 4k in size) into a ring buffer,
  followed by the final commit instruction which increments the current
  position offset by the number of bytes pushed.

- ring buffer pop: in the rseq asm instruction sequence, a memcpy of
  a given structure (up to 4k) from the ring buffer, at "position" offset.
  The final commit instruction decrements the current position offset by
  the number of bytes pop'd.

Having cpu_opv do a 4k memcpy allow it to handle scenarios where
rseq fails to progress.

Thanks,

Mathieu

Thanks,

	tglx

-- 
Mathieu Desnoyers
EfficiOS Inc.
http://www.efficios.com

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help