[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put

From: Dave.Martin@arm.com (Dave Martin)
Date: 2018-02-15 09:51:26
Also in: kvm, kvmarm

On Wed, Feb 14, 2018 at 06:38:11PM +0100, Christoffer Dall wrote:

On Wed, Feb 14, 2018 at 02:43:42PM +0000, Dave Martin wrote:

[CC Ard, in case he has a view on how much we care about softirq NEON
performance regressions ... and whether my suggestions make sense]

On Wed, Feb 14, 2018 at 11:15:54AM +0100, Christoffer Dall wrote:

On Tue, Feb 13, 2018 at 02:08:47PM +0000, Dave Martin wrote:

On Tue, Feb 13, 2018 at 09:51:30AM +0100, Christoffer Dall wrote:

On Fri, Feb 09, 2018 at 03:59:30PM +0000, Dave Martin wrote:

[...]

kvm_fpsimd_flush_cpu_state() is just an invalidation.  No state is
actually saved today because we explicitly don't care about preserving
the SVE state, because the syscall ABI throws the SVE regs away as
a side effect of any syscall including ioctl(KVM_RUN); also (currently) KVM
ensures that the non-SVE FPSIMD bits _are_ restored by itself.

I think my proposal is that this hook might take on the role of
actually saving the state too, if we move that out of the KVM host
context save/restore code.

Perhaps we could even replace

	preempt_disable();
	kvm_fpsimd_flush_cpu_state();
	/* ... */
	preempt_enable();

with

	kernel_neon_begin();
	/* ... */
	kernel_neon_end();

I'm not entirely sure where the begin and end points would be in the
context of KVM?

Hmmm, actually there's a bug in your VHE changes now I look more
closely in this area:

You assume that the only way for the FPSIMD regs to get unexpectedly
dirtied is through a context switch, but actually this is not the case:
a softirq can use kernel-mode NEON any time that softirqs are enabled.

This means that in between kvm_arch_vcpu_load() and _put() (whether via
preempt notification or not), the guest's FPSIMD state in the regs may
be trashed by a softirq.

ouch.

The simplest fix is to disable softirqs and preemption for that whole
region, but since we can stay in it indefinitely that's obviously not
the right approach.  Putting kernel_neon_begin() in _load() and
kernel_neon_end() in _put() achieves the same without disabling
softirq, but preemption is still disabled throughout, which is bad.
This effectively makes the run ioctl nonpreemptible...

A better fix would be to set the cpu's kernel_neon_busy flag, which
makes softirq code use non-NEON fallback code.

We could expose an interface from fpsimd.c to support that.

It still comes at a cost though: due to the switching from NEON to
fallback code in softirq handlers, we may get a big performance
regression in setups that rely heavily on NEON in softirq for
performance.
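[Editor's sketch, not part of the original thread.] The fallback behaviour described above can be illustrated with a small user-space model. In the kernel, the real check is may_use_simd(), bracketed by kernel_neon_begin()/kernel_neon_end(); the model below mirrors only the busy-flag part of that check. All names starting with model_ are hypothetical, and a plain bool stands in for the per-cpu kernel_neon_busy flag.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the per-cpu kernel_neon_busy flag. */
static bool neon_busy;

enum model_path { MODEL_PATH_NEON, MODEL_PATH_FALLBACK };

/* Models only the flag part of the check softirq code makes before using NEON. */
static bool model_may_use_simd(void)
{
	return !neon_busy;
}

/* A softirq-style worker: take the NEON path only when the regs are free. */
static enum model_path model_softirq_work(void)
{
	if (model_may_use_simd()) {
		neon_busy = true;	/* what kernel_neon_begin() would do */
		/* ... NEON-accelerated work would go here ... */
		neon_busy = false;	/* what kernel_neon_end() would do */
		return MODEL_PATH_NEON;
	}

	/* ... slower scalar fallback would go here ... */
	return MODEL_PATH_FALLBACK;
}
```

With the flag held by someone else (KVM, in the proposal above), every softirq that would normally use NEON takes the fallback path, which is where the performance concern comes from.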

I wasn't aware that softirqs would use fpsimd.

Alternatively we could do something like the following, but it's a
rather gross abstraction violation:

diff --git a/virt/kvm/arm/arm.c b/virt/kvm/arm/arm.c
index 2e43f9d..6a1ff3a 100644
--- a/virt/kvm/arm/arm.c
+++ b/virt/kvm/arm/arm.c

@@ -746,9 +746,24 @@ int kvm_arch_vcpu_ioctl_run(struct kvm_vcpu *vcpu, struct kvm_run *run)
 		 * the effect of taking the interrupt again, in SVC
 		 * mode this time.
 		 */
+		local_bh_disable();
 		local_irq_enable();
 
 		/*
+		 * If we exited due to one or more pending interrupts, they
+		 * have now been handled.  If such an interrupt pended a
+		 * softirq, we shouldn't prevent that softirq from using
+		 * kernel-mode NEON indefinitely: instead, give FPSIMD back to
+		 * the host to manage as it likes.  We'll grab it again on the
+		 * next FPSIMD trap from the guest (if any).
+		 */
+		if (local_softirq_pending() && FPSIMD untrapped for guest) {
+			/* save vcpu FPSIMD context */
+			/* enable FPSIMD trap for guest */
+		}
+		local_bh_enable();
+
+		/*
 		 * We do local_irq_enable() before calling guest_exit() so
 		 * that if a timer interrupt hits while running the guest we
 		 * account that tick as being spent in the guest.  We enable

[...]

I can't see this working, what if an IRQ comes in and a softirq gets
pending immediately after local_bh_enable() above?

Sorry, I missed a crucial bit of information here.

For context: here's the remainder of my argument.  This is not a
recommendation...


--8<--

We can inhibit softirqs from trashing the FPSIMD regs by setting the
per-cpu kernel_neon_busy flag: that forces softirq code to use
non-NEON fallback code without actually disabling softirq.

I'd come up with a local hack

 * kernel_neon_grab();

	to set the flag, which would happen in vcpu_load().

 * kernel_neon_ungrab();

	to clear the flag, which would happen as above and in
	vcpu_put().

It would be up to the caller to ensure that preemption cannot occur
between those calls (satisfied by use of a preempt notifier here), and
to save the host context when needed.

This would bound the kernel-mode NEON blackout to the time KVM spends
in the host kernel only: the above conditional relinquishing of the
FPSIMD regs ensures that a softirq trigger event occurring during the
(unbounded) guest execution time _does_ get to use NEON.

-->8--
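
[Editor's sketch, not part of the original thread.] The grab/ungrab lifecycle in the snipped text can be modelled in user space as follows. This is only an illustration under stated assumptions: every model_ function is hypothetical, plain bools stand in for the per-cpu flag and for "guest FPSIMD state is live in the regs", and re-grabbing on the next guest FPSIMD trap is inferred from the comment in the diff earlier in this mail rather than stated as part of the hack.

```c
#include <assert.h>
#include <stdbool.h>

static bool cpu_neon_busy;	/* stand-in for per-cpu kernel_neon_busy */
static bool guest_fpsimd_live;	/* guest FPSIMD state is in the regs */

/* Hypothetical kernel_neon_grab(): block kernel-mode NEON in softirq. */
static void model_neon_grab(void)
{
	assert(!cpu_neon_busy);	/* caller must not already own the regs */
	cpu_neon_busy = true;
}

/* Hypothetical kernel_neon_ungrab(): let softirqs use NEON again. */
static void model_neon_ungrab(void)
{
	cpu_neon_busy = false;
}

static void model_vcpu_load(void)
{
	model_neon_grab();
}

/* Guest touched FPSIMD: re-grab if we had relinquished, then load its state. */
static void model_guest_fpsimd_trap(void)
{
	if (!cpu_neon_busy)
		model_neon_grab();
	guest_fpsimd_live = true;
}

/* After a guest exit: hand FPSIMD back to the host if a softirq is pending. */
static void model_guest_exit(bool softirq_pending)
{
	if (softirq_pending && guest_fpsimd_live) {
		/* save vcpu FPSIMD context, re-enable the guest FPSIMD trap */
		guest_fpsimd_live = false;
		model_neon_ungrab();
	}
}

static void model_vcpu_put(void)
{
	guest_fpsimd_live = false;
	model_neon_ungrab();
}

/* What a softirq would see when deciding between NEON and fallback code. */
static bool model_softirq_may_use_neon(void)
{
	return !cpu_neon_busy;
}
```

The model makes the bounded-blackout claim visible: NEON is unavailable to softirqs only between a grab and the next relinquish or vcpu_put(), not for the whole time the vcpu is loaded.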

And as you say, it's really not pretty.

Agreed!

This is really making me think that I'll drop this part of the
optimization and when we do optimize fpsimd handling, we do it properly
by integrating it with the kernel tracking.

Since I will be hacking at this area as part of the SVE KVM support
anyway, I will sooner or later end up working on it -- at that point it
will likely be worth unifying the two mechanisms, at least for the VHE
case (SVE architecturally requires v8.2, so VHE can be assumed in that
case).

It would be interesting to know what the numbers look like without
the FPSIMD optimisation.

Cheers
---Dave
