[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put

[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 01/41] KVM: arm/arm64: Avoid vcpu_load for other vcpu ioctls than KVM_RUN · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 01/41] KVM: arm/arm64: Avoid vcpu_load for other vcpu ioctls than KVM_RUN · Julien Grall <hidden> · 2018-02-05
[PATCH v3 02/41] KVM: arm/arm64: Move vcpu_load call after kvm_vcpu_first_run_init · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 02/41] KVM: arm/arm64: Move vcpu_load call after kvm_vcpu_first_run_init · Julien Grall <hidden> · 2018-02-05
[PATCH v3 03/41] KVM: arm64: Avoid storing the vcpu pointer on the stack · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 03/41] KVM: arm64: Avoid storing the vcpu pointer on the stack · Julien Grall <hidden> · 2018-02-05
[PATCH v3 04/41] KVM: arm64: Rework hyp_panic for VHE and non-VHE · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 04/41] KVM: arm64: Rework hyp_panic for VHE and non-VHE · Julien Grall <hidden> · 2018-02-05
[PATCH v3 04/41] KVM: arm64: Rework hyp_panic for VHE and non-VHE · Julien Grall <hidden> · 2018-02-05
[PATCH v3 04/41] KVM: arm64: Rework hyp_panic for VHE and non-VHE · Christoffer Dall <hidden> · 2018-02-08
[PATCH v3 04/41] KVM: arm64: Rework hyp_panic for VHE and non-VHE · Julien Grall <hidden> · 2018-02-09
[PATCH v3 05/41] KVM: arm64: Move HCR_INT_OVERRIDE to default HCR_EL2 guest flag · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 05/41] KVM: arm64: Move HCR_INT_OVERRIDE to default HCR_EL2 guest flag · Julien Grall <hidden> · 2018-02-09
[PATCH v3 05/41] KVM: arm64: Move HCR_INT_OVERRIDE to default HCR_EL2 guest flag · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 06/41] KVM: arm/arm64: Get rid of vcpu->arch.irq_lines · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 07/41] KVM: arm/arm64: Add kvm_vcpu_load_sysregs and kvm_vcpu_put_sysregs · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 08/41] KVM: arm/arm64: Introduce vcpu_el1_is_32bit · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 08/41] KVM: arm/arm64: Introduce vcpu_el1_is_32bit · Julien Thierry <hidden> · 2018-01-17
[PATCH v3 08/41] KVM: arm/arm64: Introduce vcpu_el1_is_32bit · Christoffer Dall <hidden> · 2018-01-18
[PATCH v3 08/41] KVM: arm/arm64: Introduce vcpu_el1_is_32bit · Julien Grall <hidden> · 2018-02-09
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-01-22
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-01-25
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-02-07
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-02-07
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-02-09
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-02-13
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-02-14
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-02-14
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-02-14
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Ard Biesheuvel <hidden> · 2018-02-14
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Marc Zyngier <hidden> · 2018-02-14
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Dave.Martin@arm.com (Dave Martin) · 2018-02-15
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Julien Grall <hidden> · 2018-02-09
[PATCH v3 09/41] KVM: arm64: Defer restoring host VFP state to vcpu_put · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 10/41] KVM: arm64: Move debug dirty flag calculation out of world switch · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 10/41] KVM: arm64: Move debug dirty flag calculation out of world switch · Julien Thierry <hidden> · 2018-01-17
[PATCH v3 11/41] KVM: arm64: Slightly improve debug save/restore functions · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 12/41] KVM: arm64: Improve debug register save/restore flow · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 13/41] KVM: arm64: Factor out fault info population and gic workarounds · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 13/41] KVM: arm64: Factor out fault info population and gic workarounds · Julien Thierry <hidden> · 2018-01-17
[PATCH v3 14/41] KVM: arm64: Introduce VHE-specific kvm_vcpu_run · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 14/41] KVM: arm64: Introduce VHE-specific kvm_vcpu_run · Dave.Martin@arm.com (Dave Martin) · 2018-01-24
[PATCH v3 14/41] KVM: arm64: Introduce VHE-specific kvm_vcpu_run · Christoffer Dall <hidden> · 2018-01-25
[PATCH v3 14/41] KVM: arm64: Introduce VHE-specific kvm_vcpu_run · Julien Grall <hidden> · 2018-02-09
[PATCH v3 14/41] KVM: arm64: Introduce VHE-specific kvm_vcpu_run · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 15/41] KVM: arm64: Remove kern_hyp_va() use in VHE switch function · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 15/41] KVM: arm64: Remove kern_hyp_va() use in VHE switch function · Dave.Martin@arm.com (Dave Martin) · 2018-01-24
[PATCH v3 15/41] KVM: arm64: Remove kern_hyp_va() use in VHE switch function · Christoffer Dall <hidden> · 2018-01-25
[PATCH v3 16/41] KVM: arm64: Don't deactivate VM on VHE systems · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 17/41] KVM: arm64: Remove noop calls to timer save/restore from VHE switch · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 17/41] KVM: arm64: Remove noop calls to timer save/restore from VHE switch · Julien Grall <hidden> · 2018-02-09
[PATCH v3 17/41] KVM: arm64: Remove noop calls to timer save/restore from VHE switch · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 17/41] KVM: arm64: Remove noop calls to timer save/restore from VHE switch · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 17/41] KVM: arm64: Remove noop calls to timer save/restore from VHE switch · Julien Grall <hidden> · 2018-02-19
[PATCH v3 18/41] KVM: arm64: Move userspace system registers into separate function · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 18/41] KVM: arm64: Move userspace system registers into separate function · Julien Grall <hidden> · 2018-02-09
[PATCH v3 18/41] KVM: arm64: Move userspace system registers into separate function · Christoffer Dall <hidden> · 2018-02-14
[PATCH v3 19/41] KVM: arm64: Rewrite sysreg alternatives to static keys · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 20/41] KVM: arm64: Introduce separate VHE/non-VHE sysreg save/restore functions · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 21/41] KVM: arm/arm64: Remove leftover comment from kvm_vcpu_run_vhe · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 22/41] KVM: arm64: Unify non-VHE host/guest sysreg save and restore functions · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 23/41] KVM: arm64: Don't save the host ELR_EL2 and SPSR_EL2 on VHE systems · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 24/41] KVM: arm64: Change 32-bit handling of VM system registers · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 25/41] KVM: arm64: Rewrite system register accessors to read/write functions · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Julien Thierry <hidden> · 2018-01-17
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Christoffer Dall <hidden> · 2018-01-18
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Julien Thierry <hidden> · 2018-01-18
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Dave.Martin@arm.com (Dave Martin) · 2018-01-23
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Christoffer Dall <hidden> · 2018-01-25
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Dave.Martin@arm.com (Dave Martin) · 2018-02-09
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Christoffer Dall <hidden> · 2018-02-13
[PATCH v3 26/41] KVM: arm64: Introduce framework for accessing deferred sysregs · Dave.Martin@arm.com (Dave Martin) · 2018-02-13
[PATCH v3 27/41] KVM: arm/arm64: Prepare to handle deferred save/restore of SPSR_EL1 · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 28/41] KVM: arm64: Prepare to handle deferred save/restore of ELR_EL1 · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 29/41] KVM: arm64: Defer saving/restoring 64-bit sysregs to vcpu load/put on VHE · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 30/41] KVM: arm64: Prepare to handle deferred save/restore of 32-bit registers · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 30/41] KVM: arm64: Prepare to handle deferred save/restore of 32-bit registers · Julien Thierry <hidden> · 2018-01-17
[PATCH v3 30/41] KVM: arm64: Prepare to handle deferred save/restore of 32-bit registers · Christoffer Dall <hidden> · 2018-01-18
[PATCH v3 31/41] KVM: arm64: Defer saving/restoring 32-bit sysregs to vcpu load/put · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 32/41] KVM: arm64: Move common VHE/non-VHE trap config in separate functions · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 33/41] KVM: arm64: Configure FPSIMD traps on vcpu load/put · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 33/41] KVM: arm64: Configure FPSIMD traps on vcpu load/put · Julien Thierry <hidden> · 2018-01-18
[PATCH v3 33/41] KVM: arm64: Configure FPSIMD traps on vcpu load/put · Tomasz Nowicki <hidden> · 2018-01-31
[PATCH v3 33/41] KVM: arm64: Configure FPSIMD traps on vcpu load/put · Christoffer Dall <hidden> · 2018-02-05
[PATCH v3 33/41] KVM: arm64: Configure FPSIMD traps on vcpu load/put · Tomasz Nowicki <hidden> · 2018-01-31
[PATCH v3 34/41] KVM: arm64: Configure c15, PMU, and debug register traps on cpu load/put for VHE · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 35/41] KVM: arm64: Separate activate_traps and deactive_traps for VHE and non-VHE · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 36/41] KVM: arm/arm64: Get rid of vgic_elrsr · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 37/41] KVM: arm/arm64: Handle VGICv2 save/restore from the main VGIC code · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 38/41] KVM: arm/arm64: Move arm64-only vgic-v2-sr.c file to arm64 · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 39/41] KVM: arm/arm64: Handle VGICv3 save/restore from the main VGIC code on VHE · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 40/41] KVM: arm/arm64: Move VGIC APR save/restore to vgic put/load · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 41/41] KVM: arm/arm64: Avoid VGICv3 save/restore on VHE with no IRQs · Christoffer Dall <hidden> · 2018-01-12
[PATCH v3 41/41] KVM: arm/arm64: Avoid VGICv3 save/restore on VHE with no IRQs · Tomasz Nowicki <hidden> · 2018-02-05
[PATCH v3 41/41] KVM: arm/arm64: Avoid VGICv3 save/restore on VHE with no IRQs · Christoffer Dall <hidden> · 2018-02-08
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Yury Norov <hidden> · 2018-01-15
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-01-15
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Yury Norov <hidden> · 2018-01-17
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-01-17
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-01-18
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Yury Norov <hidden> · 2018-01-18
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-01-18
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Tomasz Nowicki <hidden> · 2018-01-22
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Tomasz Nowicki <hidden> · 2018-02-01
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Yury Norov <hidden> · 2018-02-01
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Tomasz Nowicki <hidden> · 2018-02-02
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Tomasz Nowicki <hidden> · 2018-02-02
[PATCH v3 00/41] Optimize KVM/ARM for VHE systems · Christoffer Dall <hidden> · 2018-02-08

STALE3029d

From: Christoffer Dall <hidden>
Date: 2018-01-25 19:46:53
Also in: kvm, kvmarm

On Mon, Jan 22, 2018 at 05:33:28PM +0000, Dave Martin wrote:

On Fri, Jan 12, 2018 at 01:07:15PM +0100, Christoffer Dall wrote:

quoted

Avoid saving the guest VFP registers and restoring the host VFP
registers on every exit from the VM.  Only when we're about to run
userspace or other threads in the kernel do we really have to switch the
state back to the host state.

We still initially configure the VFP registers to trap when entering the
VM, but the difference is that we now leave the guest state in the
hardware registers as long as we're running this VCPU, even if we
occasionally trap to the host, and we only restore the host state when
we return to user space or when scheduling another thread.

Reviewed-by: Andrew Jones <redacted>
Reviewed-by: Marc Zyngier <redacted>
Signed-off-by: Christoffer Dall <redacted>

[...]

quoted

diff --git a/arch/arm64/kvm/hyp/sysreg-sr.c b/arch/arm64/kvm/hyp/sysreg-sr.c
index 883a6383cd36..848a46eb33bf 100644
--- a/arch/arm64/kvm/hyp/sysreg-sr.c
+++ b/arch/arm64/kvm/hyp/sysreg-sr.c

[...]

quoted

@@ -213,6 +215,19 @@ void kvm_vcpu_load_sysregs(struct kvm_vcpu *vcpu)
  */
 void kvm_vcpu_put_sysregs(struct kvm_vcpu *vcpu)
 {
+	struct kvm_cpu_context *host_ctxt = vcpu->arch.host_cpu_context;
+	struct kvm_cpu_context *guest_ctxt = &vcpu->arch.ctxt;
+
+	/* Restore host FP/SIMD state */
+	if (vcpu->arch.guest_vfp_loaded) {
+		if (vcpu_el1_is_32bit(vcpu)) {
+			kvm_call_hyp(__fpsimd32_save_state,
+				     kern_hyp_va(guest_ctxt));
+		}
+		__fpsimd_save_state(&guest_ctxt->gp_regs.fp_regs);
+		__fpsimd_restore_state(&host_ctxt->gp_regs.fp_regs);
+		vcpu->arch.guest_vfp_loaded = 0;

Provided we've already marked the host FPSIMD state as dirty on the way
in, we probably don't need to restore it here.

In v4.15, the kvm_fpsimd_flush_cpu_state() call in
kvm_arch_vcpu_ioctl_run() is supposed to do this marking: currently
it's only done for SVE, since KVM was previously restoring the host
FPSIMD subset of the state anyway, but it could be made unconditional.

For a returning run ioctl, this would have the effect of deferring the
host FPSIMD reload until we return to userspace, which is probably
no more costly since the kernel must check whether to do this in
ret_to_user anyway; OTOH if the vcpu thread was preempted by some
other thread we save the cost of restoring the host state entirely here
... I think.

Yes, I agree.  However, currently the low-level logic in
arch/arm64/kvm/hyp/entry.S:__fpsimd_guest_restore which saves the host
state into vcpu->arch.host_cpu_context->gp_regs.fp_regs (where
host_cpu_context is a KVM-specific per-cpu variable).  I think means
that simply marking the state as invalid would cause the kernel to
restore some potentially stale values when returning to userspace.  Am I
missing something?

It might very well be possible to change the logic so that we store the
host logic the same place where task_fpsimd_save() would have, and I
think that would make what you suggest possible.

I'd like to make that a separate change from this patch though, as we're
already changing quite a bit with this series, so I'm trying to make any
logical change as contained per patch as possible, so that problems can
be spotted by bisecting.

Ultimately I'd like to go one better and actually treat a vcpu as a
first-class fpsimd context, so that taking an interrupt to the host
and then reentering the guest doesn't cause any reload at all.

That should be the case already; kvm_vcpu_put_sysregs() is only called
when you run another thread (preemptively or voluntarily), or when you
return to user space, but making the vcpu fpsimd context a first-class
citizen fpsimd context would mean that you can run another thread (and
maybe run userspace if it doesn't use fpsimd?) without having to
save/restore anything.  Am I getting this right?

But
that feels like too big a step for this series, and there are likely
side-issues I've not thought about yet.

It should definitely be in separate patches, but I would be optn to
tagging something on to the end of this series if we can stabilize this
series early after -rc1 is out.

Thanks,
-Christoffer

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help