Re: [PATCH] kvm-pr: manage illegal instructions

From: Thomas Huth <hidden>
Date: 2016-05-18 18:39:54
Also in: kvm, lkml

On 18.05.2016 12:53, Thomas Huth wrote:

On 18.05.2016 12:18, Thomas Huth wrote:

quoted

On 17.05.2016 19:49, Laurent Vivier wrote:

quoted


On 17/05/2016 10:37, Alexander Graf wrote:

quoted

On 05/17/2016 10:35 AM, Laurent Vivier wrote:

quoted

On 12/05/2016 16:23, Laurent Vivier wrote:

quoted

On 12/05/2016 11:27, Alexander Graf wrote:

quoted

On 05/12/2016 11:10 AM, Laurent Vivier wrote:

quoted

On 11/05/2016 13:49, Alexander Graf wrote:

quoted

On 05/11/2016 01:14 PM, Laurent Vivier wrote:

quoted

On 11/05/2016 12:35, Alexander Graf wrote:

quoted

On 03/15/2016 09:18 PM, Laurent Vivier wrote:

quoted

While writing some instruction tests for kvm-unit-tests for
powerpc,
I've found that illegal instructions are not managed correctly
with
kvm-pr,
while it is fine with kvm-hv.

When an illegal instruction (like ".long 0") is processed by
kvm-pr,
the kernel logs are filled with:

         Couldn't emulate instruction 0x00000000 (op 0 xop 0)
         kvmppc_handle_exit_pr: emulation at 700 failed (00000000)

While the exception handler receives an interrupt for each
instruction
executed after the illegal instruction.

Signed-off-by: Laurent Vivier <redacted>
---
     arch/powerpc/kvm/book3s_emulate.c | 4 +++-
     1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/arch/powerpc/kvm/book3s_emulate.c

b/arch/powerpc/kvm/book3s_emulate.c
index 2afdb9c..4ee969d 100644

--- a/arch/powerpc/kvm/book3s_emulate.c
+++ b/arch/powerpc/kvm/book3s_emulate.c

@@ -99,7 +99,6 @@ int kvmppc_core_emulate_op_pr(struct kvm_run

*run,
struct kvm_vcpu *vcpu,
           switch (get_op(inst)) {
         case 0:
-        emulated = EMULATE_FAIL;
             if ((kvmppc_get_msr(vcpu) & MSR_LE) &&
                 (inst == swab32(inst_sc))) {
                 /*

@@ -112,6 +111,9 @@ int kvmppc_core_emulate_op_pr(struct kvm_run

*run,
struct kvm_vcpu *vcpu,
                 kvmppc_set_gpr(vcpu, 3, EV_UNIMPLEMENTED);
                 kvmppc_set_pc(vcpu, kvmppc_get_pc(vcpu) + 4);
                 emulated = EMULATE_DONE;
+        } else {
+            kvmppc_core_queue_program(vcpu, SRR1_PROGILL);

But isn't that exactly what the semantic of EMULATE_FAIL is?
Fixing it
up in book3s_emulate.c is definitely the wrong spot.

So what is the problem you're trying to solve? Is the SRR0 at the
wrong
spot or are the log messages the problem?

No, the problem is the host kernel logs are filled by the message
and
the execution hangs. And the host becomes unresponsiveness, even
after
the end of the tests.

Please, try to run kvm-unit-tests (the emulator test) on a KVM-PR
host,
and check the kernel logs (dmesg), then try to ssh to the host...

Ok, so the log messages are the problem. Please fix the message
output
then - or remove it altogether. Or if you like, create a module
parameter that allows you to emit them.

I personally think the best solution would be to just convert the
message into a trace point.

While at it, please see whether the guest can trigger similar host
log
output excess in other code paths.

The problem is not really with the log messages: they are
consequence of
the bug I try to fix.

What happens is once kvm_pr decodes an invalid instruction all the
valid
following instructions trigger a Program exception to the guest
(but are
executed correctly). It has no real consequence on big machine like
POWER8, except that the guest become very slow and the log files of
the
host are filled with messages (and qemu uses 100% of the CPU). On a
smaller machine like a  PowerMac G5, the machine becomes simply
unusable.

It's probably more related to your verbosity level of kernel messages.
If you pass loglevel=0 (or quiet) to you kernel cmdline you won't get
the messages printed to serial which is what's slowing you down.

The other problem sounds pretty severe, but the only thing your patch
does any different from the current code flow would be the patch below.
Or did I miss anything?

diff --git a/arch/powerpc/kvm/emulate.c b/arch/powerpc/kvm/emulate.c
index 5cc2e7a..4672bc2 100644
--- a/arch/powerpc/kvm/emulate.c
+++ b/arch/powerpc/kvm/emulate.c

@@ -302,7 +302,11 @@ int kvmppc_emulate_instruction(struct kvm_run

*run,
struct kvm_vcpu *vcpu)
                         advance = 0;
                         printk(KERN_ERR "Couldn't emulate instruction
0x%08x "
                                "(op %d xop %d)\n", inst, get_op(inst),
get_xop(inst));
+#ifdef CONFIG_PPC_BOOK3S
+                       kvmppc_core_queue_program(vcpu, SRR1_PROGILL);
+#else
                         kvmppc_core_queue_program(vcpu, 0);
+#endif
                 }
         }

Do you want I send an updated patch with your changes?

Well, you reported the issue and narrowed it down, so feel free to send
it under your name :). I merely simplified your patch a bit.

Well, while I was trying to update the patch, I've re-tested this... and
it fails. I don't know what I'm doing bad now or what I did bad before
but it seems it doesn't work. :(

Thomas, could try the patch from Alex?

The patch from Alex also does not work for me.

I've now had a closer look at the code, and I think the endless loop is
caused by the fact that we try to inject a program interrupt twice here.

In kvmppc_handle_exit_pr(), the code flow looks like this:

	case BOOK3S_INTERRUPT_H_EMUL_ASSIST:
	{
		...
		er = kvmppc_emulate_instruction(run, vcpu);
		switch (er) {
			...
		case EMULATE_FAIL:
			printk(KERN_CRIT "%s: emulation at %lx failed (%08x)\n",
			       __func__, kvmppc_get_pc(vcpu), last_inst);
			kvmppc_core_queue_program(vcpu, flags);
			printk(KERN_CRIT "%s: pc = %lx \n",
			       __func__, kvmppc_get_pc(vcpu));
			r = RESUME_GUEST;
			break;
		...
	}

But when you look at the end of kvmppc_emulate_instruction(), you
can see that the interrupt has also already been injected there:

	if (emulated == EMULATE_FAIL) {
		emulated = vcpu->kvm->arch.kvm_ops->emulate_op(run, vcpu, inst,
							       &advance);
		if (emulated == EMULATE_AGAIN) {
			advance = 0;
		} else if (emulated == EMULATE_FAIL) {
			advance = 0;
			printk(KERN_ERR "Couldn't emulate instruction 0x%08x "
			       "(op %d xop %d)\n", inst, get_op(inst), get_xop(inst));
			kvmppc_core_queue_program(vcpu, 0);
		}
	}

Injecting the program interrupt twice of course destroys the
return address in SRR0, causing this strange behavior.
If I comment out the kvmppc_core_queue_program() in
kvmppc_emulate_instruction(), the endless loop is gone.

I've now also checked the other callers of kvmppc_emulate_instruction(),
and all are already trying to inject an interrupt on their own, so
removing the kvmppc_core_queue_program() from
kvmppc_emulate_instruction() sounds like the right solution for me.
I'll submit a corresponding patch...

 Thomas

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help