Thread (9 messages) 9 messages, 4 authors, 2024-11-15

Re: [PATCH] powerpc: Add preempt lazy support

From: Ankur Arora <hidden>
Date: 2024-11-15 23:37:10

Shrikanth Hegde [off-list ref] writes:
On 11/9/24 22:24, Shrikanth Hegde wrote:
quoted
On 11/9/24 00:36, Ankur Arora wrote:
quoted
Shrikanth Hegde [off-list ref] writes:
quoted
Define preempt lazy bit for Powerpc. Use bit 9 which is free and within
16 bit range of NEED_RESCHED, so compiler can issue single andi.

Since Powerpc doesn't use the generic entry/exit, add lazy check at exit
to user. CONFIG_PREEMPTION is defined for lazy/full/rt so use it for
return to kernel.

Ran a few benchmarks and db workload on Power10. Performance is close to
preempt=none/voluntary. It is possible that some patterns would
differ in lazy[2]. More details of preempt lazy is here [1]

Since Powerpc system can have large core count and large memory,
preempt lazy is going to be helpful in avoiding soft lockup issues.

[1]: https://lore.kernel.org/ lkml/20241007074609.447006177@infradead.org/
[2]: https://lore.kernel.org/all/1a973dda-c79e-4d95-935b-
e4b93eb077b8@linux.ibm.com/

Signed-off-by: Shrikanth Hegde <redacted>
Looks good. Reviewed-by: [off-list ref]
Thank you Ankur for taking a look and rwb tag.
quoted
However, I just checked and powerpc does not have
CONFIG_KVM_XFER_TO_GUEST_WORK. Do you need this additional patch
for handling the lazy bit at KVM guest entry?
It doesn't use the generic kvm entry/exit either AFAIK. I need to understand
more of this kvm maze. There are quite a lot of combinations.
The generic kvm entry/exit is gated by CONFIG_KVM_XFER_TO_GUEST_WORK.
quoted
will take a look. Thanks for the pointers.
quoted
diff --git a/arch/powerpc/kvm/powerpc.c b/arch/powerpc/kvm/powerpc.c
index f14329989e9a..7bdf7015bb65 100644
--- a/arch/powerpc/kvm/powerpc.c
+++ b/arch/powerpc/kvm/powerpc.c
@@ -84,7 +84,8 @@ int kvmppc_prepare_to_enter(struct kvm_vcpu *vcpu)
         hard_irq_disable();

         while (true) {
-               if (need_resched()) {
+               unsigned long tf = read_thread_flags();
+               if (tf & (_TIF_NEED_RESCHED | _TIF_NEED_RESCHED_LAZY)) {
                         local_irq_enable();
                         cond_resched();
                         hard_irq_disable();
This is not going help since, with LAZY, cond_resched is nop. So it doesn't call
to schedule. Same is true with preempt=full. I need to figure out if kvm stuff
was tested for preempt=full.

Instead of cond_resched this needs to use schedule instead. Need to test it out
and also see other places for kvm.
Oh yeah. Missed that it was calling cond_resched().
So, i need to spend more time on this and figure out, will send the patches
after that.
quoted
quoted
Ankur

--
ankur
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help