Re: [PATCH v2] powerpc/rtas: Keep MSR[RI] set when calling RTAS
From: Laurent Dufour <hidden>
Date: 2022-05-04 11:01:38
Also in:
lkml, stable
On 04/05/2022, 07:59:29, Michael Ellerman wrote:
Laurent Dufour [off-list ref] writes:quoted
On 03/05/2022, 17:06:41, Michael Ellerman wrote:quoted
Laurent Dufour [off-list ref] writes:...quoted
quoted
quoted
diff --git a/arch/powerpc/kernel/rtas.c b/arch/powerpc/kernel/rtas.c index 1f42aabbbab3..d7775b8c8853 100644 --- a/arch/powerpc/kernel/rtas.c +++ b/arch/powerpc/kernel/rtas.c@@ -49,6 +49,11 @@ void enter_rtas(unsigned long); static inline void do_enter_rtas(unsigned long args) { + unsigned long msr; + + msr = mfmsr(); + BUG_ON(!(msr & MSR_RI));I'm not sure about this. We call RTAS in some low-level places, so if we ever hit this BUG_ON then it might cause us to crash badly, or recursively BUG. A WARN_ON_ONCE() might be safer?I'm afraid a BUG_ON is required here. Since MSR[RI] is set on RTAS exit so if it was not set when calling RTAS, that's a real issue and should generate unexpected behaviour. Do you have places in mind where RTAS could be called with !MSR[RI]?The main one I can think of is if someone is using CONFIG_UDBG_RTAS_CONSOLE, then udbg_rtascon_putc() is wired up as udbg_putc() and that might be called from anywhere, including xmon. There's also RTAS calls in low-level xics interrupt code, that might get called during panic/crash. I don't expect any of those places to be called with MSR[RI] unset, but I'm worried that if we're already crashing and for some reason MSR[RI] is unset, then that BUG_ON will just make things worse. eg. imagine taking a BUG_ON() for every character we try to print as part of an oops. Admittedly CONFIG_UDBG_RTAS_CONSOLE is old and probably not used much anymore, but I'm still a bit paranoid :)
I think you're right to be paranoid :) This part of code can be really sensitive. I boot a kernel built with CONFIG_UDBG_RTAS_CONSOLE, xmon is working fine, but I cannot pretend this is covering all the RTAS call cases. My hope with BUG_ON() is to raise the issue, as soon as possible, so it can be addressed during the test phase.