Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock

[PATCH printk v1 00/10] printk: introduce atomic consoles and sync mode · John Ogness <john.ogness@linutronix.de> · 2021-08-03
[PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Daniel Thompson <hidden> · 2021-08-03
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Daniel Thompson <hidden> · 2021-08-04
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Petr Mladek <pmladek@suse.com> · 2021-08-04
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Daniel Thompson <hidden> · 2021-08-04
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · John Ogness <john.ogness@linutronix.de> · 2021-08-05
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Daniel Thompson <hidden> · 2021-08-06
Re: [PATCH printk v1 03/10] kgdb: delay roundup if holding printk cpulock · Petr Mladek <pmladek@suse.com> · 2021-08-04
[PATCH printk v1 08/10] printk: introduce kernel sync mode · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 08/10] printk: introduce kernel sync mode · Petr Mladek <pmladek@suse.com> · 2021-08-05
Re: [PATCH printk v1 08/10] printk: introduce kernel sync mode · John Ogness <john.ogness@linutronix.de> · 2021-08-05
[PATCH printk v1 04/10] printk: relocate printk_delay() · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 04/10] printk: relocate printk_delay() · Petr Mladek <pmladek@suse.com> · 2021-08-04
[PATCH printk v1 01/10] printk: relocate printk cpulock functions · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 01/10] printk: relocate printk cpulock functions · Petr Mladek <pmladek@suse.com> · 2021-08-04
[PATCH printk v1 06/10] printk: use seqcount_latch for console_seq · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 06/10] printk: use seqcount_latch for console_seq · Petr Mladek <pmladek@suse.com> · 2021-08-05
Re: [PATCH printk v1 06/10] printk: use seqcount_latch for console_seq · John Ogness <john.ogness@linutronix.de> · 2021-08-05
Re: [PATCH printk v1 06/10] printk: use seqcount_latch for console_seq · Petr Mladek <pmladek@suse.com> · 2021-08-06
Re: [PATCH printk v1 06/10] printk: use seqcount_latch for console_seq · Sergey Senozhatsky <senozhatsky@chromium.org> · 2021-08-31
[PATCH printk v1 07/10] console: add write_atomic interface · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 07/10] console: add write_atomic interface · Andy Shevchenko <andriy.shevchenko@linux.intel.com> · 2021-08-03
Re: [PATCH printk v1 07/10] console: add write_atomic interface · John Ogness <john.ogness@linutronix.de> · 2021-08-06
Re: [PATCH printk v1 07/10] console: add write_atomic interface · Andy Shevchenko <andriy.shevchenko@linux.intel.com> · 2021-08-06
Re: [PATCH printk v1 07/10] console: add write_atomic interface · Sergey Senozhatsky <senozhatsky@chromium.org> · 2021-08-31
[PATCH printk v1 02/10] printk: rename printk cpulock API and always disable interrupts · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 02/10] printk: rename printk cpulock API and always disable interrupts · Petr Mladek <pmladek@suse.com> · 2021-08-04
[PATCH printk v1 05/10] printk: call boot_delay_msec() in printk_delay() · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 05/10] printk: call boot_delay_msec() in printk_delay() · Petr Mladek <pmladek@suse.com> · 2021-08-04
Re: [PATCH printk v1 05/10] printk: call boot_delay_msec() in printk_delay() · Sergey Senozhatsky <senozhatsky@chromium.org> · 2021-08-31
[PATCH printk v1 09/10] kdb: if available, only use atomic consoles for output mirroring · John Ogness <john.ogness@linutronix.de> · 2021-08-03
[PATCH printk v1 10/10] serial: 8250: implement write_atomic · John Ogness <john.ogness@linutronix.de> · 2021-08-03
Re: [PATCH printk v1 10/10] serial: 8250: implement write_atomic · Andy Shevchenko <andriy.shevchenko@linux.intel.com> · 2021-08-03
Re: [PATCH printk v1 10/10] serial: 8250: implement write_atomic · Jiri Slaby <jirislaby@kernel.org> · 2021-08-05
Re: [PATCH printk v1 10/10] serial: 8250: implement write_atomic · John Ogness <john.ogness@linutronix.de> · 2021-08-05
Re: [PATCH printk v1 00/10] printk: introduce atomic consoles and sync mode · Andy Shevchenko <andriy.shevchenko@linux.intel.com> · 2021-08-03
Re: [PATCH printk v1 00/10] printk: introduce atomic consoles and sync mode · Petr Mladek <pmladek@suse.com> · 2021-08-05
Re: [PATCH printk v1 00/10] printk: introduce atomic consoles and sync mode · Sergey Senozhatsky <senozhatsky@chromium.org> · 2021-08-31

From: Petr Mladek <pmladek@suse.com>
Date: 2021-08-04 12:12:34
Also in: linuxppc-dev

On Wed 2021-08-04 12:31:59, Daniel Thompson wrote:

On Tue, Aug 03, 2021 at 05:36:32PM +0206, John Ogness wrote:

quoted

On 2021-08-03, Daniel Thompson [off-list ref] wrote:

quoted

On Tue, Aug 03, 2021 at 03:18:54PM +0206, John Ogness wrote:

quoted

kgdb makes use of its own cpulock (@dbg_master_lock, @kgdb_active)
during cpu roundup. This will conflict with the printk cpulock.

When the full vision is realized what will be the purpose of the printk
cpulock?

I'm asking largely because it's current role is actively unhelpful
w.r.t. kdb. It is possible that cautious use of in_dbg_master() might
be a better (and safer) solution. However it sounds like there is a
larger role planned for the printk cpulock...

The printk cpulock is used as a synchronization mechanism for
implementing atomic consoles, which need to be able to safely interrupt
the console write() activity at any time and immediately continue with
their own printing. The ultimate goal is to move all console printing
into per-console dedicated kthreads, so the primary function of the
printk cpulock is really to immediately _stop_ the CPU/kthread
performing write() in order to allow write_atomic() (from any context on
any CPU) to safely and reliably take over.

I see.

Is there any mileage in allowing in_dbg_master() to suppress taking
the console lock?

There's a couple of reasons to worry about the current approach.

The first is that we don't want this code to trigger in the case when
kgdb is enabled and kdb is not since it is only kdb (a self-hosted
debugger) than uses the consoles. This case is relatively trivial to
address since we can rename it kdb_roundup_delay() and alter the way it
is conditionally compiled.

The second is more of a problem however. kdb will only call into the
console code from the debug master. By default this is the CPU that
takes the debug trap so initial prints will work fine. However it is
possible to switch to a different master (so we can read per-CPU
registers and things like that). This will result in one of the CPUs
that did the IPI round up calling into console code and this is unsafe
in that instance.

There are a couple of tricks we could adopt to work around this but
given the slightly odd calling context for kdb (all CPUs quiesced, no
log interleaving possible) it sounds like it would remain safe to
bypass the lock if in_dbg_master() is true.

Bypassing an inconvenient lock might sound icky but:

1. If the lock is not owned by any CPU then what kdb will do is safe.

2. If the lock is owned by any CPU then we have quiesced it anyway
   and this makes is safe for the owning CPU to share its ownership
   (since it isn't much different to recursive acquisition on a single
   CPU)

I think about the following:

void kgdb_roundup_cpus(void)
{
	__printk_cpu_lock();
	__kgdb_roundup_cpus();
}

, where __printk_cpu_lock() waits/takes printk_cpu_lock()
	__kgdb_roundup_cpus() is the original kgdb_roundup_cpus();


The idea is that kgdb_roundup_cpus() caller takes the printk_cpu lock.
The owner will be well defined.

As a result any other CPU will not be able to take the printk_cpu lock
as long as it is owned by the kgdb lock. But as you say, kgdb will
make sure that everything is serialized at this stage. So that
the original raw_printk_cpu_lock_irqsave() might just disable
IRQs when called under debugger.

Does it make any sense?

I have to say that it is a bit hairy. But it looks slightly better
than the delayed/repeated IPI proposed by this patch.

Best Regards,
Petr

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help