Thread (9 messages) 9 messages, 4 authors, 2015-02-02

PROBLEM: BUG appearing when trying to allocate interrupt on Exynos MCT after CPU hotplug

From: Russell King - ARM Linux <hidden>
Date: 2014-10-23 14:07:02
Also in: linux-samsung-soc, lkml

On Thu, Oct 23, 2014 at 03:51:16PM +0200, Marcin Jabrzyk wrote:
[1.] One line summary of the problem: "BUG: sleeping function called from
invalid context at mm/slub.c:1250" after CPU hotplug
I'm really not surprised.
When SoC have MCT_INT_SPI interrupt it is being allocated after hotplugging
of the CPU, secondary_start_kernel() is sending CPU boot notifications which
are send when preemption and interrupts are disabled. Exynos_mct
notification handler tries to set up and allocate IRQ for SPI type interrupt
for started CPU and then BUG appears.
There might be similar problem on qcom-timer I think just after looking on
the code.
The CPU notifier is called via notify_cpu_starting(), which is called
with interrupts disabled, and a reason code of CPU_STARTING.  Interrupts
at this point /must/ remain disabled.

The Exynos code then goes on to call exynos4_local_timer_setup() which
tries to reverse the free_irq() in exynos4_local_timer_stop() by calling
request_irq().  Calling request_irq() with interrupts off has never been
permissible.

So, this code is wrong today, and it was also wrong when it was written.
It /couldn't/ have been tested.  It looks like this commit added this
buggy code:

commit ee98d27df6827b5ba4bd99cb7d5cb1239b6a1a31
Author: Stephen Boyd [off-list ref]
Date:   Fri Feb 15 16:40:51 2013 -0800

    ARM: EXYNOS4: Divorce mct from local timer API

    Separate the mct local timers from the local timer API. This will
    allow us to remove ARM local timer support in the near future and
    gets us closer to moving this driver to drivers/clocksource.

    Acked-by: Kukjin Kim [off-list ref]
    Acked-by: Marc Zyngier [off-list ref]
    Cc: Thomas Abraham [off-list ref]
    Signed-off-by: Stephen Boyd [off-list ref]

A good question would be: why doesn't this happen at boot time when CPU1
is first brought up?  The conditions here are no different from hotplugging
CPU1 back in.  Do you see a similar warning on boot too?

-- 
FTTC broadband for 0.8mile line: currently at 9.5Mbps down 400kbps up
according to speedtest.net.
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help