Re: [RFC PATCH v3 3/3] acpi_memhotplug: Allow eject to proceed on rebind scenario
From: Rafael J. Wysocki <hidden>
Date: 2012-11-28 23:00:39
Also in:
linux-acpi, lkml
On Wednesday, November 28, 2012 02:46:33 PM Greg KH wrote:
On Wed, Nov 28, 2012 at 11:39:22PM +0100, Rafael J. Wysocki wrote:quoted
On Wednesday, November 28, 2012 03:16:22 PM Toshi Kani wrote:quoted
quoted
quoted
quoted
quoted
quoted
quoted
I see. I do not think whether or not the device is removed on eject makes any difference here. The issue is that after driver_unbind() is done, acpi_bus_hot_remove_device() no longer calls the ACPI memory driver (hence, it cannot fail in prepare_remove), and goes ahead to call _EJ0. If driver_unbind() did off-line the memory, this is OK. However, it cannot off-line kernel memory ranges. So, we basically need to either 1) serialize acpi_bus_hot_remove_device() and driver_unbind(), or 2) make acpi_bus_hot_remove_device() to fail if driver_unbind() is run during the operation.OK, I see the problem now. What exactly is triggering the driver_unbind() in this scenario?User can request driver_unbind() from sysfs as follows. I do not see much reason why user has to do for memory, though. echo "PNP0C80:XX" > /sys/bus/acpi/drivers/acpi_memhotplug/unbindThis is wrong. Even if we want to permit user space to forcibly unbind drivers from anything like this, we should at least check for some situations in which it is plain dangerous. Like in this case. So I think the above should fail unless we know that the driver won't be necessary to handle hot-removal of memory.Well, we tried twice already... :) https://lkml.org/lkml/2012/11/16/649I didn't mean driver_unbind() should fail. The code path that executes driver_unbind() eventually should fail _before_ executing it.driver_unbind() is the handler, so it is called directly from this unbind interface.Yes, sorry for the confusion. So, it looks like the driver core wants us to handle driver unbinding no matter what.Yes. Well, the driver core does the unbinding no matter what, if it was told, by a user, to do so. Why is that a problem? The user then is responsible for any bad things (i.e. not able to control the device any more), if they do so.
I don't really agree with that, because the user may simply not know what the consequences of that will be. In my not so humble opinion any interface allowing user space to crash the kernel is a bad one. And this is an example of that.
quoted
This pretty much means that it is a bad idea to have a driver that is exposed as a "device driver" in sysfs for memory hotplugging.Again, why? All this means is that the driver is now not connected to the device (memory in this case.) The memory is still there, still operates as before, only difference is, the driver can't touch it anymore. This is the same for any ACPI driver, and has been for years.
Except that if this driver has been unbound and the removal is triggered by an SCI, the core will just go on and remove the memory, although it may be killing the kernel this way. Arguably, this may be considered as the core's fault, but the only way to fix that would be to move the code from that driver into the core and not to register it as a "driver" any more. Which was my point. :-)
Please don't confuse unbind with any "normal" system operation, it is not to be used for memory hotplug, or anything else like this. Also, if you really do not want to do this, turn off the ability to unbind/bind for these devices, that is under your control in your bus logic.
OK, but how? I'm looking at driver_unbind() and not seeing any way to do that actually. Thanks, Rafael -- I speak only for myself. Rafael J. Wysocki, Intel Open Source Technology Center. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>