Re: [PATCH] Introduce the pkill_on_warn boot parameter
From: Alexander Popov <hidden>
Date: 2021-10-02 16:33:27
Also in:
linux-hardening, lkml
On 02.10.2021 15:13, Steven Rostedt wrote:
On Sat, 2 Oct 2021 14:41:34 +0300 Alexander Popov [off-list ref] wrote:quoted
Currently, the Linux kernel provides two types of reaction to kernel warnings: 1. Do nothing (by default), 2. Call panic() if panic_on_warn is set. That's a very strong reaction, so panic_on_warn is usually disabled on production systems.quoted
From a safety point of view, the Linux kernel misses a middle way of handlingkernel warnings: - The kernel should stop the activity that provokes a warning, - But the kernel should avoid complete denial of service.quoted
From a security point of view, kernel warning messages provide a lot of usefulinformation for attackers. Many GNU/Linux distributions allow unprivileged users to read the kernel log (for various reasons), so attackers use kernel warning infoleak in vulnerability exploits. See the examples: https://a13xp0p0v.github.io/2021/02/09/CVE-2021-26708.html https://a13xp0p0v.github.io/2020/02/15/CVE-2019-18683.html https://googleprojectzero.blogspot.com/2018/09/a-cache-invalidation-bug-in-linux.html Let's introduce the pkill_on_warn parameter. If this parameter is set, the kernel kills all threads in a process that provoked a kernel warning. This behavior is reasonable from a safety point of view described above. It is also useful for kernel security hardening because the system kills an exploit process that hits a kernel warning.How does this help? It only kills the process that caused the warning, it doesn't kill the process that spawned it. This is trivial to get around. Just fork a process, trigger the warning (it gets killed) and then read the kernel log. If this is your rationale, then I'm not convinced this helps at all.
Steven, as I understand, here you ask about the security implications of pkill_on_warn (not about the safety implications that I mentioned). Killing the exploit process that hit a warning is MUCH better than ignoring and proceeding with execution. That may influence the stability of the exploits that hit WARN_ON() or rely on WARN_ON() infoleak. Exploit development is the constant struggle for attack stability. Exploiting a heap memory corruption is especially painful when the kernel works with the attacked slab caches in parallel with your exploit. So when the kernel kills the exploit process, some of the WARN_ON() infoleak data becomes obsolete; the attacker loses the execution in that particular kernel task on that particular CPU. Moreover, restarting the exploit process would bring a lot of noise to the system. That may decrease the attack stability even more. So killing the exploit process is the best option that we have here to distress the attacker who uses the WARN_ON() infoleak technique. I.e. that is probabilistic attack mitigation, which is reasonable for kernel safety as well. I hope I managed to show this from the attacker's side. Best regards, Alexander