Re: [RFC][PATCH] xfrm: do not leak ESRCH to user space
From: David Miller <davem@davemloft.net>
Date: 2008-10-23 21:12:20
From: Fernando Luis Vázquez Cao <redacted> Date: Thu, 23 Oct 2008 23:27:19 +0900
I noticed that, under certain conditions, ESRCH can be leaked from the xfrm layer to user space through sys_connect. In particular, this seems to happen reliably when the kernel fails to resolve a template either because the AF_KEY receive buffer being used by racoon is full or because the SA entry we are trying to use is in XFRM_STATE_EXPIRED state. However, since this could be a transient issue it could be argued that EAGAIN would be more appropriate. Besides this error code is not even documented in the man page for sys_connect (as of man-pages 3.07). What is the expected behavior (I could not find anything in the RFCs)? Should we just fix the connect(2) man page instead?
I think this case requires some care. -EAGAIN tells the caller that it is a temporary failure and that retrying can be expected to succeed eventually (some resource is not available at the moment). So applications loop when they see this error returned, they will try again. But that's not what is happening when ESRCH is signalled. We found no matching policy, and we've done nothing to make such a policy be found in the (near) future. It is more of a hard failure, which should not necessarily be retried over and over again. So converting this to -EAGAIN doesn't seem correct at all.