Re: Possible mistake in commit 3ca459eaba1b ("tun: fix group permission check")

From: Willem de Bruijn <willemdebruijn.kernel@gmail.com>
Date: 2025-01-28 22:59:34
Also in: linux-security-module, selinux

stsp wrote:

28.01.2025 18:04, Willem de Bruijn пишет:

quoted

Ondrej Mosnacek wrote:

quoted

On Mon, Jan 27, 2025 at 3:50 PM Willem de Bruijn
[off-list ref] wrote:

quoted

stsp wrote:

quoted

27.01.2025 12:10, Ondrej Mosnacek пишет:

quoted

Hello,

It looks like the commit in $SUBJ may have introduced an unintended
change in behavior. According to the commit message, the intent was to
require just one of {user, group} to match instead of both, which
sounds reasonable, but the commit also changes the behavior for when
neither of tun->owner and tun->group is set. Before the commit the
access was always allowed, while after the commit CAP_NET_ADMIN is
required in this case.

I'm asking because the tun_tap subtest of selinux-testuite [1] started
to fail after this commit (it assumed CAP_NET_ADMIN was not needed),
so I'm trying to figure out if we need to change the test or if it
needs to be fixed in the kernel.

Thanks,

[1] https://github.com/SELinuxProject/selinux-testsuite/

Hi, IMHO having the persistent
TAP device inaccessible by anyone
but the CAP_NET_ADMIN is rather
useless, so the compatibility should
be restored on the kernel side.
I'd raise the questions about adding
the CAP_NET_ADMIN checks into
TUNSETOWNER and/or TUNSETPERSIST,
but this particular change to TUNSETIFF,
at least on my side, was unintentional.

Sorry about that. :(

Thanks for the report Ondrej.

Agreed that we need to reinstate this. I suggest this explicit
extra branch after the more likely cases:

         @@ -585,6 +585,9 @@ static inline bool tun_capable(struct tun_struct *tun)
                         return 1;
                 if (gid_valid(tun->group) && in_egroup_p(tun->group))
                         return 1;
         +       if (!uid_valid(tun->owner) && !gid_valid(tun->group))
         +               return 1;
         +
                 return 0;
          }

That could work, but the semantics become a bit weird, actually: When
you set both uid and gid, one of them needs to match. If you unset
uid/gid, you get a stricter condition (gid/uid must match).

I don't follow this point.

Judging from the history, the intent was that

- if user is set, then it must match.
- if group is set, then it must match.

And I think the group constraint was added with the idea that no one
would try to use both constraints at the same time.

The referenced patch intended to (only) relax the condition when both
are set after all.

quoted

And if you
then also unset the other one, you suddenly get a less strict
condition than the first two - nothing has to match. Might be
acceptable, but it may confuse people unless well documented.

I find that ownership is optional and must be set explicitly through
TUNSETOWNER and TUNSETGROUP quite surprising too.

But this is only reverting to long established behavior.

quoted

Also there is another smaller issue in the new code that I forgot to
mention - with LSMs (such as SELinux) the ns_capable() call will
produce an audit record when the capability is denied by an LSM. These
audit records are meant to indicate that the permission was needed but
denied and that the policy was either breached or needs to be
adjusted. Therefore, the ns_capable() call should ideally only happen
after the user/group checks so that only accesses that actually
wouldn't succeed without the capability yield an audit record.

So I would suggest this version:

static inline bool tun_capable(struct tun_struct *tun)
{
     const struct cred *cred = current_cred();
     struct net *net = dev_net(tun->dev);

     if (uid_valid(tun->owner) && uid_eq(cred->euid, tun->owner))
         return 1;
     if (gid_valid(tun->group) && in_egroup_p(tun->group))
         return 1;
     if (!uid_valid(tun->owner) && !gid_valid(tun->group))
         return 1;
     return ns_capable(net->user_ns, CAP_NET_ADMIN);
}

Improvement makes sense, thanks.

One more point, based on the problem description in the referenced
patch:

     Currently tun checks the group permission even if the user have matched.
     Besides going against the usual permission semantic, this has a
     very interesting implication: if the tun group is not among the
     supplementary groups of the tun user, then effectively no one can
     access the tun device.

The intent was to skip the group check if the user matches. Not
necessarily the reverse.

To minimize the impact of the patch, perhaps it can still always deny
if tun->owner is set and does not match. That keeps the group check
iff the owner is not explicitly set as well.

Doesn't this mean, if the user
is set then group is completely
ignored?

Yes

By doing that you indeed avoid
the problem of "completely
inaccessible tap". However, that
breaks my setup, as I really
intended to provide tap to the
owner and the unrelated group.
This is because, eg when setting
a CI job, you can add the needed
user to the needed group, but
you also need to re-login, which
is not always possible. :(

Could you leave tun->owner unset?

Also completely ignoring group
when the user is set, is somewhat
questionable. At the very least,
perhaps then you need to explicitly
clear the group when the user
is set, to avoid the confusion.
Having "either user or group"
sounds like a sensible semantic,
but its a different semantic.

True. I think that would have satisfied the intent of adding the
group check at the time, and would have avoided this situation.

But we indeed cannot retroactively restrict allowed behavior.
As that will break users.

Conversely, it might be that an existing user out there depends on
the prior behavior that only a process that matches both user and
group can use the device. Which might be reason for reverting the
patch entirely.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help