Re: [RFC PATCH 00/13] Ultra Ethernet driver introduction

From: Nikolay Aleksandrov <hidden>
Date: 2025-03-19 14:02:36
Also in: linux-rdma

On 3/19/25 15:52, Jason Gunthorpe wrote:

On Fri, Mar 14, 2025 at 02:53:40PM +0000, Bernard Metzler wrote:

I assume the correct way forward is to first clarify the
structure of all user-visible objects that need to be
created/controlled/destroyed, and to route them through
this interface. Some will require extensions to given objects,
some may be new, some will be as-is. rdma_netlink will probably
be the right interface to look at for job control.

As I understand the job ID model you will need to have some privileged
entity to create a "job ID file descriptor" that can be passed around
to unprivileged processes to grant them access to the job ID. This is
necessary since the Job ID becomes part of the packet headers and we
must secure userspace to prevent it from hijacking or spoofing these
values on the wire.

Netlink has a major downside that you can't use filesystem ACL
permissions to control access, so building a low privilege daemon just
to do job id management seems to me to be more difficult.

As an example, I would imagine having a job management char device
with a filesystem ACL that only allows something like SLURM's
privileged orchestrator to talk to it. SLURM wouldn't have something
like CAP_NET_ADMIN. SLURM would set up the job ID and pass the "Job ID
FD" to the actual MPI workload processes to grant them permission to
use those network headers.

Nobody else in the system can create Job IDs besides SLURM, and in a
multi-user environment one user cannot reach into the other and hijack
their job ID because the FD does not leak outside the MPI process
tree.
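
For illustration only, here is a minimal userspace sketch of how an open file
descriptor can be handed from one process to another over an AF_UNIX socket
using the standard SCM_RIGHTS control message. Nothing in this RFC defines a
"Job ID FD"; the helper names send_fd() and recv_fd() are hypothetical, and
using SCM_RIGHTS rather than plain inheritance across fork() is an assumption
about how an orchestrator might hand the descriptor to its workload processes.

```c
#include <string.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <sys/uio.h>
#include <unistd.h>

/* Hypothetical helper: send one open descriptor over a connected AF_UNIX
 * socket. Returns 0 on success, -1 on failure. */
int send_fd(int sock, int fd)
{
	char byte = 'J'; /* one byte of ordinary data carries the control message */
	struct iovec iov = { .iov_base = &byte, .iov_len = 1 };
	union {
		char buf[CMSG_SPACE(sizeof(int))];
		struct cmsghdr align; /* forces correct alignment of buf */
	} u;
	struct msghdr msg;
	struct cmsghdr *cmsg;

	memset(&u, 0, sizeof(u));
	memset(&msg, 0, sizeof(msg));
	msg.msg_iov = &iov;
	msg.msg_iovlen = 1;
	msg.msg_control = u.buf;
	msg.msg_controllen = sizeof(u.buf);

	cmsg = CMSG_FIRSTHDR(&msg);
	cmsg->cmsg_level = SOL_SOCKET;
	cmsg->cmsg_type = SCM_RIGHTS;
	cmsg->cmsg_len = CMSG_LEN(sizeof(int));
	memcpy(CMSG_DATA(cmsg), &fd, sizeof(int));

	return sendmsg(sock, &msg, 0) == 1 ? 0 : -1;
}

/* Hypothetical helper: receive one descriptor sent by send_fd().
 * Returns the new descriptor in the receiving process, or -1 on failure. */
int recv_fd(int sock)
{
	char byte;
	struct iovec iov = { .iov_base = &byte, .iov_len = 1 };
	union {
		char buf[CMSG_SPACE(sizeof(int))];
		struct cmsghdr align;
	} u;
	struct msghdr msg;
	struct cmsghdr *cmsg;
	int fd;

	memset(&u, 0, sizeof(u));
	memset(&msg, 0, sizeof(msg));
	msg.msg_iov = &iov;
	msg.msg_iovlen = 1;
	msg.msg_control = u.buf;
	msg.msg_controllen = sizeof(u.buf);

	if (recvmsg(sock, &msg, 0) != 1)
		return -1;

	cmsg = CMSG_FIRSTHDR(&msg);
	if (!cmsg || cmsg->cmsg_level != SOL_SOCKET ||
	    cmsg->cmsg_type != SCM_RIGHTS ||
	    cmsg->cmsg_len != CMSG_LEN(sizeof(int)))
		return -1;

	memcpy(&fd, CMSG_DATA(cmsg), sizeof(int));
	return fd;
}
```

The receiver gets its own descriptor number referring to the same open file,
so access is limited to whichever processes were explicitly handed the
descriptor or inherited it.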

This RFC doesn't describe the intended security model, but I'm very
surprised to see ultraeth_nl_job_new_doit() not do any capability
checks, or any security whatsoever around access to the job.

It doesn't need to do any capability checking itself because that is declared in
the YAML model: the ops there carry flags: [ admin-perm ], so the automatically
generated genl ops code gets .flags = GENL_ADMIN_PERM | GENL_CMD_CAP_DO for these
ops, which in turn means the genetlink code checks that the caller has
CAP_NET_ADMIN. An unprivileged process can request to associate with multiple jobs,
and it is the privileged process that has to configure and control them. In this
version we have only configuration. Once the specs become publicly available we
will be able to share more information about how it's expected to work.
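
To make the flow above concrete, here is a small userspace model of the check,
not the kernel code itself. All model_* and MODEL_* names are hypothetical
stand-ins; the boolean parameter stands in for the kernel's CAP_NET_ADMIN
test, and the op without the admin flag is invented purely for contrast.

```c
#include <errno.h>
#include <stdbool.h>

/* Stand-ins for the kernel's GENL_ADMIN_PERM and GENL_CMD_CAP_DO bits. */
#define MODEL_GENL_ADMIN_PERM 0x01u
#define MODEL_GENL_CMD_CAP_DO 0x02u

/* Simplified model of one generated genl op entry. */
struct model_genl_op {
	const char *name;
	unsigned int flags;
	int (*doit)(void);
};

/* Stand-in handler: note that it performs no permission check itself. */
int model_job_new_doit(void)
{
	return 0;
}

/* Model of the generic dispatch step: an op flagged admin-perm is refused
 * before its handler runs unless the caller holds the capability. */
int model_genl_rcv(const struct model_genl_op *op, bool caller_has_net_admin)
{
	if ((op->flags & MODEL_GENL_ADMIN_PERM) && !caller_has_net_admin)
		return -EPERM;
	return op->doit();
}

/* What an op declared with flags: [ admin-perm ] would look like here. */
const struct model_genl_op model_job_new_op = {
	.name  = "job-new",
	.flags = MODEL_GENL_ADMIN_PERM | MODEL_GENL_CMD_CAP_DO,
	.doit  = model_job_new_doit,
};

/* Hypothetical op without admin-perm, for contrast only. */
const struct model_genl_op model_open_op = {
	.name  = "open-example",
	.flags = MODEL_GENL_CMD_CAP_DO,
	.doit  = model_job_new_doit,
};
```

The point of the model is that the permission decision lives in the dispatch
path and is driven by the per-op flags, which is why the doit handler contains
no explicit capability check.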

Cheers,
 Nik