Re: [PATCH bpf-next 0/3] Introduce pinnable bpf_link kernel abstraction

From: Alexei Starovoitov <hidden>
Date: 2020-03-04 04:36:50
Also in: bpf

On Tue, Mar 03, 2020 at 11:27:13PM +0100, Toke Høiland-Jørgensen wrote:

Alexei Starovoitov [off-list ref] writes:

quoted

Legacy api for tc, xdp, cgroup will not be able to override FD-based
link. For TC it's easy. cls-bpf allows multi-prog, so netlink
adding/removing progs will not be able to touch progs that are
attached via FD-based link.
Same thing for cgroups. FD-based link will be similar to 'multi' mode.
The owner of the link has a guarantee that their program will
stay attached to cgroup.
XDP is also easy, since it has only one prog. Attaching an FD-based link
will prevent netlink from overriding it.
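
The override rule described above can be sketched as a toy model. This is not kernel code: the `XdpHook` class, its method names, and the choice of `EBUSY` as the error are assumptions made purely for illustration, and only the single-program XDP case is modeled. The thread does not say what happens when a link is attached to an already occupied hook, so refusing that case here is also an assumption.

```python
import errno


class XdpHook:
    """Toy model of a single-program XDP attach point (hypothetical, not a kernel API)."""

    def __init__(self):
        self.prog = None        # name of the currently attached program, if any
        self.link_owner = None  # set when the program was attached via an FD-based link

    def attach_legacy(self, prog):
        """Netlink-style attach: may replace a legacy prog, never a link-owned one."""
        if self.link_owner is not None:
            raise OSError(errno.EBUSY, "hook is owned by an FD-based link")
        self.prog = prog

    def attach_link(self, prog, owner):
        """FD-based link attach: the owner keeps the hook until the link is released."""
        if self.prog is not None:
            # Assumption: an occupied hook is refused rather than replaced.
            raise OSError(errno.EBUSY, "hook already has a program attached")
        self.prog = prog
        self.link_owner = owner

    def release_link(self, owner):
        """Models the last reference to the link going away."""
        if self.link_owner != owner:
            raise PermissionError("only the link owner can release the link")
        self.prog = None
        self.link_owner = None


def legacy_override_blocked():
    """Return True if a legacy attach fails while a link owns the hook."""
    hook = XdpHook()
    hook.attach_link("dispatcher_prog", owner="libdispatcher")
    try:
        hook.attach_legacy("random_prog")
    except OSError as err:
        return err.errno == errno.EBUSY and hook.prog == "dispatcher_prog"
    return False
```

In this model the legacy path works again only after the owner calls `release_link`, which mirrors the guarantee that the owner's program stays attached for as long as the link exists.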

So what happens if the device goes away?

I'm not sure yet whether it's cleaner to have the netdev, qdisc, or cgroup be held
by the link or to use a notifier approach. There are pros and cons to both.
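
As a rough illustration of the notifier option only, the sketch below models a link that registers a callback on its target and becomes defunct when the target is unregistered. The `Device` and `NotifierLink` classes are hypothetical, and the behavior on notification (the link detaches while its handle stays valid) is an assumption; the thread leaves that detail open.

```python
class Device:
    """Toy stand-in for a netdev that can notify listeners when it goes away."""

    def __init__(self, name):
        self.name = name
        self.alive = True
        self.listeners = []

    def unregister(self):
        """Tear the device down and notify every registered listener once."""
        self.alive = False
        for callback in list(self.listeners):
            callback(self)
        self.listeners.clear()


class NotifierLink:
    """Toy link that does not pin the device; it only listens for its removal."""

    def __init__(self, dev, prog):
        self.dev = dev
        self.prog = prog
        dev.listeners.append(self._on_unregister)

    def _on_unregister(self, dev):
        # Assumption: the link auto-detaches but the object itself stays usable.
        self.dev = None

    @property
    def attached(self):
        return self.dev is not None
```

Because the link never holds a reference on the device in this model, `unregister()` always completes, and the holder of the link can later observe `attached == False`.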

quoted

This way the rootlet prog installed by libxdp (let's find a better name
for it) will stay attached.

Dispatcher prog?

Would be great, but the 'bpf_dispatcher' name is already used in the kernel.
I guess we can still call the library libdispatcher and dispatcher prog?
Alternatives:
libchainer and chainer prog
libaggregator and aggregator prog?
libpolicer kinda fits too, but could be misleading.
libxdp is very confusing. It's not xdp specific.

quoted

libxdp can choose to pin it in some libxdp specific location, so other
libxdp-enabled applications can find it in the same location, detach,
replace, modify, but random app that wants to hack an xdp prog won't
be able to mess with it.

What if that "random app" comes first, and keeps holding on to the link
fd? Then the admin essentially has to start killing processes until they
find the one that has the device locked, no?

Of course not. We have to provide an API that makes it easy to discover
which process holds that link and where it's pinned.
But if we go with the notifier approach, none of it is an issue.
Whether the target obj is held or a notifier is used, everything I said before still
stands. A "random app" that uses netlink after libdispatcher got its link FD will
not be able to mess with the carefully orchestrated setup done by libdispatcher.

Also, either approach will guarantee that users never see the infamous message:
"unregister_netdevice: waiting for %s to become free. Usage count"

And what about the case where the link fd is pinned on a bpffs that is
no longer available? I.e., if a netdevice with an XDP program moves
namespaces and no longer has access to the original bpffs, that XDP
program would essentially become immutable?

'immutable' will not be possible.
It's not clear to me how bpffs is going to disappear. What do you mean
exactly?

quoted

We didn't come up with these design choices overnight. It came from
hard lessons learned while deploying xdp, tc and cgroup in production.
Legacy apis will not be deprecated, of course.

Not deprecated, just less privileged?

No idea what you're referring to.
