Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect.

[PATCH net-next 00/24] locking: Introduce nested-BH locking. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 01/24] locking/local_lock: Introduce guard definition for local_lock. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 01/24] locking/local_lock: Introduce guard definition for local_lock. · Paolo Abeni <pabeni@redhat.com> · 2023-12-18
Re: [PATCH net-next 01/24] locking/local_lock: Introduce guard definition for local_lock. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-11
[PATCH net-next 02/24] locking/local_lock: Add local nested BH locking infrastructure. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 03/24] net: Use __napi_alloc_frag_align() instead of open coding it. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 03/24] net: Use __napi_alloc_frag_align() instead of open coding it. · Paolo Abeni <pabeni@redhat.com> · 2023-12-18
Re: [PATCH net-next 03/24] net: Use __napi_alloc_frag_align() instead of open coding it. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
[PATCH net-next 04/24] net: Use nested-BH locking for napi_alloc_cache. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 04/24] net: Use nested-BH locking for napi_alloc_cache. · kernel test robot <hidden> · 2023-12-16
Re: [PATCH net-next 04/24] net: Use nested-BH locking for napi_alloc_cache. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
[PATCH net-next 05/24] net/tcp_sigpool: Use nested-BH locking for sigpool_scratch. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 08/24] net: softnet_data: Make xmit.recursion per task. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 07/24] netfilter: br_netfilter: Use nested-BH locking for brnf_frag_data_storage. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 09/24] dev: Use the RPS lock for softnet_data::input_pkt_queue on PREEMPT_RT. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 11/24] lwt: Don't disable migration prio invoking BPF. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 06/24] net/ipv4: Use nested-BH locking for ipv4_tcp_sk. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 10/24] dev: Use nested-BH locking for softnet_data.process_queue. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 13/24] net: Use nested-BH locking for bpf_scratchpad. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 12/24] seg6: Use nested-BH locking for seg6_bpf_srh_states. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 12/24] seg6: Use nested-BH locking for seg6_bpf_srh_states. · kernel test robot <hidden> · 2023-12-16
Re: [PATCH net-next 12/24] seg6: Use nested-BH locking for seg6_bpf_srh_states. · Paolo Abeni <pabeni@redhat.com> · 2023-12-18
Re: [PATCH net-next 12/24] seg6: Use nested-BH locking for seg6_bpf_srh_states. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
[PATCH net-next 14/24] net: Add a lock which held during the redirect process. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · kernel test robot <hidden> · 2023-12-16
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Alexei Starovoitov <hidden> · 2023-12-20
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Toke Høiland-Jørgensen <hidden> · 2024-01-04
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Toke Høiland-Jørgensen <hidden> · 2024-01-17
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Jakub Kicinski <kuba@kernel.org> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Jakub Kicinski <kuba@kernel.org> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Toke Høiland-Jørgensen <hidden> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Jakub Kicinski <kuba@kernel.org> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Toke Høiland-Jørgensen <hidden> · 2024-01-20
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-18
Re: [PATCH net-next 15/24] net: Use nested-BH locking for XDP redirect. · Toke Høiland-Jørgensen <hidden> · 2024-01-18
[PATCH net-next 16/24] net: netkit, veth, tun, virt*: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 16/24] net: netkit, veth, tun, virt*: Use nested-BH locking for XDP redirect. · kernel test robot <hidden> · 2023-12-16
Re: [PATCH net-next 16/24] net: netkit, veth, tun, virt*: Use nested-BH locking for XDP redirect. · Daniel Borkmann <daniel@iogearbox.net> · 2023-12-18
Re: [PATCH net-next 16/24] net: netkit, veth, tun, virt*: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
[PATCH net-next 17/24] net: amazon, aquanti, broadcom, cavium, engleder: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
RE: [PATCH net-next 17/24] net: amazon, aquanti, broadcom, cavium, engleder: Use nested-BH locking for XDP redirect. · "Kiyanovski, Arthur" <akiyano@amazon.com> · 2023-12-16
Re: RE: [PATCH net-next 17/24] net: amazon, aquanti, broadcom, cavium, engleder: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2024-01-12
[PATCH net-next 18/24] net: Freescale: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 19/24] net: fungible, gve, mtk, microchip, mana: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 20/24] net: intel: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 20/24] net: intel: Use nested-BH locking for XDP redirect. · kernel test robot <hidden> · 2023-12-16
Re: [PATCH net-next 20/24] net: intel: Use nested-BH locking for XDP redirect. · Nathan Chancellor <nathan@kernel.org> · 2023-12-19
Re: [PATCH net-next 20/24] net: intel: Use nested-BH locking for XDP redirect. · Nick Desaulniers <hidden> · 2023-12-19
[PATCH net-next 21/24] net: marvell: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 23/24] net: qlogic, socionext, stmmac, cpsw: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 22/24] net: mellanox, nfp, sfc: Use nested-BH locking for XDP redirect. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
[PATCH net-next 24/24] net: bpf: Add lockdep assert for the redirect process. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-15
Re: [PATCH net-next 00/24] locking: Introduce nested-BH locking. · Jakub Kicinski <kuba@kernel.org> · 2023-12-15
Re: [PATCH net-next 00/24] locking: Introduce nested-BH locking. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-18
Re: [PATCH net-next 00/24] locking: Introduce nested-BH locking. · Jakub Kicinski <kuba@kernel.org> · 2023-12-19
Re: [PATCH net-next 00/24] locking: Introduce nested-BH locking. · Sebastian Andrzej Siewior <bigeasy@linutronix.de> · 2023-12-21

From: Toke Høiland-Jørgensen <hidden>
Date: 2024-01-18 11:58:11
Also in: bpf, lkml

Sebastian Andrzej Siewior [off-list ref] writes:

> On 2024-01-17 17:37:29 [+0100], Toke Høiland-Jørgensen wrote:
>> This is all back-of-the-envelope calculations, of course. Having some
>> actual numbers to look at would be great; I don't suppose you have a
>> setup where you can run xdp-bench and see how your patches affect the
>> throughput?
>
> No but I probably could set it up.

That would be great! Feel free to ping me if you need any pointers to
how we usually do the perf measurements :)

>> I chatted with Jesper about this, and he had an idea not too far from
>> this: split up the XDP and regular stack processing in two stages, each
>> with their individual batching. So whereas right now we're doing
>> something like:
>>
>> run_napi()
>>   bh_disable()
>>   for pkt in budget:
>>     act = run_xdp(pkt)
>>     if (act == XDP_PASS)
>>       run_netstack(pkt)  // this is the expensive bit
>>   bh_enable()
>>
>> We could instead do:
>>
>> run_napi()
>>   bh_disable()
>>   for pkt in budget:
>>     act = run_xdp(pkt)
>>     if (act == XDP_PASS)
>>       add_to_list(pkt, to_stack_list)
>>   bh_enable()
>>   // sched point
>>   bh_disable()
>>   for pkt in to_stack_list:
>>     run_netstack(pkt)
>>   bh_enable()
>>
>> This would limit the batching that blocks everything to only the XDP
>> processing itself, which should limit the maximum time spent in the
>> blocking state significantly compared to what we have today. The caveat
>> being that rearranging things like this is potentially a pretty major
>> refactoring task that needs to touch all the drivers (even if some of
>> the logic can be moved into the core code in the process). So not really
>> sure if this approach is feasible, TBH.

> This does not work because bh_disable() does not disable scheduling.
> Scheduling may happen. bh_disable() acquires a lock which is currently
> the only synchronisation point between, say, two network drivers doing
> NAPI. And this is what I want to get rid of.
> Regarding the expensive bit, as in XDP_PASS: this doesn't need locking
> as per the proposal, just the REDIRECT piece.

Right, well s/bh_disable()/lock()/; my main point was splitting up the
processing so that the XDP processing itself and the stack activation on
XDP_PASS are not interleaved. This will make it possible to hold the lock
around the whole XDP batch, not just individual packets, and so retain
the performance we gain from amortising expensive operations over
multiple packets.
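
To make the amortisation argument concrete, here is a minimal user-space
sketch in Python, not kernel code. Everything in it is a hypothetical
stand-in: CountingLock only counts acquisitions, run_xdp() returns a toy
verdict derived from the packet number, and appending to a list takes the
place of run_netstack(). It compares taking the lock once per packet with
taking it once around the whole XDP batch and running the stack afterwards.

```python
from dataclasses import dataclass, field
from enum import Enum


class Action(Enum):
    PASS = "XDP_PASS"
    REDIRECT = "XDP_REDIRECT"
    DROP = "XDP_DROP"


@dataclass
class CountingLock:
    """Hypothetical stand-in for the lock; it only counts acquisitions."""
    acquisitions: int = 0
    held: bool = False

    def __enter__(self):
        assert not self.held, "the lock is not recursive in this sketch"
        self.held = True
        self.acquisitions += 1
        return self

    def __exit__(self, *exc):
        self.held = False
        return False


@dataclass
class Stats:
    redirected: list = field(default_factory=list)
    to_stack: list = field(default_factory=list)


def run_xdp(pkt):
    # Toy verdict (assumption): packets are ints, the verdict is pkt % 3.
    return (Action.REDIRECT, Action.PASS, Action.DROP)[pkt % 3]


def napi_interleaved(budget, lock):
    """XDP and stack processing interleaved; lock taken per packet."""
    stats = Stats()
    for pkt in budget:
        with lock:
            act = run_xdp(pkt)
            if act is Action.REDIRECT:
                stats.redirected.append(pkt)
        if act is Action.PASS:
            stats.to_stack.append(pkt)  # stands in for run_netstack(pkt)
    return stats


def napi_two_stage(budget, lock):
    """Stage 1: whole XDP batch under one lock. Stage 2: stack, no lock."""
    stats = Stats()
    to_stack_list = []
    with lock:
        for pkt in budget:
            act = run_xdp(pkt)
            if act is Action.REDIRECT:
                stats.redirected.append(pkt)
            elif act is Action.PASS:
                to_stack_list.append(pkt)
    # Lock released here: stack activation no longer interleaves with XDP.
    for pkt in to_stack_list:
        stats.to_stack.append(pkt)  # stands in for run_netstack(pkt)
    return stats
```

With a budget of 64 packets the interleaved variant acquires the lock 64
times and the two-stage variant acquires it once, while both hand the same
packets to the redirect path and to the stack. The sketch says nothing
about real lock cost or latency; that still needs xdp-bench numbers.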

-Toke
