Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO... | netdev

[PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 01/52] libbpf: factor out BTF loading from load_module_btfs() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 03/52] libbpf: add function to get the pair BTF ID + type ID for a given type · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 02/52] libbpf: try to load vmlinux BTF from the kernel first · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 04/52] libbpf: patch module BTF ID into BPF insns · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 06/52] bpf: pass a pointer to union bpf_attr to bpf_link_ops::update_prog() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 05/52] net, xdp: decouple XDP code from the core networking code · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 08/52] net, xdp: factor out XDP install arguments to a separate structure · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 10/52] net, xdp: add ability to specify frame size threshold for XDP metadata · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 09/52] net, xdp: add ability to specify BTF ID for XDP metadata · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 12/52] libbpf: add ability to set the BTF/type ID on setting XDP prog · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 11/52] libbpf: factor out __bpf_set_link_xdp_fd_replace() args into a struct · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 07/52] net, xdp: remove redundant arguments from dev_xdp_{at,de}tach_link() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 14/52] libbpf: pass &bpf_link_create_opts directly to bpf_program__attach_fd() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 16/52] selftests/bpf: expand xdp_link to check that setting meta opts works · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 20/52] net, xdp: move XDP metadata helpers into new xdp_meta.h · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 17/52] samples/bpf: pass a struct to sample_install_xdp() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 19/52] stddef: make __struct_group() UAPI C++-friendly · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 21/52] net, xdp: allow metadata > 32 · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 18/52] samples/bpf: add ability to specify metadata threshold · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 22/52] net, skbuff: add ability to skip skb metadata comparison · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 13/52] libbpf: add ability to set the meta threshold on setting XDP prog · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 26/52] bpf, btf: add a pair of function to work with the BTF ID + type ID pair · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 25/52] net, xdp: add basic generic metadata accessors · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 15/52] libbpf: add bpf_program__attach_xdp_opts() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 27/52] net, xdp: add &sk_buff <-> &xdp_meta_generic converters · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 30/52] net, gro: decouple GRO from the NAPI layer · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 34/52] samples/bpf: add 'timeout' option to xdp_redirect_cpu · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 38/52] net, xdp: remove unused xdp_attachment_info::flags · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 24/52] bpf, xdp: declare generic XDP metadata structure · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 33/52] bpf, cpumap: add option to set a timeout for deferred flush · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 39/52] net, xdp: make &xdp_attachment_info a bit more useful in drivers · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 42/52] net, xdp: shortcut skb->dev in bpf_prog_run_generic_xdp() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 43/52] net, xdp: build XDP generic metadata on Generic (skb) XDP path · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 40/52] net, xdp: add an RCU version of xdp_attachment_setup() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 48/52] libbpf: compress Endianness ops with a macro · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 47/52] net, ice: build XDP generic metadata · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 46/52] net, ice: use an onstack &xdp_meta_generic_rx to store HW frame info · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 45/52] net, ice: consolidate all skb fields processing · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 44/52] net, ice: allow XDP prog hot-swapping · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 50/52] libbpf: introduce a couple memory access helpers · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 49/52] libbpf: add LE <--> CPU conversion helpers · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 51/52] selftests/bpf: fix using test_xdp_meta BPF prog via skeleton infra · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 41/52] net, xdp: replace net_device::xdp_prog pointer with &xdp_attachment_info · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 52/52] selftests/bpf: add XDP Generic Hints selftest · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 35/52] net, skbuff: introduce napi_skb_cache_get_bulk() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <hidden> · 2022-06-28
Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Daniel Xu <hidden> · 2024-08-07
Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <hidden> · 2024-08-08
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-08
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <hidden> · 2024-08-08
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Daniel Xu <hidden> · 2024-08-08
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Jesper Dangaard Brouer <hawk@kernel.org> · 2024-08-09
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-09
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Toke Høiland-Jørgensen <hidden> · 2024-08-09
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-09
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Toke Høiland-Jørgensen <hidden> · 2024-08-09
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Martin KaFai Lau <martin.lau@linux.dev> · 2024-08-10
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <hidden> · 2024-08-10
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Jakub Kicinski <kuba@kernel.org> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Jesper Dangaard Brouer <hawk@kernel.org> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <hidden> · 2024-08-10
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Toke Høiland-Jørgensen <hidden> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Jesper Dangaard Brouer <hawk@kernel.org> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-19
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Daniel Xu <hidden> · 2024-08-21
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-21
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Daniel Xu <hidden> · 2024-08-21
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <lorenzo@kernel.org> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Lorenzo Bianconi <hidden> · 2024-08-13
Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Alexander Lobakin <aleksander.lobakin@intel.com> · 2024-08-13
Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Daniel Xu <hidden> · 2024-08-08
Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list() · Jesper Dangaard Brouer <hawk@kernel.org> · 2024-08-09
[PATCH RFC bpf-next 31/52] net, gro: expose some GRO API to use outside of NAPI · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 29/52] net, xdp: try to fill skb fields when converting from an &xdp_frame · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 28/52] net, xdp: prefetch data a bit when building an skb from an &xdp_frame · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 37/52] rcupdate: fix access helpers for incomplete struct pointers on GCC < 10 · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 23/52] net, skbuff: constify the @skb argument of skb_hwtstamps() · Alexander Lobakin <hidden> · 2022-06-28
[PATCH RFC bpf-next 36/52] bpf, cpumap: switch to napi_skb_cache_get_bulk() · Alexander Lobakin <hidden> · 2022-06-28
RE: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · John Fastabend <john.fastabend@gmail.com> · 2022-06-29
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Toke Høiland-Jørgensen <hidden> · 2022-06-29
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-07-04
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Toke Høiland-Jørgensen <hidden> · 2022-07-04
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-07-05
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Toke Høiland-Jørgensen <hidden> · 2022-07-05
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-07-06
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Toke Høiland-Jørgensen <hidden> · 2022-07-06
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Jesper Dangaard Brouer <hidden> · 2022-07-07
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Magnus Karlsson <hidden> · 2022-07-12
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Jesper Dangaard Brouer <hidden> · 2022-07-12
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Magnus Karlsson <hidden> · 2022-07-15
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Jesper Dangaard Brouer <hidden> · 2022-07-04
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-07-05
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Daniel Borkmann <daniel@iogearbox.net> · 2022-07-05
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Zvi Effron <hidden> · 2022-06-29
Re: [xdp-hints] Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Magnus Karlsson <hidden> · 2022-06-30
Re: [PATCH RFC bpf-next 00/52] bpf, xdp: introduce and use Generic Hints/metadata · Alexander Lobakin <hidden> · 2022-07-04

Re: [xdp-hints] Re: [PATCH RFC bpf-next 32/52] bpf, cpumap: switch to GRO from netif_receive_skb_list()

From: Daniel Xu <hidden>
Date: 2024-08-21 00:29:52
Also in: bpf, lkml
Subsystem: bpf [general] (safe dynamic programs and tools), the rest, xdp (express data path) · Maintainers: Alexei Starovoitov, Daniel Borkmann, Andrii Nakryiko, Eduard Zingerman, Kumar Kartikeya Dwivedi, Linus Torvalds, David S. Miller, Jakub Kicinski, Jesper Dangaard Brouer, John Fastabend

Hi Olek,

On Mon, Aug 19, 2024 at 04:50:52PM GMT, Alexander Lobakin wrote:
[..]

quoted

Thanks A LOT for doing this benchmarking!

I optimized the code a bit and picked my old patches for bulk NAPI skb
cache allocation and today I got 4.7 Mpps 🎉
IOW, the result of the series (7 patches totally, but 2 are not
networking-related) is 2.7 -> 4.7 Mpps == 75%!

Daniel,

if you want, you can pick my tree[0], either full or just up to

"bpf: cpumap: switch to napi_skb_cache_get_bulk()"

(13 patches total: 6 for netdev_feature_t and 7 for the cpumap)

and test with your usecases. Would be nice to see some real world
results, not my synthetic tests :D

quoted

--Jesper

[0]
https://github.com/alobakin/linux/compare/idpf-libie-new~52...idpf-libie-new/

So it turns out keeping the workload in place while I update and reboot
the kernel is a Hard Problem. I'll put in some more effort and see if I
can get one of the workloads to stay still, but it'll be a somewhat
noisy test even if it works. So the following are synthetic tests
(neper) but on a real prod setup as far as container networking and
configuration is concerned.

I cherry-picked 586be610~1..ca22ac8e9de onto our 6.9-ish branch. Had to
skip some of the flag refactors b/c of conflicts - I didn't know the
code well enough to do fixups. So I had to apply this diff (FWIW not sure
the struct_size() here was right anyhow):

diff --git a/kernel/bpf/cpumap.c b/kernel/bpf/cpumap.c
index 089d19c62efe..359fbfaa43eb 100644
--- a/kernel/bpf/cpumap.c
+++ b/kernel/bpf/cpumap.c

@@ -110,7 +110,7 @@ static struct bpf_map *cpu_map_alloc(union bpf_attr *attr)
 	if (!cmap->cpu_map)
 		goto free_cmap;
 
-	dev = bpf_map_area_alloc(struct_size(dev, priv, 0), NUMA_NO_NODE);
+	dev = bpf_map_area_alloc(sizeof(*dev), NUMA_NO_NODE);
 	if (!dev)
 		goto free_cpu_map;

==== Baseline ===
	./tcp_rr -c -H $SERVER -p 50,90,99 -T4 -F8 -l30				./tcp_stream -c -H $SERVER -T8 -F16 -l30

	Transactions	Latency P50 (s)	Latency P90 (s)	Latency P99 (s)			Throughput (Mbit/s)
Run 1	2578189	        0.00008831	0.00010623	0.00013439		Run 1	15427.22
Run 2	2657923	        0.00008575	0.00010239	0.00012927		Run 2	15272.12
Run 3	2700402	        0.00008447	0.00010111	0.00013183		Run 3	14871.35
Run 4	2571739	        0.00008575	0.00011519	0.00013823		Run 4	15344.72
Run 5	2476427	        0.00008703	0.00013055	0.00016895		Run 5	15193.2
Average	2596936	        0.000086262	0.000111094	0.000140534		Average	15221.722

=== cpumap NAPI patches ===
	Transactions	Latency P50 (s)	Latency P90 (s)	Latency P99 (s)			Throughput (Mbit/s)
Run 1	2554598	        0.00008703	0.00011263	0.00013055		Run 1	17090.29
Run 2	2478905	        0.00009087	0.00011391	0.00014463		Run 2	16742.27
Run 3	2418599	        0.00009471	0.00011007	0.00014207		Run 3	17555.3
Run 4	2562463	        0.00008959	0.00010367	0.00013055		Run 4	17892.3
Run 5	2716551	        0.00008127	0.00010879	0.00013439		Run 5	17578.32
Average	2546223.2	0.000088694	0.000109814	0.000136438		Average	17371.696
Delta	-1.95%	        2.82%	        -1.15%	        -2.91%			        14.12%


So it looks like the GRO patches work quite well out of the box. It's
curious that tcp_rr transactions go down a bit, though. I don't have any
intuition around that.

Lemme know if you wanna change some stuff and get a rerun.

Thanks,
Daniel

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help