Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API

[PATCH net-next v3 0/4] introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-09
[PATCH net-next v3 2/4] page_pool: unify frag_count handling in page_pool_is_last_frag() · Yunsheng Lin <hidden> · 2023-06-09
[PATCH net-next v3 1/4] page_pool: frag API support for 32-bit arch with 64-bit DMA · Yunsheng Lin <hidden> · 2023-06-09
Re: [PATCH net-next v3 1/4] page_pool: frag API support for 32-bit arch with 64-bit DMA · Jesper Dangaard Brouer <hidden> · 2023-06-09
Re: [PATCH net-next v3 1/4] page_pool: frag API support for 32-bit arch with 64-bit DMA · Yunsheng Lin <hidden> · 2023-06-10
Re: [PATCH net-next v3 1/4] page_pool: frag API support for 32-bit arch with 64-bit DMA · Jesper Dangaard Brouer <hidden> · 2023-06-11
[PATCH net-next v3 4/4] page_pool: remove PP_FLAG_PAGE_FRAG flag · Yunsheng Lin <hidden> · 2023-06-09
[PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-09
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-13
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-14
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-14
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-15
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-15
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Jesper Dangaard Brouer <hidden> · 2023-06-15
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Jesper Dangaard Brouer <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Jesper Dangaard Brouer <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Jakub Kicinski <kuba@kernel.org> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Lorenzo Bianconi <lorenzo@kernel.org> · 2023-06-18
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-20
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Lorenzo Bianconi <lorenzo@kernel.org> · 2023-06-20
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Jesper Dangaard Brouer <hidden> · 2023-06-21
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-24
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-17
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-16
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-17
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Alexander Duyck <hidden> · 2023-06-20
Re: [PATCH net-next v3 3/4] page_pool: introduce page_pool_alloc() API · Yunsheng Lin <hidden> · 2023-06-24

From: Alexander Duyck <hidden>
Date: 2023-06-20 16:19:46
Also in: bpf, lkml

On Sun, Jun 18, 2023 at 8:05 AM Lorenzo Bianconi [off-list ref] wrote:

[...]

quoted

Yes, precisely.
I distinctly remember what I tried to poke you and Eric on this approach
earlier, but I cannot find a link to that email.

I would really appreciate, if you Alex, could give the approach in
veth_convert_skb_to_xdp_buff() some review, as I believe that is a huge
potential for improvements that will lead to large performance
improvements. (I'm sure Maryam will be eager to help re-test performance
for her use-cases).

Well just looking at it the quick and dirty answer would be to look at
making use of something like page_frag_cache. I won't go into details
since it isn't too different from the frag allocator, but it is much
simpler since it is just doing reference count hacks instead of having
to do the extra overhead to keep the DMA mapping in place. The veth
would then just be sitting on at most an order 3 page while it is
waiting to fully consume it rather than waiting on a full pool of
pages.

Hi,

I did some experiments using page_frag_cache/page_frag_alloc() instead of
page_pools in a simple environment I used to test XDP for veth driver.
In particular, I allocate a new buffer in veth_convert_skb_to_xdp_buff() from
the page_frag_cache in order to copy the full skb in the new one, actually
"linearizing" the packet (since we know the original skb length).
I run an iperf TCP connection over a veth pair where the
remote device runs the xdp_rxq_info sample (available in the kernel source
tree, with action XDP_PASS):

TCP clietn -- v0 === v1 (xdp_rxq_info) -- TCP server

net-next (page_pool):
- MTU 1500B: ~  7.5 Gbps
- MTU 8000B: ~ 15.3 Gbps

net-next + page_frag_alloc:
- MTU 1500B: ~  8.4 Gbps
- MTU 8000B: ~ 14.7 Gbps

It seems there is no a clear "win" situation here (at least in this environment
and we this simple approach). Moreover:

For the 1500B packets it is a win, but for 8000B it looks like there
is a regression. Any idea what is causing it?

- can the linearization introduce any issue whenever we perform XDP_REDIRECT
  into a destination device?

It shouldn't. If it does it would probably point to an issue w/ the
destination driver rather than an issue with the code doing this.

- can the page_frag_cache introduce more memory fragmentation (IIRC we were
  experiencing this issue in mt76 before switching to page_pools).

I think it largely depends on where the packets are ending up. I know
this is the approach we are using for sockets, see
skb_page_frag_refill(). If nothing else, if you took a similar
approach to it you might be able to bypass the need for the
page_frag_cache itself, although you would likely still end up
allocating similar structures.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help