Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES)

[PATCH net-next v3 00/18] splice, net: Switch over users of sendpage() and remove it · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 02/18] net: Display info about MSG_SPLICE_PAGES memory handling in proc · David Howells <dhowells@redhat.com> · 2023-06-20
Re: [PATCH net-next v3 02/18] net: Display info about MSG_SPLICE_PAGES memory handling in proc · Paolo Abeni <pabeni@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 02/18] net: Display info about MSG_SPLICE_PAGES memory handling in proc · David Howells <dhowells@redhat.com> · 2023-06-23
[PATCH net-next v3 05/18] ceph: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 06/18] net: Use sendmsg(MSG_SPLICE_PAGES) not sendpage in skb_send_sock() · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 04/18] siw: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage to transmit · David Howells <dhowells@redhat.com> · 2023-06-20
RE: [PATCH net-next v3 04/18] siw: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage to transmit · Bernard Metzler <hidden> · 2023-06-21
[PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-20
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Jakub Kicinski <kuba@kernel.org> · 2023-06-22
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Alexander Duyck <hidden> · 2023-06-22
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-22
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Jakub Kicinski <kuba@kernel.org> · 2023-06-22
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-22
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Jakub Kicinski <kuba@kernel.org> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Paolo Abeni <pabeni@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Paolo Abeni <pabeni@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Paolo Abeni <pabeni@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · Paolo Abeni <pabeni@redhat.com> · 2023-06-23
Re: [PATCH net-next v3 01/18] net: Copy slab data for sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-23
[PATCH net-next v3 08/18] rds: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 11/18] nvme/target: Use sendmsg(MSG_SPLICE_PAGES) rather then sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 09/18] dlm: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 07/18] ceph: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage() · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 12/18] smc: Drop smc_sendpage() in favour of smc_sendmsg() + MSG_SPLICE_PAGES · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 03/18] tcp_bpf, smc, tls, espintcp: Reduce MSG_SENDPAGE_NOTLAST usage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 15/18] drdb: Send an entire bio in a single sendmsg · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 13/18] ocfs2: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage() · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 16/18] iscsi: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 10/18] nvme/host: Use sendmsg(MSG_SPLICE_PAGES) rather then sendpage · David Howells <dhowells@redhat.com> · 2023-06-20
Re: [PATCH net-next v3 10/18] nvme/host: Use sendmsg(MSG_SPLICE_PAGES) rather then sendpage · Sagi Grimberg <sagi@grimberg.me> · 2023-06-21
Re: [PATCH net-next v3 10/18] nvme/host: Use sendmsg(MSG_SPLICE_PAGES) rather then sendpage · David Howells <dhowells@redhat.com> · 2023-06-21
Re: [PATCH net-next v3 10/18] nvme/host: Use sendmsg(MSG_SPLICE_PAGES) rather then sendpage · Sagi Grimberg <sagi@grimberg.me> · 2023-06-21
[PATCH net-next v3 14/18] drbd: Use sendmsg(MSG_SPLICE_PAGES) rather than sendpage() · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 17/18] sock: Remove ->sendpage*() in favour of sendmsg(MSG_SPLICE_PAGES) · David Howells <dhowells@redhat.com> · 2023-06-20
[PATCH net-next v3 18/18] net: Kill MSG_SENDPAGE_NOTLAST · David Howells <dhowells@redhat.com> · 2023-06-20

From: Jakub Kicinski <kuba@kernel.org>
Date: 2023-06-22 20:28:42
Also in: linux-mm, lkml

On Thu, 22 Jun 2023 20:40:43 +0100 David Howells wrote:

quoted

How did that happen? I thought MSG_SPLICE_PAGES comes from former
sendpage users and sendpage can't operate on slab pages.

Some of my patches, take the siw one for example, now aggregate all the bits
that make up a message into a single sendmsg() call, including any protocol
header and trailer in the same bio_vec[] as the payload where before it would
have to do, say, sendmsg+sendpage+sendpage+...+sendpage+sendmsg.

Maybe it's just me but I'd prefer to keep the clear rule that splice
operates on pages not slab objects. SIW is the software / fake
implementation of RDMA, right? You couldn't have picked a less
important user :(

Paolo indicated that he'll take a look tomorrow, we'll see what he
thinks.

I'm trying to make it so that I make the minimum number of sendmsg calls
(ie. 1 where possible) and the loop that processes the data is inside of that.

The in-kernel users can be fixed to not use slab, and user space can't
feed us slab objects.

This offers the opportunity, at least in the future, to append slab data to an
already-existing private fragment in the skbuff.

Maybe we can get Eric to comment. The ability to identify "frag type"
seems cool indeed, but I haven't thought about using it to attach
slab objects.

quoted

The locking is to local_bh_disable(). Does the milliont^w new frag
allocator have any additional benefits?

It is shareable because it does locking.  Multiple sockets of multiple
protocols can share the pages it has reserved.  It drops the lock around calls
to the page allocator so that GFP_KERNEL/GFP_NOFS can be used with it.

Without this, the page fragment allocator would need to be per-socket, I
think, or be done further up the stack where the higher level drivers would
have to have a fragment bucket per whatever unit they use to deal with the
lack of locking.

There's also the per task frag which can be used under normal conditions
(sk_use_task_frag).

Doing it here makes cleanup simpler since I just transfer my ref on the
fragment to the skbuff frag list and it will automatically be cleaned up with
the skbuff.

Willy suggested that I just allocate a page for each thing I want to copy, but
I would rather not do that for, say, an 8-byte bit of protocol data.

TBH my intuition would also be get a full page and let the callers who
care about performance fix themselves. Assuming we want to let slab
objects in in the first place.

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help