Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue

[PATCH net-next 0/3] udp: reduce cache pressure · Paolo Abeni <pabeni@redhat.com> · 2017-05-29
[PATCH net-next 1/3] net: factor out a helper to decrement the skb refcount · Paolo Abeni <pabeni@redhat.com> · 2017-05-29
Re: [PATCH net-next 1/3] net: factor out a helper to decrement the skb refcount · Eric Dumazet <hidden> · 2017-05-31
[PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-05-29
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Eric Dumazet <hidden> · 2017-05-31
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-06-01
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Eric Dumazet <hidden> · 2017-06-01
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-06-01
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Eric Dumazet <edumazet@google.com> · 2017-06-01
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-06-01
Re: [PATCH net-next 2/3] udp: avoid a cache miss on dequeue · Eric Dumazet <hidden> · 2017-06-01
[PATCH net-next 3/3] udp: try to avoid 2 cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-05-29
Re: [PATCH net-next 3/3] udp: try to avoid 2 cache miss on dequeue · Eric Dumazet <hidden> · 2017-05-31
Re: [PATCH net-next 3/3] udp: try to avoid 2 cache miss on dequeue · Paolo Abeni <pabeni@redhat.com> · 2017-06-01
[lkp-robot] [udp] bc0d3d0639: apachebench.requests_per_second -21% regression · kernel test robot <hidden> · 2017-06-07
Re: [PATCH net-next 0/3] udp: reduce cache pressure · David Miller <davem@davemloft.net> · 2017-05-31
Re: [PATCH net-next 0/3] udp: reduce cache pressure · Eric Dumazet <edumazet@google.com> · 2017-05-31

From: Paolo Abeni <pabeni@redhat.com>
Date: 2017-06-01 16:21:55

On Thu, 2017-06-01 at 08:58 -0700, Eric Dumazet wrote:

On Thu, 2017-06-01 at 12:39 +0200, Paolo Abeni wrote:

quoted

On Wed, 2017-05-31 at 10:00 -0700, Eric Dumazet wrote:

quoted

On Mon, 2017-05-29 at 17:27 +0200, Paolo Abeni wrote:

quoted

Since UDP no more uses sk->destructor, we can clear completely
the skb head state before enqueuing.

...

quoted

@@ -1739,6 +1740,9 @@ static int __udp_queue_rcv_skb(struct sock *sk, struct sk_buff *skb)
 		sk_mark_napi_id_once(sk, skb);
 	}
 
+	/* drop all pending head states; dst, nf and sk are dropped by caller */
+	secpath_reset(skb);
+

I wonder if using skb_release_head_state() would be more appropriate ?

Surely more descriptive and probably not more expensive since all
cache lines should be already hot at this point.

Thank you for reviewing this.

I would prefer not adding more code to the core, but I think we would
need something new, like:

skb_reset_head_state()
{
	skb_dst_drop(skb);
        secpath_reset(skb);
        nf_reset(skb);
	skb_orphan(skb);
}

because elsewhere the skb could be in inconsistent state: skb->sp !=
NULL but with its refcount is already decremented. WDYT?

I do not believe skb->sk is set anymore in UDP receive path.

If early demux sets skb->sk for a moment, skb_steal_sock() would set
skb->sk back to NULL

I'm sorry, I do not follow. I'm concerned about the secpath field (skb-

sp), which is the only one that can be not NULL in

__udp_queue_rcv_skb().

If the secpath is not NULL, calling there secpath_reset() (or the to-
be-introduced skb_reset_head_state()), we will properly release it and
we will clear the field, too.

Calling skb_release_head_state() in the same scenario, we release the
secpath, but we don't clear it. So if the packet is later dropped we
will get a double free, unless we add and use a specialized a
free_stateless_skb(), too.

Paolo

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help