Re: [net-next PATCH v2 8/8] net: Introduce SO_INCOMING_NAPI_ID


From: Eric Dumazet <edumazet@google.com>
Date: 2017-03-24 05:07:34
Also in: linux-api, lkml

On Thu, Mar 23, 2017 at 9:47 PM, Andy Lutomirski [off-list ref] wrote:

So don't we want queue id, not NAPI id?  Or am I still missing something?

But I'm also a bit confused as to the overall performance effect.
Suppose I have an rx queue that has its interrupt bound to cpu 0.  For
whatever reason (random chance if I'm hashing, for example), I end up
with the epoll caller on cpu 1.  Suppose further that cpus 0 and 1 are
on different NUMA nodes.

Now, let's suppose that I get lucky and *all* the packets are pulled
off the queue by epoll busy polling.  Life is great [1].  But suppose
that, due to a tiny hiccup or simply user code spending some cycles
processing those packets, an rx interrupt fires.  Now cpu 0 starts
pulling packets off the queue via NAPI, right?  So both NUMA nodes are
fighting over all the cachelines involved in servicing the queue *and*
the packets just got dequeued on the wrong NUMA node.

ISTM this would work better if the epoll busy polling could handle the
case where one epoll set polls sockets on different queues as long as
those queues are all owned by the same CPU.  Then user code could use
SO_INCOMING_CPU to sort out the sockets.

Of course you can do that already.

SO_REUSEPORT + an appropriate eBPF filter can select the best socket to
receive your packets, based on various SMP/NUMA affinities
(BPF_FUNC_get_smp_processor_id or BPF_FUNC_get_numa_node_id).

This new instruction is simply _allowing_ other schemes, based on
queue IDs, in the case where each NIC queue can be managed by a group
of cores (presumably on the same NUMA node).
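
For the queue-based schemes, user space could read the NAPI ID of a socket with getsockopt() and hand the socket to a worker accordingly. The sketch below is only an assumption about how an application might use the option introduced by this patch: incoming_napi_id and pick_worker are hypothetical names, the modulo policy is arbitrary, and the fallback value 56 for SO_INCOMING_NAPI_ID is the asm-generic one, which may differ on some architectures.

```c
#include <sys/socket.h>

#ifndef SO_INCOMING_NAPI_ID
#define SO_INCOMING_NAPI_ID 56	/* asm-generic value; an assumption on older headers */
#endif

/* Hypothetical helper: fetch the NAPI ID recorded for the socket.
 * Returns 0 on success, -1 with errno set on failure. */
static int incoming_napi_id(int fd, unsigned int *napi_id)
{
	socklen_t len = sizeof(*napi_id);

	return getsockopt(fd, SOL_SOCKET, SO_INCOMING_NAPI_ID, napi_id, &len);
}

/* Hypothetical policy: map a NAPI ID to one of nworkers threads.
 * An ID of 0 means no NAPI ID was recorded, so use worker 0. */
static unsigned int pick_worker(unsigned int napi_id, unsigned int nworkers)
{
	if (napi_id == 0 || nworkers == 0)
		return 0;
	return napi_id % nworkers;
}
```

A real dispatcher would more likely keep a table from NAPI ID to worker, so that each worker's epoll set holds sockets fed by a single queue.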

Am I missing something?

[1] Maybe.  How smart is direct cache access?  If it's smart enough,
it'll pre-populate node 0's LLC, which means that life isn't so great
after all.
