Re: [PATCH net-next RFC WIP] Patch for XDP support for virtio_net

From: "Michael S. Tsirkin" <mst@redhat.com>
Date: 2016-11-03 04:11:51

On Wed, Nov 02, 2016 at 06:28:34PM -0700, Shrijeet Mukherjee wrote:

quoted

-----Original Message-----
From: Jesper Dangaard Brouer [mailto:brouer@redhat.com]
Sent: Wednesday, November 2, 2016 7:27 AM
To: Thomas Graf <tgraf@suug.ch>
Cc: Shrijeet Mukherjee <redacted>; Alexei Starovoitov
[off-list ref]; Jakub Kicinski [off-list ref]; John
Fastabend [off-list ref]; David Miller
[off-list ref]; alexander.duyck@gmail.com; mst@redhat.com;
shrijeet@gmail.com; tom@herbertland.com; netdev@vger.kernel.org;
Roopa Prabhu [off-list ref]; Nikolay Aleksandrov
[off-list ref]; brouer@redhat.com
Subject: Re: [PATCH net-next RFC WIP] Patch for XDP support for virtio_net

quoted

On Sat, 29 Oct 2016 13:25:14 +0200
Thomas Graf [off-list ref] wrote:

quoted

On 10/28/16 at 08:51pm, Shrijeet Mukherjee wrote:

quoted

Generally agree, but SRIOV NICs with multiple queues can end up in a
bad spot if each buffer was 4K, right? I see a specific page pool, to
be used by queues which are enabled for XDP, as the easiest-to-swing
solution; that way the memory overhead can be restricted to enabled
queues and shared access issues can be restricted to skb's using that
pool, no?

Yes, that is why I've been arguing so strongly for having the
flexibility to attach an XDP program per RX queue, as this only changes
the memory model for this one queue.

quoted

Isn't this clearly a must anyway? I may be missing something
fundamental here so please enlighten me :-)

If we dedicate a page per packet, that could translate to 14M*4K worth
of memory being mapped per second for just a 10G NIC under DoS attack.
How can one protect such as system? Is the assumption that we can
always drop such packets quickly enough before we start dropping
randomly due to memory pressure? If a handshake is required to
determine validity of a packet then that is going to be difficult.

Under DoS attacks you don't run out of memory, because a diverse set of
socket memory limits/accounting avoids that situation.  What does happen
is that the maximum achievable PPS rate is directly dependent on the
time you spend on each packet.  This use of CPU resources (and
hitting the memory-limit safeguards) pushes back on the speed at which
the driver processes the RX ring.  In effect, packets are dropped in the
NIC HW because the RX-ring queue is not emptied fast enough.

Given you don't control what HW drops, the attacker will "successfully"
cause your good traffic to be among the dropped packets.

This is where XDP changes the picture. If you can express (by eBPF) a
filter that can separate "bad" vs "good" traffic, then you can take back
control.  Almost like controlling what traffic the HW should drop.
Provided the cost of the XDP-eBPF filter plus serving regular traffic
does not use all of your CPU resources, you have overcome the attack.

--

Jesper,  John et al .. to make this a little concrete I am going to spin
up a v2 which has only bigbuffers mode enabled for xdp acceleration, all
other modes will reject the xdp ndo ..

Do we have agreement on that model ?

It will require all vhost implementations to start with mergeable
buffers disabled to get the xdp goodness, but that sounds like a
safe thing to do for now ..

It's ok for experimentation, but really after speaking with Alexei it's
clear to me that xdp should have a separate code path in the driver,
e.g. the separation between modes is something that does not
make sense for xdp.

The way I imagine it working:

- when XDP is attached, disable all LRO using
  VIRTIO_NET_CTRL_GUEST_OFFLOADS_SET (not used by the driver so far,
  designed to allow dynamic LRO control with ethtool)
- start adding page-sized buffers
- do something with non-page-sized buffers added previously - what
  exactly? copy I guess? What about LRO packets that are too large -
  can we drop or can we split them up?

I'm fine with disabling XDP for some configurations as the first step,
and we can add that support later.
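
To make the first step above a bit more concrete, here is a minimal
userspace sketch (not driver code) of the offloads value one might send
with VIRTIO_NET_CTRL_GUEST_OFFLOADS_SET once XDP is attached. The bit
numbers and the class/command values follow the virtio-net
specification and are redefined locally so the sketch stands alone. The
names XDP_LRO_MASK, struct guest_offloads_cmd, xdp_guest_offloads() and
xdp_build_offloads_cmd() are made up for illustration, and choosing
exactly these four bits as "all LRO" is an assumption. Endianness and
the actual control-virtqueue submission are left out.

```c
#include <assert.h>
#include <stdint.h>

/* Guest offload feature bits (numbers per the virtio-net spec). */
#define VIRTIO_NET_F_GUEST_CSUM  1
#define VIRTIO_NET_F_GUEST_TSO4  7
#define VIRTIO_NET_F_GUEST_TSO6  8
#define VIRTIO_NET_F_GUEST_ECN   9
#define VIRTIO_NET_F_GUEST_UFO   10

/* Control-queue class and command (values per the virtio-net spec). */
#define VIRTIO_NET_CTRL_GUEST_OFFLOADS      5
#define VIRTIO_NET_CTRL_GUEST_OFFLOADS_SET  0

/* Hypothetical: offloads that let the host deliver packets larger
 * than one MTU-sized frame, i.e. what XDP would want turned off. */
#define XDP_LRO_MASK ((1ULL << VIRTIO_NET_F_GUEST_TSO4) | \
                      (1ULL << VIRTIO_NET_F_GUEST_TSO6) | \
                      (1ULL << VIRTIO_NET_F_GUEST_ECN)  | \
                      (1ULL << VIRTIO_NET_F_GUEST_UFO))

/* Hypothetical in-memory view of the control command. */
struct guest_offloads_cmd {
	uint8_t  cls;       /* VIRTIO_NET_CTRL_GUEST_OFFLOADS */
	uint8_t  cmd;       /* VIRTIO_NET_CTRL_GUEST_OFFLOADS_SET */
	uint64_t offloads;  /* offloads that stay enabled */
};

/* Keep every negotiated guest offload except the LRO-style ones. */
uint64_t xdp_guest_offloads(uint64_t negotiated)
{
	return negotiated & ~XDP_LRO_MASK;
}

/* Build the command to issue when an XDP program is attached. */
struct guest_offloads_cmd xdp_build_offloads_cmd(uint64_t negotiated)
{
	struct guest_offloads_cmd c;

	c.cls = VIRTIO_NET_CTRL_GUEST_OFFLOADS;
	c.cmd = VIRTIO_NET_CTRL_GUEST_OFFLOADS_SET;
	c.offloads = xdp_guest_offloads(negotiated);
	/* No LRO-style offload may survive. */
	assert((c.offloads & XDP_LRO_MASK) == 0);
	return c;
}
```

Detaching the program would presumably send the originally negotiated
set again, which is why the sketch takes it as input instead of
hard-coding a value.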

Ideas about mergeable buffers (optional):

At the moment mergeable buffers can't be disabled dynamically.
They do bring a small benefit for XDP if host MTU is large (see below)
and aren't hard to support:
- if header is by itself skip 1st page
- otherwise copy all data into first page
and it's nicer not to add random limitations that require guest reboot.
It might make sense to add a command that disables/enables
mergeable buffers dynamically but that's for newer hosts.

The spec does not require it, but in practice most hosts put all data
in the 1st page or all in the 2nd page, so the copy will be a no-op
for these cases.

Large host MTU - newer hosts report the host MTU, older ones don't.
Using mergeable buffers we can at least detect this case
(and then what? drop I guess).
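
The mergeable-buffer handling described above (skip the 1st page when
it holds only the header, otherwise copy everything into the 1st page,
and drop what cannot fit) could be sketched roughly as below. This is a
userspace illustration, not virtio_net code: struct rx_seg,
XDP_PAGE_SIZE and xdp_linearize() are hypothetical names, the first
buffer is assumed to have a full page of room behind it, and returning
NULL stands in for "drop".

```c
#include <stddef.h>
#include <string.h>

#define XDP_PAGE_SIZE 4096u  /* assumed size of each receive buffer */

/* Hypothetical view of one buffer of a mergeable-buffer packet. */
struct rx_seg {
	unsigned char *data;
	size_t len;
};

/*
 * Return a pointer to the packet as one contiguous region and store
 * its length in *out_len. Return NULL if the packet cannot be made to
 * fit in a single page (the caller would drop it).
 */
unsigned char *xdp_linearize(const struct rx_seg *segs, size_t nsegs,
			     size_t hdr_len, size_t *out_len)
{
	size_t total, off, i;

	if (nsegs == 0 || segs[0].len < hdr_len ||
	    segs[0].len > XDP_PAGE_SIZE)
		return NULL;

	/* Header by itself, all data in the 2nd page: skip the 1st
	 * page and use the 2nd one directly, no copy needed. */
	if (nsegs == 2 && segs[0].len == hdr_len) {
		*out_len = segs[1].len;
		return segs[1].data;
	}

	/* Otherwise everything has to fit into the 1st page. */
	total = segs[0].len;
	for (i = 1; i < nsegs; i++) {
		if (segs[i].len > XDP_PAGE_SIZE - total)
			return NULL;  /* e.g. large host MTU: drop */
		total += segs[i].len;
	}

	/* Append the remaining buffers after the data already there.
	 * With a single buffer this loop does nothing. */
	off = segs[0].len;
	for (i = 1; i < nsegs; i++) {
		memcpy(segs[0].data + off, segs[i].data, segs[i].len);
		off += segs[i].len;
	}

	*out_len = total - hdr_len;
	return segs[0].data + hdr_len;
}
```

For hosts that put all data in the 1st page or all in the 2nd page, the
function returns without copying anything, which matches the no-op case
mentioned above.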




-- 
MST
