Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with... | netdev

[PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 2/9] xen-netback: Minor refactoring of netback code · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Zoltan Kiss <hidden> · 2014-03-06
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Ian Campbell <hidden> · 2014-03-13
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Zoltan Kiss <hidden> · 2014-03-13
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Ian Campbell <hidden> · 2014-03-13
Re: [Xen-devel] [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · David Vrabel <hidden> · 2014-03-13
Re: [Xen-devel] [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Ian Campbell <hidden> · 2014-03-13
Re: [Xen-devel] [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · David Vrabel <hidden> · 2014-03-13
Re: [Xen-devel] [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Wei Liu <hidden> · 2014-03-13
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Zoltan Kiss <hidden> · 2014-03-13
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Ian Campbell <hidden> · 2014-03-13
Re: [PATCH net-next v7 4/9] xen-netback: Introduce TX grant mapping · Zoltan Kiss <hidden> · 2014-03-13
[PATCH net-next v7 1/9] xen-netback: Use skb->cb for pending_idx · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 7/9] xen-netback: Handle guests with too many frags · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Zoltan Kiss <hidden> · 2014-03-06
Re: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Zoltan Kiss <hidden> · 2014-03-19
RE: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Paul Durrant <hidden> · 2014-03-20
Re: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Wei Liu <hidden> · 2014-03-20
RE: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Paul Durrant <hidden> · 2014-03-20
Re: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Wei Liu <hidden> · 2014-03-20
Re: [PATCH net-next v7 9/9] xen-netback: Aggregate TX unmap operations · Zoltan Kiss <hidden> · 2014-03-20
[PATCH net-next v7 8/9] xen-netback: Timeout packets in RX path · Zoltan Kiss <hidden> · 2014-03-06
Re: [PATCH net-next v7 8/9] xen-netback: Timeout packets in RX path · Ian Campbell <hidden> · 2014-03-13
[PATCH net-next v7 6/9] xen-netback: Add stat counters for zerocopy · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 5/9] xen-netback: Remove old TX grant copy definitons and fix indentations · Zoltan Kiss <hidden> · 2014-03-06
[PATCH net-next v7 3/9] xen-netback: Handle foreign mapped pages on the guest RX path · Zoltan Kiss <hidden> · 2014-03-06
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · David Miller <davem@davemloft.net> · 2014-03-07
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Zoltan Kiss <hidden> · 2014-03-08
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · David Miller <davem@davemloft.net> · 2014-03-08
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Wei Liu <hidden> · 2014-03-10
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Ian Campbell <hidden> · 2014-03-12
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Zoltan Kiss <hidden> · 2014-03-12
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Ian Campbell <hidden> · 2014-03-13
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Ian Campbell <hidden> · 2014-03-13
Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy · Zoltan Kiss <hidden> · 2014-03-13

Re: [PATCH net-next v7 0/9] xen-netback: TX grant mapping with SKBTX_DEV_ZEROCOPY instead of copy

From: Zoltan Kiss <hidden>
Date: 2014-03-13 18:23:10
Also in: lkml

On 13/03/14 10:08, Ian Campbell wrote:

> On Thu, 2014-03-06 at 21:48 +0000, Zoltan Kiss wrote:
>
>> A long-known problem of the upstream netback implementation is that on the TX
>> path (from guest to Dom0) it copies the whole packet from guest memory into
>> Dom0. That simply became a bottleneck with 10Gb NICs, and generally it's a
>> huge performance penalty. The classic kernel version of netback used grant
>> mapping, and to get notified when the page can be unmapped, it used page
>> destructors. Unfortunately that destructor is not an upstreamable solution.
>> Ian Campbell's skb fragment destructor patch series [1] tried to solve this
>> problem, however it seems to be very invasive on the network stack's code,
>> and therefore hasn't progressed very well.
>> This patch series uses SKBTX_DEV_ZEROCOPY flags to tell the stack it needs to
>> know when the skb is freed up. That is the way KVM solved the same problem,
>> and based on my initial tests it can do the same for us. Avoiding the extra
>> copy boosted TX throughput from 6.8 Gbps to 7.9 (I used a slower AMD
>> Interlagos box, both Dom0 and guest on upstream kernel, on the same NUMA node,
>> running iperf 2.0.5, and the remote end was a bare metal box on the same 10Gb
>> switch)
>
> Do you have any other numbers? e.g. for a modern Intel or AMD system? A
> slower box is likely to make the difference between copy and map larger,
> whereas modern Intel for example is supposed to be very good at copying.

The performance team made a lot of measurements; I've added Marcus to
comment on that.
With the latest version and a tip net-next kernel I could even see ~9.3
Gbps peak throughput on the same AMD box, which is the practical maximum
for 10G cards. However, with older guests I couldn't reach that. A lot
depends on netfront and the TCP stack, e.g. the tcp_limit_output_bytes
sysctl can cause an artificial cap.
I guess the perf team now has 40 Gbps NICs; it would be interesting to see
how this performs there.
I just checked the intrahost guest-to-guest throughput with 2 upstream
kernels, and I could get 5.6-5.8 Gbps at most.

>> Based on my investigations the packet only gets copied if it is delivered to
>> the Dom0 IP stack through deliver_skb, which is due to this [2] patch. This affects
>> DomU->Dom0 IP traffic and when Dom0 does routing/NAT for the guest. That's a bit
>> unfortunate, but luckily it doesn't cause a major regression for this use case.
>
> Numbers?

I've checked that back in November:

https://lkml.org/lkml/2013/11/5/288

Originally it was 5.4, versus 5.2 with my patch. I've checked DomU to
Dom0 iperf again, and it is still about the same with my series.

Zoli
