Re: [PATCH v3 0/5] vhost: optimize enqueue

From: Wang, Zhihong <hidden>
Date: 2016-08-23 02:15:48

Subject: Re: [PATCH v3 0/5] vhost: optimize enqueue

Hi Zhihong,

[...]

> The main optimization techniques are:
>
>  1. Reorder code to reduce CPU pipeline stall cycles.
>  2. Batch update the used ring for better efficiency.
>  3. Prefetch descriptor to hide cache latency.
>  4. Remove useless volatile attribute to allow compiler optimization.
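
To make technique 2 concrete, here is a minimal sketch of batching used-ring updates through a local shadow buffer. It is not the code from the series: the struct layouts, `RING_SIZE`, `shadow_add()` and `shadow_flush()` are hypothetical names chosen for illustration, and the GCC/Clang `__atomic_store_n` builtin stands in for whatever barrier the real vhost code uses.

```c
#include <assert.h>
#include <stdint.h>

#define RING_SIZE 8u   /* hypothetical ring size; assumed to be a power of two */

struct used_elem { uint32_t id; uint32_t len; };

/* Simplified stand-in for the used ring shared with the guest. */
struct used_ring {
    uint16_t idx;                      /* index published to the guest */
    struct used_elem ring[RING_SIZE];
};

/* Private staging area: entries are collected here first. */
struct shadow_used {
    struct used_elem elems[RING_SIZE];
    uint16_t count;
};

/* Record one completed descriptor chain locally; shared memory is untouched. */
static void shadow_add(struct shadow_used *s, uint32_t id, uint32_t len)
{
    assert(s->count < RING_SIZE);
    s->elems[s->count].id = id;
    s->elems[s->count].len = len;
    s->count++;
}

/* Copy all staged entries to the ring, then publish idx a single time. */
static void shadow_flush(struct shadow_used *s, struct used_ring *u)
{
    uint16_t base = u->idx;

    for (uint16_t i = 0; i < s->count; i++)
        u->ring[(uint16_t)(base + i) & (RING_SIZE - 1)] = s->elems[i];

    /* Release store: the entries become visible before the new index. */
    __atomic_store_n(&u->idx, (uint16_t)(base + s->count), __ATOMIC_RELEASE);
    s->count = 0;
}
```

The point of the sketch is that a burst of packets costs one index update instead of one per packet, so the cache line holding `idx` is written less often.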

Thanks for these details; they help in understanding where the perf
gain comes from.
I would suggest adding this information as comments in the code
where it makes sense. If a comment is more general, at least add it to
the commit message of the patch introducing it.
Indeed, adding it to the cover letter is fine, but the information is
lost as soon as the series is applied.

Hi Maxime,

I did add this info in the later optimization patches to explain each
optimization technique. The v1 was indeed hard to read.

You don't mention any figures, so I set up a benchmark on my side to
evaluate your series. It indeed shows an interesting performance gain.

My setup consists of one host running a guest.
The guest generates as many 64-byte packets as possible using
pktgen-dpdk. The host forwards received packets back to the guest
using testpmd on a vhost PMD interface. The guest's vCPUs are pinned to
physical CPUs.

Thanks for doing the test!

I didn't publish any numbers since the gain varies across platforms
and test setups.

In my phy-to-VM test on both IVB and HSW, where testpmd in the host
receives from the NIC and enqueues to the guest, the enqueue efficiency
(cycles per packet) of the v3 patch is 2.4x and 1.4x as fast as the
current code for mergeable on and mergeable off, respectively.
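
For readers unfamiliar with the metric, the sketch below shows one way to read "cycles per packet" and the "N times as fast" ratio. The helper names are hypothetical and any inputs passed to them are illustrative only, not measurements from these tests.

```c
#include <assert.h>
#include <stdint.h>

/* Average CPU cycles spent in the enqueue path per packet. */
static double cycles_per_packet(uint64_t cycles, uint64_t packets)
{
    assert(packets > 0);
    return (double)cycles / (double)packets;
}

/* "N times as fast": baseline cost divided by patched cost. */
static double speedup(double baseline_cpp, double patched_cpp)
{
    assert(patched_cpp > 0.0);
    return baseline_cpp / patched_cpp;
}
```

With made-up costs of 240 and 100 cycles per packet, `speedup(240.0, 100.0)` would be about 2.4, which is the shape of the mergeable-on figure quoted above.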

I tested it with and without your v1 patch, and with and without the
rx-mergeable feature turned on.
Results are the average of 8 runs of 60 seconds each:

Rx-Mergeable ON                                 :  7.72 Mpps
Rx-Mergeable ON  + "vhost: optimize enqueue" v1 :  9.19 Mpps
Rx-Mergeable OFF                                : 10.52 Mpps
Rx-Mergeable OFF + "vhost: optimize enqueue" v1 : 10.60 Mpps
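
As a quick way to read these figures, the relative gain can be computed as (after - before) / before. The helper below is a hypothetical illustration, not part of the benchmark setup.

```c
#include <assert.h>

/* Relative throughput gain in percent between two Mpps figures. */
static double gain_percent(double before_mpps, double after_mpps)
{
    assert(before_mpps > 0.0);
    return (after_mpps - before_mpps) / before_mpps * 100.0;
}
```

Applied to the numbers above, `gain_percent(7.72, 9.19)` is roughly 19% with rx-mergeable on, and `gain_percent(10.52, 10.60)` is under 1% with rx-mergeable off.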

Regards,
Maxime
