Thread (36 messages) 36 messages, 4 authors, 2017-08-05

Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue

From: Bart Van Assche <hidden>
Date: 2017-08-01 15:12:31
Also in: linux-scsi

On Tue, 2017-08-01 at 18:50 +0800, Ming Lei wrote:
On Tue, Aug 01, 2017 at 06:17:18PM +0800, Ming Lei wrote:
quoted
How can we get the accurate 'number of requests in progress' efficientl=
y?

Hello Ming,

How about counting the number of bits that have been set in the tag set?
I am aware that these bits can be set and/or cleared concurrently with the
dispatch code but that count is probably a good starting point.
quoted
From my test data of mq-deadline on lpfc, the performance is good,
please see it in cover letter.
=20
Forget to mention, ctx->list is one per-cpu list and the lock is percpu
lock, so changing to this way shouldn't be a performance issue.
Sorry but I don't consider this reply as sufficient. The latency of IB HCA'=
s
is significantly lower than that of any FC hardware I ran performance
measurements on myself. It's not because this patch series improves perform=
ance
for lpfc that that guarantees that there won't be a performance regression =
for
ib_srp, ib_iser or any other low-latency initiator driver for which q->dept=
h
!=3D 0.

Additionally, patch 03/14 most likely introduces a fairness problem. Should=
n't
blk_mq_dispatch_rq_from_ctxs() dequeue requests from the per-CPU queues in =
a
round-robin fashion instead of always starting at the first per-CPU queue i=
n
hctx->ctx_map?

Thanks,

Bart.
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help