Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue

[PATCH 00/14] blk-mq-sched: fix SCSI-MQ performance regression · Ming Lei <hidden> · 2017-07-31
[PATCH 10/14] blk-mq-sched: introduce helpers for query, change busy state · Ming Lei <hidden> · 2017-07-31
[PATCH 00/14] blk-mq-sched: fix SCSI-MQ performance regression · Ming Lei <hidden> · 2017-07-31
[PATCH 01/14] blk-mq-sched: fix scheduler bad performance · Ming Lei <hidden> · 2017-07-31
Re: [PATCH 01/14] blk-mq-sched: fix scheduler bad performance · Bart Van Assche <hidden> · 2017-07-31
[PATCH 02/14] blk-mq: rename flush_busy_ctx_data as ctx_iter_data · Ming Lei <hidden> · 2017-07-31
Re: [PATCH 02/14] blk-mq: rename flush_busy_ctx_data as ctx_iter_data · Bart Van Assche <hidden> · 2017-07-31
[PATCH 03/14] blk-mq: introduce blk_mq_dispatch_rq_from_ctxs() · Ming Lei <hidden> · 2017-07-31
Re: [PATCH 03/14] blk-mq: introduce blk_mq_dispatch_rq_from_ctxs() · Bart Van Assche <hidden> · 2017-07-31
Re: [PATCH 03/14] blk-mq: introduce blk_mq_dispatch_rq_from_ctxs() · Ming Lei <hidden> · 2017-08-01
Re: [PATCH 03/14] blk-mq: introduce blk_mq_dispatch_rq_from_ctxs() · kbuild test robot <hidden> · 2017-08-02
[PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-07-31
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Bart Van Assche <hidden> · 2017-07-31
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-08-01
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-08-01
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Bart Van Assche <hidden> · 2017-08-01
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-08-02
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Bart Van Assche <hidden> · 2017-08-03
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-08-03
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Bart Van Assche <hidden> · 2017-08-03
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · "hch@infradead.org" <hch@infradead.org> · 2017-08-05
Re: [PATCH 04/14] blk-mq-sched: improve dispatching from sw queue · Ming Lei <hidden> · 2017-08-05
[PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Ming Lei <hidden> · 2017-07-31
Re: [PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Bart Van Assche <hidden> · 2017-07-31
Re: [PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Ming Lei <hidden> · 2017-08-01
Re: [PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Bart Van Assche <hidden> · 2017-08-01
Re: [PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Ming Lei <hidden> · 2017-08-02
Re: [PATCH 05/14] blk-mq-sched: don't dequeue request until all in ->dispatch are flushed · Bart Van Assche <hidden> · 2017-08-03
[PATCH 06/14] blk-mq-sched: introduce blk_mq_sched_queue_depth() · Ming Lei <hidden> · 2017-07-31
[PATCH 07/14] blk-mq-sched: use q->queue_depth as hint for q->nr_requests · Ming Lei <hidden> · 2017-07-31
[PATCH 08/14] blk-mq: introduce BLK_MQ_F_SHARED_DEPTH · Ming Lei <hidden> · 2017-07-31
[PATCH 09/14] blk-mq-sched: cleanup blk_mq_sched_dispatch_requests() · Ming Lei <hidden> · 2017-07-31
[PATCH 11/14] blk-mq: introduce helpers for operating ->dispatch list · Ming Lei <hidden> · 2017-07-31
[PATCH 12/14] blk-mq: introduce pointers to dispatch lock & list · Ming Lei <hidden> · 2017-07-31
[PATCH 13/14] blk-mq: pass 'request_queue *' to several helpers of operating BUSY · Ming Lei <hidden> · 2017-07-31
[PATCH 14/14] blk-mq-sched: improve IO scheduling on SCSI devcie · Ming Lei <hidden> · 2017-07-31

From: Ming Lei <hidden>
Date: 2017-08-05 13:40:21
Also in: linux-scsi

On Thu, Aug 03, 2017 at 05:33:13PM +0000, Bart Van Assche wrote:

On Thu, 2017-08-03 at 11:13 +0800, Ming Lei wrote:

quoted

On Thu, Aug 03, 2017 at 01:35:29AM +0000, Bart Van Assche wrote:

quoted

On Wed, 2017-08-02 at 11:31 +0800, Ming Lei wrote:

quoted

On Tue, Aug 01, 2017 at 03:11:42PM +0000, Bart Van Assche wrote:

quoted

On Tue, 2017-08-01 at 18:50 +0800, Ming Lei wrote:

quoted

On Tue, Aug 01, 2017 at 06:17:18PM +0800, Ming Lei wrote:

quoted

How can we get the accurate 'number of requests in progress' efficiently?

Hello Ming,

How about counting the number of bits that have been set in the tag set?
I am aware that these bits can be set and/or cleared concurrently with the
dispatch code but that count is probably a good starting point.

It has to be atomic_t, which is too too heavy for us, please see the report:

	http://marc.info/?t=149868448400003&r=1&w=2

Both Jens and I want to kill hd_struct.in_flight, but looks still no
good way.

Hello Ming,

Sorry but I disagree that a new atomic variable should be added to keep track
of the number of busy requests. Counting the number of bits that are set in
the tag set should be good enough in this context.

That won't work because the tag set is host wide and shared by all LUNs.

Hello Ming,

Are you aware that the SCSI core already keeps track of the number of busy requests
per LUN? See also the device_busy member of struct scsi_device. How about giving the
block layer core access in some way to that counter?

Yes, I know that.

Last time I mentioned it to Christoph that this counter can be used for
implementing Runtime PM for avoiding to introduce one new counter to 
account pending I/O.

But for this purpose(estimating how many requests to dequeue from hctxs),
it isn't a good idea:

1) strictly speaking, atomic counter isn't enough, and lock 
is needed, because we need to make sure that the counter can't
be changed during dequeuing requests, so exporting the counter
to block won't work

2) even though you may think it is just for estimating, and
not use a lock, it isn't good too, because for some SCSI devices,
q->queue_depth is very small, both qla2xxx and lfpc's .cmd_perf_lun
is 3. So it can be very inaccurate since it is normal to dequeue
requests from all hctx at the same time.

Also I have posted V2 today, from the test result on SRP, looks
it is good to dequeue one request one time, so I suggest that we
follow mq scheduler's way to dequeue request(pick up one in one time)
for blk-mq 'none' in this patchset. We may consider to improve
it in future if there is better & mature idea.


Thanks,
Ming

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help